Di Huang

24
Papers
321
Total Citations

Papers (24)

MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers

ICLR 2025arXiv
101
citations

InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization

CVPR 2024
81
citations

GVGEN: Text-to-3D Generation with Volumetric Representation

ECCV 2024arXiv
51
citations

Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models

CVPR 2025
26
citations

ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems

CVPR 2025
15
citations

Towards Training-free Anomaly Detection with Vision and Language Foundation Models

CVPR 2025
10
citations

ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction

ICLR 2025arXiv
9
citations

InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instruct

AAAI 2025
9
citations

Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images

AAAI 2025
8
citations

Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction

ICLR 2025arXiv
4
citations

Constraint-Aware Feature Learning for Parametric Point Cloud

ICCV 2025arXiv
3
citations

Progressive Parameter Efficient Transfer Learning for Semantic Segmentation

ICLR 2025
2
citations

ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning

ICCV 2025
1
citations

GigaGS: 3D Gaussian Based Planar Representation for Large-Scene Surface Reconstruction

AAAI 2025
1
citations

3D²-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling

AAAI 2025
0
citations

CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization

CVPR 2025
0
citations

Hypothesis, Verification, and Induction: Grounding Large Language Models with Self-Driven Skill Learning

AAAI 2024
0
citations

Emergent Communication for Numerical Concepts Generalization

AAAI 2024
0
citations

UniPAD: A Universal Pre-training Paradigm for Autonomous Driving

CVPR 2024
0
citations

Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge

CVPR 2024
0
citations

APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers

CVPR 2025
0
citations

QiMeng-NeuComBack: Self-Evolving Translation from IR to Assembly Code

NeurIPS 2025
0
citations

FiT: Flexible Vision Transformer for Diffusion Model

ICML 2024
0
citations

Unveiling the Knowledge of CLIP for Training-Free Open-Vocabulary Semantic Segmentation

AAAI 2025
0
citations