Di Huang
24
Papers
321
Total Citations
Papers (24)
MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers
ICLR 2025 (arXiv)
101
citations
InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
CVPR 2024
81
citations
GVGEN: Text-to-3D Generation with Volumetric Representation
ECCV 2024 (arXiv)
51
citations
Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models
CVPR 2025
26
citations
ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems
CVPR 2025
15
citations
Towards Training-free Anomaly Detection with Vision and Language Foundation Models
CVPR 2025
10
citations
ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction
ICLR 2025 (arXiv)
9
citations
InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instruct
AAAI 2025
9
citations
Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images
AAAI 2025
8
citations
Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction
ICLR 2025 (arXiv)
4
citations
Constraint-Aware Feature Learning for Parametric Point Cloud
ICCV 2025 (arXiv)
3
citations
Progressive Parameter Efficient Transfer Learning for Semantic Segmentation
ICLR 2025
2
citations
ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning
ICCV 2025
1
citations
GigaGS: 3D Gaussian Based Planar Representation for Large-Scene Surface Reconstruction
AAAI 2025
1
citations
3D²-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling
AAAI 2025
0
citations
CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization
CVPR 2025
0
citations
Hypothesis, Verification, and Induction: Grounding Large Language Models with Self-Driven Skill Learning
AAAI 2024
0
citations
Emergent Communication for Numerical Concepts Generalization
AAAI 2024
0
citations
UniPAD: A Universal Pre-training Paradigm for Autonomous Driving
CVPR 2024
0
citations
Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge
CVPR 2024
0
citations
APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers
CVPR 2025
0
citations
QiMeng-NeuComBack: Self-Evolving Translation from IR to Assembly Code
NeurIPS 2025
0
citations
FiT: Flexible Vision Transformer for Diffusion Model
ICML 2024
0
citations
Unveiling the Knowledge of CLIP for Training-Free Open-Vocabulary Semantic Segmentation
AAAI 2025
0
citations