Di Huang
53
Papers
356
Total Citations
Papers (53)
MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers
ICLR 2025arXiv
101
citations
InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
CVPR 2024
81
citations
GVGEN: Text-to-3D Generation with Volumetric Representation
ECCV 2024arXiv
51
citations
Beyond 3DMM Space: Towards Fine-grained 3D Face Reconstruction
ECCV 2020
35
citations
Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models
CVPR 2025
26
citations
ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems
CVPR 2025
15
citations
Towards Training-free Anomaly Detection with Vision and Language Foundation Models
CVPR 2025
10
citations
ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction
ICLR 2025arXiv
9
citations
InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instruct
AAAI 2025
9
citations
Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images
AAAI 2025
8
citations
Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction
ICLR 2025arXiv
4
citations
Constraint-Aware Feature Learning for Parametric Point Cloud
ICCV 2025arXiv
3
citations
Progressive Parameter Efficient Transfer Learning for Semantic Segmentation
ICLR 2025
2
citations
ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning
ICCV 2025
1
citations
GigaGS: 3D Gaussian Based Planar Representation for Large-Scene Surface Reconstruction
AAAI 2025
1
citations
Cross-domain Object Detection through Coarse-to-Fine Feature Adaptation
CVPR 2020arXiv
0
citations
ABPN: Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Photo
CVPR 2022
0
citations
Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection
CVPR 2022arXiv
0
citations
ImFace: A Nonlinear 3D Morphable Face Model With Implicit Neural Representations
CVPR 2022arXiv
0
citations
CAT-Det: Contrastively Augmented Transformer for Multi-Modal 3D Object Detection
CVPR 2022
0
citations
Entropy-Based Active Learning for Object Detection With Progressive Diversity Constraint
CVPR 2022arXiv
0
citations
OcTr: Octree-Based Transformer for 3D Object Detection
CVPR 2023arXiv
0
citations
NeuFace: Realistic 3D Neural Face Rendering From Multi-View Images
CVPR 2023arXiv
0
citations
Adaptive Sparse Convolutional Networks With Global Context Enhancement for Faster Object Detection on Drone Images
CVPR 2023arXiv
0
citations
PR-GCN: A Deep Graph Convolutional Network With Point Refinement for 6D Pose Estimation
ICCV 2021
0
citations
Image Inpainting via Conditional Texture and Structure Dual Generation
ICCV 2021arXiv
0
citations
Unilaterally Aggregated Contrastive Learning with Hierarchical Augmentation for Anomaly Detection
ICCV 2023arXiv
0
citations
Denoising Diffusion Autoencoders are Unified Self-supervised Learners
ICCV 2023arXiv
0
citations
DR-Tune: Improving Fine-tuning of Pretrained Visual Models by Distribution Regularization with Semantic Calibration
ICCV 2023
0
citations
Multi-Scale Positive Sample Refinement for Few-Shot Object Detection
ECCV 2020
0
citations
Improving Object Detection with Selective Self-Supervised Self-Training
ECCV 2020
0
citations
Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw Puzzles
ECCV 2022
0
citations
Motion Sensitive Contrastive Learning for Self-Supervised Video Representation
ECCV 2022
0
citations
Ponder: Point Cloud Pre-training via Neural Rendering
ICCV 2023arXiv
0
citations
APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers
CVPR 2025
0
citations
CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization
CVPR 2025
0
citations
QiMeng-NeuComBack: Self-Evolving Translation from IR to Assembly Code
NeurIPS 2025
0
citations
Unveiling the Knowledge of CLIP for Training-Free Open-Vocabulary Semantic Segmentation
AAAI 2025
0
citations
3D²-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling
AAAI 2025
0
citations
Hypothesis, Verification, and Induction: Grounding Large Language Models with Self-Driven Skill Learning
AAAI 2024
0
citations
Emergent Communication for Numerical Concepts Generalization
AAAI 2024
0
citations
UniPAD: A Universal Pre-training Paradigm for Autonomous Driving
CVPR 2024
0
citations
Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge
CVPR 2024
0
citations
FiT: Flexible Vision Transformer for Diffusion Model
ICML 2024
0
citations
Learning Face Age Progression: A Pyramid Architecture of GANs
CVPR 2018arXiv
0
citations
Led3D: A Lightweight and Efficient Deep Approach to Recognizing Low-Quality 3D Faces
CVPR 2019
0
citations
Adaptive NMS: Refining Pedestrian Detection in a Crowd
CVPR 2019
0
citations
Fixed-Point Back-Propagation Training
CVPR 2020
0
citations
OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models
NeurIPS 2022
0
citations
Seeing is not always believing: Benchmarking Human and Model Perception of AI-Generated Images
NeurIPS 2023
0
citations
Compressed Video Prompt Tuning
NeurIPS 2023
0
citations
Emergent Communication for Rules Reasoning
NeurIPS 2023
0
citations
ANPL: Towards Natural Programming with Interactive Decomposition
NeurIPS 2023
0
citations