Di Huang

53
Papers
356
Total Citations

Papers (53)

MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers

ICLR 2025arXiv
101
citations

InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization

CVPR 2024
81
citations

GVGEN: Text-to-3D Generation with Volumetric Representation

ECCV 2024arXiv
51
citations

Beyond 3DMM Space: Towards Fine-grained 3D Face Reconstruction

ECCV 2020
35
citations

Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models

CVPR 2025
26
citations

ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems

CVPR 2025
15
citations

Towards Training-free Anomaly Detection with Vision and Language Foundation Models

CVPR 2025
10
citations

ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction

ICLR 2025arXiv
9
citations

InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instruct

AAAI 2025
9
citations

Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images

AAAI 2025
8
citations

Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction

ICLR 2025arXiv
4
citations

Constraint-Aware Feature Learning for Parametric Point Cloud

ICCV 2025arXiv
3
citations

Progressive Parameter Efficient Transfer Learning for Semantic Segmentation

ICLR 2025
2
citations

ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning

ICCV 2025
1
citations

GigaGS: 3D Gaussian Based Planar Representation for Large-Scene Surface Reconstruction

AAAI 2025
1
citations

Cross-domain Object Detection through Coarse-to-Fine Feature Adaptation

CVPR 2020arXiv
0
citations

ABPN: Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Photo

CVPR 2022
0
citations

Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection

CVPR 2022arXiv
0
citations

ImFace: A Nonlinear 3D Morphable Face Model With Implicit Neural Representations

CVPR 2022arXiv
0
citations

CAT-Det: Contrastively Augmented Transformer for Multi-Modal 3D Object Detection

CVPR 2022
0
citations

Entropy-Based Active Learning for Object Detection With Progressive Diversity Constraint

CVPR 2022arXiv
0
citations

OcTr: Octree-Based Transformer for 3D Object Detection

CVPR 2023arXiv
0
citations

NeuFace: Realistic 3D Neural Face Rendering From Multi-View Images

CVPR 2023arXiv
0
citations

Adaptive Sparse Convolutional Networks With Global Context Enhancement for Faster Object Detection on Drone Images

CVPR 2023arXiv
0
citations

PR-GCN: A Deep Graph Convolutional Network With Point Refinement for 6D Pose Estimation

ICCV 2021
0
citations

Image Inpainting via Conditional Texture and Structure Dual Generation

ICCV 2021arXiv
0
citations

Unilaterally Aggregated Contrastive Learning with Hierarchical Augmentation for Anomaly Detection

ICCV 2023arXiv
0
citations

Denoising Diffusion Autoencoders are Unified Self-supervised Learners

ICCV 2023arXiv
0
citations

DR-Tune: Improving Fine-tuning of Pretrained Visual Models by Distribution Regularization with Semantic Calibration

ICCV 2023
0
citations

Multi-Scale Positive Sample Refinement for Few-Shot Object Detection

ECCV 2020
0
citations

Improving Object Detection with Selective Self-Supervised Self-Training

ECCV 2020
0
citations

Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw Puzzles

ECCV 2022
0
citations

Motion Sensitive Contrastive Learning for Self-Supervised Video Representation

ECCV 2022
0
citations

Ponder: Point Cloud Pre-training via Neural Rendering

ICCV 2023arXiv
0
citations

APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers

CVPR 2025
0
citations

CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization

CVPR 2025
0
citations

QiMeng-NeuComBack: Self-Evolving Translation from IR to Assembly Code

NeurIPS 2025
0
citations

Unveiling the Knowledge of CLIP for Training-Free Open-Vocabulary Semantic Segmentation

AAAI 2025
0
citations

3D²-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling

AAAI 2025
0
citations

Hypothesis, Verification, and Induction: Grounding Large Language Models with Self-Driven Skill Learning

AAAI 2024
0
citations

Emergent Communication for Numerical Concepts Generalization

AAAI 2024
0
citations

UniPAD: A Universal Pre-training Paradigm for Autonomous Driving

CVPR 2024
0
citations

Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge

CVPR 2024
0
citations

FiT: Flexible Vision Transformer for Diffusion Model

ICML 2024
0
citations

Learning Face Age Progression: A Pyramid Architecture of GANs

CVPR 2018arXiv
0
citations

Led3D: A Lightweight and Efficient Deep Approach to Recognizing Low-Quality 3D Faces

CVPR 2019
0
citations

Adaptive NMS: Refining Pedestrian Detection in a Crowd

CVPR 2019
0
citations

Fixed-Point Back-Propagation Training

CVPR 2020
0
citations

OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models

NeurIPS 2022
0
citations

Seeing is not always believing: Benchmarking Human and Model Perception of AI-Generated Images

NeurIPS 2023
0
citations

Compressed Video Prompt Tuning

NeurIPS 2023
0
citations

Emergent Communication for Rules Reasoning

NeurIPS 2023
0
citations

ANPL: Towards Natural Programming with Interactive Decomposition

NeurIPS 2023
0
citations