Zhenguo Li
74 Papers
536 Total Citations
Papers (74)
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model
ICLR 2025
169
citations
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
ICLR 2025
74
citations
Accelerating Diffusion Sampling with Optimized Time Steps
CVPR 2024
51
citations
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
CVPR 2024
45
citations
Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis
ICLR 2024
44
citations
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025
44
citations
MagicDrive-V2: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control
ICCV 2025
44
citations
DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
CVPR 2024
39
citations
Implicit Search via Discrete Diffusion: A Study on Chess
ICLR 2025
13
citations
CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs
CVPR 2024
13
citations
Rethinking Performance Estimation in Neural Architecture Search
CVPR 2020
0
citations
SP-NAS: Serial-to-Parallel Backbone Search for Object Detection
CVPR 2020
0
citations
Boosting Few-Shot Learning With Adaptive Margin Loss
CVPR 2020
0
citations
iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression
CVPR 2021
0
citations
TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search
CVPR 2021
0
citations
Transformation Invariant Few-Shot Object Detection
CVPR 2021
0
citations
ORDisCo: Effective and Efficient Usage of Incremental Unlabeled Data for Semi-Supervised Continual Learning
CVPR 2021
0
citations
Joint-DetNAS: Upgrade Your Detector With NAS, Pruning and Dynamic Distillation
CVPR 2021
0
citations
Adversarial Invariant Learning
CVPR 2021
0
citations
Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search
CVPR 2022
0
citations
ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-Wise Semantic Alignment and Generation
CVPR 2022
0
citations
Long-Tail Recognition via Compositional Knowledge Transfer
CVPR 2022
0
citations
Semi-Supervised Object Detection via Multi-Instance Alignment With Global Class Prototypes
CVPR 2022
0
citations
OoD-Bench: Quantifying and Understanding Two Dimensions of Out-of-Distribution Generalization
CVPR 2022
0
citations
PILC: Practical Image Lossless Compression With an End-to-End GPU Oriented Neural Framework
CVPR 2022
0
citations
Mixed Autoencoder for Self-Supervised Visual Representation Learning
CVPR 2023
0
citations
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-Training via Word-Region Alignment
CVPR 2023
0
citations
ContraNeRF: Generalizable Neural Radiance Fields for Synthetic-to-Real Novel View Synthesis via Contrastive Learning
CVPR 2023
0
citations
Auto-FPN: Automatic Network Architecture Adaptation for Object Detection Beyond Classification
ICCV 2019
0
citations
G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-Guided Feature Imitation
ICCV 2021
0
citations
DetCo: Unsupervised Contrastive Learning for Object Detection
ICCV 2021
0
citations
Towards Understanding the Generative Capability of Adversarially Robust Classifiers
ICCV 2021
0
citations
Adversarial Robustness for Unsupervised Domain Adaptation
ICCV 2021
0
citations
MultiSiam: Self-Supervised Multi-Instance Siamese Representation Learning for Autonomous Driving
ICCV 2021
0
citations
NASOA: Towards Faster Task-Oriented Online Fine-Tuning With a Zoo of Models
ICCV 2021
0
citations
Exploring Geometry-Aware Contrast and Clustering Harmonization for Self-Supervised 3D Object Detection
ICCV 2021
0
citations
NAS-OoD: Neural Architecture Search for Out-of-Distribution Generalization
ICCV 2021
0
citations
Beyond One-to-One: Rethinking the Referring Image Segmentation
ICCV 2023
0
citations
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation
ICCV 2023
0
citations
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-efficient Fine-Tuning
ICCV 2023
0
citations
DDP: Diffusion Model for Dense Visual Prediction
ICCV 2023
0
citations
AABO: Adaptive Anchor Box Optimization for Object Detection via Bayesian Sub-sampling
ECCV 2020
0
citations
CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending
ECCV 2020
0
citations
CATCH: Context-based Meta Reinforcement Learning for Transferrable Architecture Search
ECCV 2020
0
citations
Generative Negative Text Replay for Continual Vision-Language Pretraining
ECCV 2022
0
citations
CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving
ECCV 2022
0
citations
DevNet: Self-Supervised Monocular Depth Learning via Density Volume Construction
ECCV 2022
0
citations
MetaBEV: Solving Sensor Failures for 3D Detection and Map Segmentation
ICCV 2023
0
citations
T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
CVPR 2025
0
citations
LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation
ICCV 2025
0
citations
Adding Additional Control to One-Step Diffusion with Joint Distribution Matching
ICCV 2025
0
citations
Masked Diffusion Models as Energy Minimization
NeurIPS 2025
0
citations
Enhancing the Power of OOD Detection via Sample-Aware Model Selection
CVPR 2024
0
citations
The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling
ICML 2024
0
citations
New Insights Into Laplacian Similarity Search
CVPR 2015
0
citations
Reasoning-RCNN: Unifying Adaptive Global Reasoning Into Large-Scale Object Detection
CVPR 2019
0
citations
Spatial-Aware Graph Relation Network for Large-Scale Object Detection
CVPR 2019
0
citations
Bridging the Gap between Sample-based and One-shot Neural Architecture Search with BONAS
NeurIPS 2020
0
citations
Locally Differentially Private (Contextual) Bandits Learning
NeurIPS 2020
0
citations
On Effective Scheduling of Model-based Reinforcement Learning
NeurIPS 2021
0
citations
MixACM: Mixup-Based Robustness Transfer via Distillation of Activated Channel Maps
NeurIPS 2021
0
citations
iFlow: Numerically Invertible Flows for Efficient Lossless Compression via a Uniform Coder
NeurIPS 2021
0
citations
OSOA: One-Shot Online Adaptation of Deep Generative Models for Lossless Compression
NeurIPS 2021
0
citations
Towards a Theoretical Framework of Out-of-Distribution Generalization
NeurIPS 2021
0
citations
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
NeurIPS 2022
0
citations
Understanding Square Loss in Training Overparametrized Neural Network Classifiers
NeurIPS 2022
0
citations
CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds
NeurIPS 2022
0
citations
ZooD: Exploiting Model Zoo for Out-of-Distribution Generalization
NeurIPS 2022
0
citations
Complexity Matters: Rethinking the Latent Space for Generative Modeling
NeurIPS 2023
0
citations
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation
NeurIPS 2023
0
citations
DiffComplete: Diffusion-based Generative 3D Shape Completion
NeurIPS 2023
0
citations
Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models
NeurIPS 2023
0
citations
SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
NeurIPS 2023
0
citations
T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation
NeurIPS 2023
0
citations