Zhenguo Li

74
Papers
536
Total Citations

Papers (74)

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

ICLR 2025
169
citations

Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning

ICLR 2025
74
citations

Accelerating Diffusion Sampling with Optimized Time Steps

CVPR 2024
51
citations

DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection

CVPR 2024
45
citations

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis

ICLR 2024
44
citations

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

CVPR 2025
44
citations

MagicDrive-V2: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control

ICCV 2025
44
citations

DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

CVPR 2024
39
citations

Implicit Search via Discrete Diffusion: A Study on Chess

ICLR 2025
13
citations

CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs

CVPR 2024
13
citations

Rethinking Performance Estimation in Neural Architecture Search

CVPR 2020arXiv
0
citations

SP-NAS: Serial-to-Parallel Backbone Search for Object Detection

CVPR 2020
0
citations

Boosting Few-Shot Learning With Adaptive Margin Loss

CVPR 2020arXiv
0
citations

iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression

CVPR 2021arXiv
0
citations

TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search

CVPR 2021
0
citations

Transformation Invariant Few-Shot Object Detection

CVPR 2021
0
citations

ORDisCo: Effective and Efficient Usage of Incremental Unlabeled Data for Semi-Supervised Continual Learning

CVPR 2021arXiv
0
citations

Joint-DetNAS: Upgrade Your Detector With NAS, Pruning and Dynamic Distillation

CVPR 2021
0
citations

Adversarial Invariant Learning

CVPR 2021
0
citations

Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search

CVPR 2022
0
citations

ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-Wise Semantic Alignment and Generation

CVPR 2022arXiv
0
citations

Long-Tail Recognition via Compositional Knowledge Transfer

CVPR 2022
0
citations

Semi-Supervised Object Detection via Multi-Instance Alignment With Global Class Prototypes

CVPR 2022
0
citations

OoD-Bench: Quantifying and Understanding Two Dimensions of Out-of-Distribution Generalization

CVPR 2022
0
citations

PILC: Practical Image Lossless Compression With an End-to-End GPU Oriented Neural Framework

CVPR 2022
0
citations

Mixed Autoencoder for Self-Supervised Visual Representation Learning

CVPR 2023arXiv
0
citations

DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-Training via Word-Region Alignment

CVPR 2023arXiv
0
citations

ContraNeRF: Generalizable Neural Radiance Fields for Synthetic-to-Real Novel View Synthesis via Contrastive Learning

CVPR 2023arXiv
0
citations

Auto-FPN: Automatic Network Architecture Adaptation for Object Detection Beyond Classification

ICCV 2019
0
citations

G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-Guided Feature Imitation

ICCV 2021
0
citations

DetCo: Unsupervised Contrastive Learning for Object Detection

ICCV 2021arXiv
0
citations

Towards Understanding the Generative Capability of Adversarially Robust Classifiers

ICCV 2021arXiv
0
citations

Adversarial Robustness for Unsupervised Domain Adaptation

ICCV 2021arXiv
0
citations

MultiSiam: Self-Supervised Multi-Instance Siamese Representation Learning for Autonomous Driving

ICCV 2021arXiv
0
citations

NASOA: Towards Faster Task-Oriented Online Fine-Tuning With a Zoo of Models

ICCV 2021arXiv
0
citations

Exploring Geometry-Aware Contrast and Clustering Harmonization for Self-Supervised 3D Object Detection

ICCV 2021
0
citations

NAS-OoD: Neural Architecture Search for Out-of-Distribution Generalization

ICCV 2021
0
citations

Beyond One-to-One: Rethinking the Referring Image Segmentation

ICCV 2023
0
citations

UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation

ICCV 2023
0
citations

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-efficient Fine-Tuning

ICCV 2023arXiv
0
citations

DDP: Diffusion Model for Dense Visual Prediction

ICCV 2023arXiv
0
citations

AABO: Adaptive Anchor Box Optimization for Object Detection via Bayesian Sub-sampling

ECCV 2020
0
citations

CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending

ECCV 2020
0
citations

CATCH: Context-based Meta Reinforcement Learning for Transferrable Architecture Search

ECCV 2020
0
citations

Generative Negative Text Replay for Continual Vision-Language Pretraining

ECCV 2022
0
citations

CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving

ECCV 2022
0
citations

DevNet: Self-Supervised Monocular Depth Learning via Density Volume Construction

ECCV 2022
0
citations

MetaBEV: Solving Sensor Failures for 3D Detection and Map Segmentation

ICCV 2023
0
citations

T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation

CVPR 2025
0
citations

LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation

ICCV 2025
0
citations

Adding Additional Control to One-Step Diffusion with Joint Distribution Matching

ICCV 2025
0
citations

Masked Diffusion Models as Energy Minimization

NeurIPS 2025
0
citations

Enhancing the Power of OOD Detection via Sample-Aware Model Selection

CVPR 2024
0
citations

The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling

ICML 2024
0
citations

New Insights Into Laplacian Similarity Search

CVPR 2015
0
citations

Reasoning-RCNN: Unifying Adaptive Global Reasoning Into Large-Scale Object Detection

CVPR 2019
0
citations

Spatial-Aware Graph Relation Network for Large-Scale Object Detection

CVPR 2019
0
citations

Bridging the Gap between Sample-based and One-shot Neural Architecture Search with BONAS

NeurIPS 2020
0
citations

Locally Differentially Private (Contextual) Bandits Learning

NeurIPS 2020
0
citations

On Effective Scheduling of Model-based Reinforcement Learning

NeurIPS 2021
0
citations

MixACM: Mixup-Based Robustness Transfer via Distillation of Activated Channel Maps

NeurIPS 2021
0
citations

iFlow: Numerically Invertible Flows for Efficient Lossless Compression via a Uniform Coder

NeurIPS 2021
0
citations

OSOA: One-Shot Online Adaptation of Deep Generative Models for Lossless Compression

NeurIPS 2021
0
citations

Towards a Theoretical Framework of Out-of-Distribution Generalization

NeurIPS 2021
0
citations

DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection

NeurIPS 2022
0
citations

Understanding Square Loss in Training Overparametrized Neural Network Classifiers

NeurIPS 2022
0
citations

CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds

NeurIPS 2022
0
citations

ZooD: Exploiting Model Zoo for Out-of-Distribution Generalization

NeurIPS 2022
0
citations

Complexity Matters: Rethinking the Latent Space for Generative Modeling

NeurIPS 2023
0
citations

DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation

NeurIPS 2023
0
citations

DiffComplete: Diffusion-based Generative 3D Shape Completion

NeurIPS 2023
0
citations

Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models

NeurIPS 2023
0
citations

SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models

NeurIPS 2023
0
citations

T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation

NeurIPS 2023
0
citations