Zhenguo Li

16
Papers
536
Total Citations

Papers (16)

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

ICLR 2025
169
citations

Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning

ICLR 2025
74
citations

Accelerating Diffusion Sampling with Optimized Time Steps

CVPR 2024
51
citations

DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection

CVPR 2024
45
citations

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

CVPR 2025arXiv
44
citations

MagicDrive-V2: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control

ICCV 2025
44
citations

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis

ICLR 2024
44
citations

DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

CVPR 2024
39
citations

CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs

CVPR 2024
13
citations

Implicit Search via Discrete Diffusion: A Study on Chess

ICLR 2025
13
citations

LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation

ICCV 2025
0
citations

T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation

CVPR 2025
0
citations

The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling

ICML 2024
0
citations

Enhancing the Power of OOD Detection via Sample-Aware Model Selection

CVPR 2024
0
citations

Adding Additional Control to One-Step Diffusion with Joint Distribution Matching

ICCV 2025
0
citations

Masked Diffusion Models as Energy Minimization

NeurIPS 2025
0
citations