Wenqi Shao

28
Papers
607
Total Citations

Papers (28)

OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models

ICLR 2024
320
citations

GUIOdyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices

ICCV 2025
96
citations

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

ICML 2025
72
citations

BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation

ICLR 2024
46
citations

OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

CVPR 2025
18
citations

Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models

CVPR 2025
10
citations

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

ICCV 2025
10
citations

Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation

ICLR 2025
8
citations

Distilling Monocular Foundation Model for Fine-grained Depth Completion

CVPR 2025
8
citations

DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model

CVPR 2024
7
citations

Cached Transformers: Improving Transformers with Differentiable Memory Cache

AAAI 2024
5
citations

Cross-Subject Mind Decoding from Inaccurate Representations

ICCV 2025
3
citations

JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data

CVPR 2025
2
citations

OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis

NeurIPS 2025
2
citations

DexHandDiff: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation

CVPR 2025
0
citations

LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation

ICCV 2025
0
citations

ZipVL: Accelerating Vision-Language Models through Dynamic Token Sparsity

ICCV 2025
0
citations

Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation

ICCV 2025
0
citations

Learning Dense Feature Matching via Lifting Single 2D Image to 3D Space

ICCV 2025
0
citations

OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM

CVPR 2024
0
citations

SSN: Learning Sparse Switchable Normalization via SparsestMax

CVPR 2019
0
citations

Real-Time Controllable Denoising for Image and Video

CVPR 2023
0
citations

Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks

ICCV 2019
0
citations

DiffRate: Differentiable Compression Rate for Efficient Vision Transformers

ICCV 2023
0
citations

Beyond One-to-One: Rethinking the Referring Image Segmentation

ICCV 2023
0
citations

Not All Models Are Equal: Predicting Model Transferability in a Self-Challenging Fisher Space

ECCV 2022
0
citations

Rethinking the Pruning Criteria for Convolutional Neural Network

NeurIPS 2021
0
citations

Foundation Model is Efficient Multimodal Multitask Model Selector

NeurIPS 2023
0
citations