Xiawu Zheng

36
Papers
2,290
Total Citations

Papers (36)

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models

NeurIPS 2025
1,227
citations

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

CVPR 2025
858
citations

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

NeurIPS 2025arXiv
130
citations

AffineQuant: Affine Transformation Quantization for Large Language Models

ICLR 2024
43
citations

Bilateral Event Mining and Complementary for Event Stream Super-Resolution

CVPR 2024
9
citations

Feature Denoising Diffusion Model for Blind Image Quality Assessment

AAAI 2025
8
citations

Multimodal Quantitative Language for Generative Recommendation

ICLR 2025
8
citations

Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective

ICML 2025
3
citations

From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning

ICCV 2025
2
citations

Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models

ICCV 2025arXiv
2
citations

Interaction-based Retrieval-augmented Diffusion Models for Protein-specific 3D Molecule Generation

ICML 2024
0
citations

Outlier-aware Slicing for Post-Training Quantization in Vision Transformer

ICML 2024
0
citations

Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment

ICML 2024
0
citations

Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity

ICML 2024
0
citations

Rethinking Performance Estimation in Neural Architecture Search

CVPR 2020arXiv
0
citations

Neural Architecture Search With Representation Mutual Information

CVPR 2022
0
citations

Training-Free Transformer Architecture Search

CVPR 2022arXiv
0
citations

Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective

CVPR 2023arXiv
0
citations

Meta Architecture for Point Cloud Analysis

CVPR 2023arXiv
0
citations

Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning

CVPR 2023
0
citations

Multinomial Distribution Learning for Effective Neural Architecture Search

ICCV 2019
0
citations

EC-DARTS: Inducing Equalized and Consistent Optimization Into DARTS

ICCV 2021
0
citations

AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration

ICCV 2023arXiv
0
citations

Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso under Information Bottleneck Principle

ICCV 2023
0
citations

PAMS: Quantized Super-Resolution via Parameterized Max Scale

ECCV 2020
0
citations

Distilling Spatially-Heterogeneous Distortion Perception for Blind Image Quality Assessment

CVPR 2025
0
citations

AllGCD: Leveraging All Unlabeled Data for Generalized Category Discovery

ICCV 2025
0
citations

Learning Interleaved Image-Text Comprehension in Vision-Language Large Models

ICLR 2025
0
citations

Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation

AAAI 2025
0
citations

Dynamic Clustering Convolutional Neural Network

AAAI 2025
0
citations

Semi-supervised Blind Image Quality Assessment through Knowledge Distillation and Incremental Learning

AAAI 2024
0
citations

GraCo: Granularity-Controllable Interactive Segmentation

CVPR 2024
0
citations

Solving the Catastrophic Forgetting Problem in Generalized Category Discovery

CVPR 2024
0
citations

RepAn: Enhanced Annealing through Re-parameterization

CVPR 2024
0
citations

polybasic Speculative Decoding Through a Theoretical Perspective

ICML 2025
0
citations

Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning

NeurIPS 2023
0
citations