Xiawu Zheng

24
Papers
2,293
Total Citations

Papers (24)

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models

NeurIPS 2025
1,227
citations

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

CVPR 2025
858
citations

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

NeurIPS 2025arXiv
130
citations

AffineQuant: Affine Transformation Quantization for Large Language Models

ICLR 2024
43
citations

Bilateral Event Mining and Complementary for Event Stream Super-Resolution

CVPR 2024
9
citations

Multimodal Quantitative Language for Generative Recommendation

ICLR 2025
8
citations

Feature Denoising Diffusion Model for Blind Image Quality Assessment

AAAI 2025
8
citations

Distilling Spatially-Heterogeneous Distortion Perception for Blind Image Quality Assessment

CVPR 2025
3
citations

Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective

ICML 2025
3
citations

From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning

ICCV 2025
2
citations

Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models

ICCV 2025arXiv
2
citations

Outlier-aware Slicing for Post-Training Quantization in Vision Transformer

ICML 2024
0
citations

Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment

ICML 2024
0
citations

Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity

ICML 2024
0
citations

AllGCD: Leveraging All Unlabeled Data for Generalized Category Discovery

ICCV 2025
0
citations

Learning Interleaved Image-Text Comprehension in Vision-Language Large Models

ICLR 2025
0
citations

Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation

AAAI 2025
0
citations

Dynamic Clustering Convolutional Neural Network

AAAI 2025
0
citations

Semi-supervised Blind Image Quality Assessment through Knowledge Distillation and Incremental Learning

AAAI 2024
0
citations

GraCo: Granularity-Controllable Interactive Segmentation

CVPR 2024
0
citations

Solving the Catastrophic Forgetting Problem in Generalized Category Discovery

CVPR 2024
0
citations

RepAn: Enhanced Annealing through Re-parameterization

CVPR 2024
0
citations

polybasic Speculative Decoding Through a Theoretical Perspective

ICML 2025
0
citations

Interaction-based Retrieval-augmented Diffusion Models for Protein-specific 3D Molecule Generation

ICML 2024
0
citations