Xiawu Zheng
24
Papers
2,293
Total Citations
Papers (24)
MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
NeurIPS 2025
1,227
citations
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
CVPR 2025
858
citations
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
NeurIPS 2025arXiv
130
citations
AffineQuant: Affine Transformation Quantization for Large Language Models
ICLR 2024
43
citations
Bilateral Event Mining and Complementary for Event Stream Super-Resolution
CVPR 2024
9
citations
Multimodal Quantitative Language for Generative Recommendation
ICLR 2025
8
citations
Feature Denoising Diffusion Model for Blind Image Quality Assessment
AAAI 2025
8
citations
Distilling Spatially-Heterogeneous Distortion Perception for Blind Image Quality Assessment
CVPR 2025
3
citations
Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective
ICML 2025
3
citations
From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning
ICCV 2025
2
citations
Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models
ICCV 2025arXiv
2
citations
Outlier-aware Slicing for Post-Training Quantization in Vision Transformer
ICML 2024
0
citations
Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment
ICML 2024
0
citations
Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity
ICML 2024
0
citations
AllGCD: Leveraging All Unlabeled Data for Generalized Category Discovery
ICCV 2025
0
citations
Learning Interleaved Image-Text Comprehension in Vision-Language Large Models
ICLR 2025
0
citations
Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation
AAAI 2025
0
citations
Dynamic Clustering Convolutional Neural Network
AAAI 2025
0
citations
Semi-supervised Blind Image Quality Assessment through Knowledge Distillation and Incremental Learning
AAAI 2024
0
citations
GraCo: Granularity-Controllable Interactive Segmentation
CVPR 2024
0
citations
Solving the Catastrophic Forgetting Problem in Generalized Category Discovery
CVPR 2024
0
citations
RepAn: Enhanced Annealing through Re-parameterization
CVPR 2024
0
citations
polybasic Speculative Decoding Through a Theoretical Perspective
ICML 2025
0
citations
Interaction-based Retrieval-augmented Diffusion Models for Protein-specific 3D Molecule Generation
ICML 2024
0
citations