Yanwei Li
19
Papers
1,667
Total Citations
Papers (19)
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
CVPR 2025
858
citations
LISA: Reasoning Segmentation via Large Language Model
CVPR 2024
721
citations
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency
ICML 2025
88
citations
Attention-Guided Unified Network for Panoptic Segmentation
CVPR 2019
0
citations
Learning Dynamic Routing for Semantic Segmentation
CVPR 2020arXiv
0
citations
Multi-Scale Aligned Distillation for Low-Resolution Detection
CVPR 2021
0
citations
Scale-Aware Automatic Augmentation for Object Detection
CVPR 2021arXiv
0
citations
Voxel Field Fusion for 3D Object Detection
CVPR 2022arXiv
0
citations
Focal Sparse Convolutional Networks for 3D Object Detection
CVPR 2022arXiv
0
citations
End-to-end 3D Tracking with Decoupled Queries
ICCV 2023
0
citations
Attention-Aware Learning for Hyperparameter Prediction in Image Processing Pipelines
ECCV 2022
0
citations
Fully Convolutional Networks for Panoptic Segmentation
CVPR 2021arXiv
0
citations
Aligning Effective Tokens with Video Anomaly in Large Language Models
ICCV 2025
0
citations
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition
ICCV 2025
0
citations
Learnable Tree Filter for Structure-preserving Feature Transform
NeurIPS 2019
0
citations
Rethinking Learnable Tree Filter for Generic Feature Transform
NeurIPS 2020
0
citations
Fine-Grained Dynamic Head for Object Detection
NeurIPS 2020
0
citations
Unifying Voxel-based Representation with Transformer for 3D Object Detection
NeurIPS 2022
0
citations
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction
NeurIPS 2023
0
citations