Yanwei Li

19
Papers
1,667
Total Citations

Papers (19)

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

CVPR 2025
858
citations

LISA: Reasoning Segmentation via Large Language Model

CVPR 2024
721
citations

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

ICML 2025
88
citations

Attention-Guided Unified Network for Panoptic Segmentation

CVPR 2019
0
citations

Learning Dynamic Routing for Semantic Segmentation

CVPR 2020
0
citations

Multi-Scale Aligned Distillation for Low-Resolution Detection

CVPR 2021
0
citations

Scale-Aware Automatic Augmentation for Object Detection

CVPR 2021
0
citations

Voxel Field Fusion for 3D Object Detection

CVPR 2022
0
citations

Focal Sparse Convolutional Networks for 3D Object Detection

CVPR 2022
0
citations

End-to-end 3D Tracking with Decoupled Queries

ICCV 2023
0
citations

Attention-Aware Learning for Hyperparameter Prediction in Image Processing Pipelines

ECCV 2022
0
citations

Fully Convolutional Networks for Panoptic Segmentation

CVPR 2021
0
citations

Aligning Effective Tokens with Video Anomaly in Large Language Models

ICCV 2025
0
citations

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

ICCV 2025
0
citations

Learnable Tree Filter for Structure-preserving Feature Transform

NeurIPS 2019
0
citations

Rethinking Learnable Tree Filter for Generic Feature Transform

NeurIPS 2020
0
citations

Fine-Grained Dynamic Head for Object Detection

NeurIPS 2020
0
citations

Unifying Voxel-based Representation with Transformer for 3D Object Detection

NeurIPS 2022
0
citations

GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction

NeurIPS 2023
0
citations