Tong Lu
12
Papers
2,562
Total Citations
Papers (12)
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
CVPR 2024
2,210
citations
Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?
CVPR 2024
169
citations
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World
ICLR 2024
118
citations
CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
ICLR 2025arXiv
41
citations
Docopilot: Improving Multimodal Models for Document-Level Understanding
CVPR 2025
14
citations
EgoExoBench: A Benchmark for First- and Third-person View Video Understanding in MLLMs
NeurIPS 2025
10
citations
CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers
AAAI 2024
0
citations
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications
CVPR 2024
0
citations
AVSegFormer: Audio-Visual Segmentation with Transformer
AAAI 2024arXiv
0
citations
Deconfound Semantic Shift and Incompleteness in Incremental Few-shot Semantic Segmentation
AAAI 2025
0
citations
MOERL: When Mixture-of-Experts Meet Reinforcement Learning for Adverse Weather Image Restoration
ICCV 2025
0
citations
RepKPU: Point Cloud Upsampling with Kernel Point Representation and Deformation
CVPR 2024
0
citations