Tong Lu

12

Papers

2,562

Total Citations

Papers (12)

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?

The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World

CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding

Docopilot: Improving Multimodal Models for Document-Level Understanding

EgoExoBench: A Benchmark for First- and Third-person View Video Understanding in MLLMs

CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers

Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications

AVSegFormer: Audio-Visual Segmentation with Transformer

Deconfound Semantic Shift and Incompleteness in Incremental Few-shot Semantic Segmentation

MOERL: When Mixture-of-Experts Meet Reinforcement Learning for Adverse Weather Image Restoration

RepKPU: Point Cloud Upsampling with Kernel Point Representation and Deformation