Lin Ma
15
Papers
129
Total Citations
Papers (15)
Making Large Language Models Better Planners with Reasoning-Decision Alignment
ECCV 2024arXiv
35
citations
AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning
CVPR 2024arXiv
24
citations
ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field
AAAI 2024arXiv
15
citations
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
ECCV 2024arXiv
15
citations
RoboTron-Mani: All-in-One Multimodal Large Model for Robotic Manipulation
ICCV 2025
13
citations
RoboTron-Drive: All-in-One Large Multimodal Model for Autonomous Driving
ICCV 2025arXiv
11
citations
CO-MOT: Boosting End-to-end Transformer-based Multi-Object Tracking via Coopetition Label Assignment and Shadow Sets
ICLR 2025
7
citations
Towards Efficient Foundation Model for Zero-shot Amodal Segmentation
CVPR 2025
3
citations
RoboTron-Nav: A Unified Framework for Embodied Navigation Integrating Perception, Planning, and Prediction
ICCV 2025arXiv
3
citations
RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case
ICCV 2025arXiv
3
citations
Instance-Aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning
AAAI 2024arXiv
0
citations
Affordances-Oriented Planning Using Foundation Models for Continuous Vision-Language Navigation
AAAI 2025
0
citations
DisTime: Distribution-based Time Representation for Video Large Language Models
ICCV 2025
0
citations
Misalignment-Robust Frequency Distribution Loss for Image Transformation
CVPR 2024
0
citations
InstaGen: Enhancing Object Detection by Training on Synthetic Dataset
CVPR 2024
0
citations