Lin Ma

15 Papers
129 Total Citations

Papers (15)

Making Large Language Models Better Planners with Reasoning-Decision Alignment

ECCV 2024 · arXiv
35
citations

AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning

CVPR 2024 · arXiv
24
citations

ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field

AAAI 2024 · arXiv
15
citations

UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection

ECCV 2024 · arXiv
15
citations

RoboTron-Mani: All-in-One Multimodal Large Model for Robotic Manipulation

ICCV 2025
13
citations

RoboTron-Drive: All-in-One Large Multimodal Model for Autonomous Driving

ICCV 2025 · arXiv
11
citations

CO-MOT: Boosting End-to-end Transformer-based Multi-Object Tracking via Coopetition Label Assignment and Shadow Sets

ICLR 2025
7
citations

Towards Efficient Foundation Model for Zero-shot Amodal Segmentation

CVPR 2025
3
citations

RoboTron-Nav: A Unified Framework for Embodied Navigation Integrating Perception, Planning, and Prediction

ICCV 2025 · arXiv
3
citations

RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case

ICCV 2025 · arXiv
3
citations

Instance-Aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning

AAAI 2024 · arXiv
0
citations

Affordances-Oriented Planning Using Foundation Models for Continuous Vision-Language Navigation

AAAI 2025
0
citations

DisTime: Distribution-based Time Representation for Video Large Language Models

ICCV 2025
0
citations

Misalignment-Robust Frequency Distribution Loss for Image Transformation

CVPR 2024
0
citations

InstaGen: Enhancing Object Detection by Training on Synthetic Dataset

CVPR 2024
0
citations