Chao Ma

22
Papers
300
Total Citations

Papers (22)

Single-Model and Any-Modality for Video Object Tracking

CVPR 2024
96
citations

VidToMe: Video Token Merging for Zero-Shot Video Editing

CVPR 2024
89
citations

Domain-Controlled Prompt Learning

AAAI 2024arXiv
30
citations

Domain Prompt Learning with Quaternion Networks

CVPR 2024
22
citations

HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos

CVPR 2025
17
citations

VEON: Vocabulary-Enhanced Occupancy Prediction

ECCV 2024
15
citations

XTrack: Multimodal Training Boosts RGB-X Video Object Trackers

ICCV 2025
10
citations

Monocular Identity-Conditioned Facial Reflectance Reconstruction

CVPR 2024
7
citations

Corvid: Improving Multimodal Large Language Models Towards Chain-of-Thought Reasoning

ICCV 2025arXiv
7
citations

What You Have is What You Track: Adaptive and Robust Multimodal Tracking

ICCV 2025
3
citations

AdaptGrad: Adaptive Sampling to Reduce Noise

NeurIPS 2025
2
citations

Cross-Architecture Distillation Made Simple with Redundancy Suppression

ICCV 2025
2
citations

Towards Causal Foundation Model: on Duality between Optimal Balancing and Attention

ICML 2024
0
citations

VRM: Knowledge Distillation via Virtual Relation Matching

ICCV 2025
0
citations

VTimeCoT: Thinking by Drawing for Video Temporal Grounding and Reasoning

ICCV 2025
0
citations

PVMamba: Parallelizing Vision Mamba via Dynamic State Aggregation

ICCV 2025
0
citations

Robust SAM: On the Adversarial Robustness of Vision Foundation Models

AAAI 2025
0
citations

LERE: Learning-Based Low-Rank Matrix Recovery with Rank Estimation

AAAI 2024
0
citations

SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction

CVPR 2024
0
citations

DiffusionTrack: Point Set Diffusion Model for Visual Object Tracking

CVPR 2024
0
citations

A Fixed-Point Approach for Causal Generative Modeling

ICML 2024
0
citations

S^3-Face: SSS-Compliant Facial Reflectance Estimation via Diffusion Priors

CVPR 2025
0
citations