Shao-Yuan Lo
Papers: 6
Total Citations: 3
Papers (6)
Overcoming Multi-step Complexity in Multimodal Theory-of-Mind Reasoning: A Scalable Bayesian Planner
ICML 2025, arXiv
3 citations
Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models
CVPR 2025
0 citations
Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning
CVPR 2025
0 citations
Bridging Compressed Image Latents and Multimodal Large Language Models
ICLR 2025, arXiv
0 citations
Can’t Make an Omelette Without Breaking Some Eggs: Plausible Action Anticipation Using Large Video-Language Models
CVPR 2024
0 citations
Uncertainty-aware Action Decoupling Transformer for Action Anticipation
CVPR 2024
0 citations