Yutaka Matsuo
8
Papers
212
Total Citations
Papers (8)
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
ICLR 2024
141
citations
CityNav: A Large-Scale Dataset for Real-World Aerial Navigation
ICCV 2025
25
citations
Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search
NeurIPS 2025 (arXiv)
20
citations
Topology of Reasoning: Understanding Large Reasoning Models through Reasoning Graph Properties
NeurIPS 2025 (arXiv)
13
citations
Rethinking Evaluation of Sparse Autoencoders through the Representation of Polysemous Words
ICLR 2025 (arXiv)
9
citations
Image Referenced Sketch Colorization Based on Animation Creation Workflow
CVPR 2025
3
citations
GeoProg3D: Compositional Visual Reasoning for City-Scale 3D Language Fields
ICCV 2025
1
citations
Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation
NeurIPS 2025 (arXiv)
0
citations