Zsolt Kira
Papers: 7
Total Citations: 27
Papers (7)
From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons
CVPR 2025
23 citations
FRAMES-VQA: Benchmarking Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question Answering
CVPR 2025
4 citations
Seeing the Unseen: Visual Common Sense for Semantic Placement
CVPR 2024
0 citations
Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion
CVPR 2024
0 citations
GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation
CVPR 2024
0 citations
EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device
ICCV 2025
0 citations
When Domain Generalization meets Generalized Category Discovery: An Adaptive Task-Arithmetic Driven Approach
CVPR 2025
0 citations