"egocentric vision" Papers
16 papers found
Conference
Closed-Loop Transfer for Weakly-supervised Affordance Grounding
Jiajin Tang, Zhengxuan Wei, Ge Zheng et al.
ICCV 2025posterarXiv:2510.17384
1
citations
egoEMOTION: Egocentric Vision and Physiological Signals for Emotion and Personality Recognition in Real-world Tasks
Matthias Jammot, Björn Braun, Paul Streli et al.
NEURIPS 2025posterarXiv:2510.22129
EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity Understanding
Ege Özsoy, Arda Mamur, Felix Tristram et al.
NEURIPS 2025posterarXiv:2505.24287
5
citations
EgoM2P: Egocentric Multimodal Multitask Pretraining
Gen Li, Yutong Chen, Yiqian Wu et al.
ICCV 2025posterarXiv:2506.07886
4
citations
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
Sheng Zhou, Junbin Xiao, Qingyun Li et al.
CVPR 2025posterarXiv:2502.07411
29
citations
Fish2Mesh Transformer: 3D Human Mesh Recovery from Egocentric Vision
Tianma Shen, Aditya Shrish Puranik, James Vong et al.
ICCV 2025posterarXiv:2503.06089
1
citations
HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents
Tristan Tomilin, Meng Fang, Mykola Pechenizkiy
ICLR 2025posterarXiv:2503.08241
5
citations
Is Tracking really more challenging in First Person Egocentric Vision?
Matteo Dunnhofer, Zaira Manigrasso, Christian Micheloni
ICCV 2025highlightarXiv:2507.16015
Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions
Liang Xu, Chengqun Yang, Zili Lin et al.
ICCV 2025posterarXiv:2508.04681
1
citations
Reading Recognition in the Wild
Charig Yang, Samiul Alam, Shakhrul Iman Siam et al.
NEURIPS 2025posterarXiv:2505.24848
3
citations
Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors
Peiran Xu, Yadong MU
ICLR 2025posterarXiv:2505.24103
5
citations
WearVQA: A Visual Question Answering Benchmark for Wearables in Egocentric Authentic Real-world scenarios
Eun Chang, Zhuangqun Huang, Yiwei Liao et al.
NEURIPS 2025posterarXiv:2511.22154
ActionVOS: Actions as Prompts for Video Object Segmentation
LIANGYANG OUYANG, Ruicong Liu, Yifei Huang et al.
ECCV 2024posterarXiv:2407.07402
9
citations
Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection?
Rosario Leonardi, Antonino Furnari, Francesco Ragusa et al.
ECCV 2024posterarXiv:2312.02672
5
citations
HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization
Sakib Reza, Yuexi Zhang, Mohsen Moghaddam et al.
ECCV 2024posterarXiv:2408.06437
5
citations
PALM: Predicting Actions through Language Models
Sanghwan Kim, Daoji Huang, Yongqin Xian et al.
ECCV 2024posterarXiv:2311.17944
23
citations