Yifei Huang
22
Papers
157
Total Citations
Papers (22)
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World
CVPR 2024
84
citations
CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
ICLR 2025
39
citations
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning
ICLR 2025
11
citations
EgoExoBench: A Benchmark for First- and Third-person View Video Understanding in MLLMs
NeurIPS 2025
10
citations
ActionVOS: Actions as Prompts for Video Object Segmentation
ECCV 2024
9
citations
Learning Streaming Video Representation via Multitask Training
ICCV 2025
3
citations
TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation
ICML 2025
1
citations
Interact Before Align: Leveraging Cross-Modal Knowledge for Domain Adaptive Action Recognition
CVPR 2022
0
citations
Ego4D: Around the World in 3,000 Hours of Egocentric Video
CVPR 2022
0
citations
Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction
CVPR 2023arXiv
0
citations
Weakly Supervised Temporal Sentence Grounding With Uncertainty-Guided Self-Training
CVPR 2023
0
citations
FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning
ICCV 2021arXiv
0
citations
Memory-and-Anticipation Transformer for Online Action Understanding
ICCV 2023arXiv
0
citations
Learn to Recover Visible Color for Video Surveillance in a Day
ECCV 2020
0
citations
Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition
ICCV 2025
0
citations
Compound Prototype Matching for Few-Shot Action Recognition
ECCV 2022
0
citations
Egocentric Action-aware Inertial Localization in Point Clouds with Vision-Language Guidance
ICCV 2025
0
citations
Retrieval-Augmented Egocentric Video Captioning
CVPR 2024
0
citations
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
CVPR 2024
0
citations
Improving Action Segmentation via Graph-Based Temporal Reasoning
CVPR 2020
0
citations
Goal-Oriented Gaze Estimation for Zero-Shot Learning
CVPR 2021arXiv
0
citations
CLRNet: Cross Layer Refinement Network for Lane Detection
CVPR 2022arXiv
0
citations