Yifei Huang

22
Papers
157
Total Citations

Papers (22)

EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World

CVPR 2024
84
citations

CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding

ICLR 2025
39
citations

Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning

ICLR 2025
11
citations

EgoExoBench: A Benchmark for First- and Third-person View Video Understanding in MLLMs

NeurIPS 2025
10
citations

ActionVOS: Actions as Prompts for Video Object Segmentation

ECCV 2024
9
citations

Learning Streaming Video Representation via Multitask Training

ICCV 2025
3
citations

TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation

ICML 2025
1
citations

Interact Before Align: Leveraging Cross-Modal Knowledge for Domain Adaptive Action Recognition

CVPR 2022
0
citations

Ego4D: Around the World in 3,000 Hours of Egocentric Video

CVPR 2022
0
citations

Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction

CVPR 2023arXiv
0
citations

Weakly Supervised Temporal Sentence Grounding With Uncertainty-Guided Self-Training

CVPR 2023
0
citations

FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning

ICCV 2021arXiv
0
citations

Memory-and-Anticipation Transformer for Online Action Understanding

ICCV 2023arXiv
0
citations

Learn to Recover Visible Color for Video Surveillance in a Day

ECCV 2020
0
citations

Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition

ICCV 2025
0
citations

Compound Prototype Matching for Few-Shot Action Recognition

ECCV 2022
0
citations

Egocentric Action-aware Inertial Localization in Point Clouds with Vision-Language Guidance

ICCV 2025
0
citations

Retrieval-Augmented Egocentric Video Captioning

CVPR 2024
0
citations

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

CVPR 2024
0
citations

Improving Action Segmentation via Graph-Based Temporal Reasoning

CVPR 2020
0
citations

Goal-Oriented Gaze Estimation for Zero-Shot Learning

CVPR 2021arXiv
0
citations

CLRNet: Cross Layer Refinement Network for Lane Detection

CVPR 2022arXiv
0
citations