Sangmin Lee

15
Papers
57
Total Citations

Papers (15)

Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders

CVPR 2025
18
citations

Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge

CVPR 2024
15
citations

SocialGesture: Delving into Multi-person Gesture Understanding

CVPR 2025
5
citations

Question-Aware Gaussian Experts for Audio-Visual Question Answering

CVPR 2025
5
citations

Self-supervised Debiasing Using Low Rank Regularization

CVPR 2024
5
citations

Object-aware Sound Source Localization via Audio-Visual Scene Understanding

CVPR 2025
5
citations

Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI

CVPR 2025
2
citations

MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization

ICCV 2025
1
citation

IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution

ICCV 2025
1
citation

Defining Neural Network Architecture through Polytope Structures of Datasets

ICML 2024
0
citations

DynScene: Scalable Generation of Dynamic Robotic Manipulation Scenes for Embodied AI

CVPR 2025
0
citations

Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection

AAAI 2025
0
citations

LAMA-UT: Language Agnostic Multilingual ASR Through Orthography Unification and Language-Specific Transliteration

AAAI 2025
0
citations

Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations

CVPR 2024
0
citations

Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation

CVPR 2025
0
citations