Sangmin Lee
15
Papers
57
Total Citations
Papers (15)
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders
CVPR 2025
18
citations
Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge
CVPR 2024
15
citations
SocialGesture: Delving into Multi-person Gesture Understanding
CVPR 2025
5
citations
Question-Aware Gaussian Experts for Audio-Visual Question Answering
CVPR 2025
5
citations
Self-supervised Debiasing Using Low Rank Regularization
CVPR 2024
5
citations
Object-aware Sound Source Localization via Audio-Visual Scene Understanding
CVPR 2025
5
citations
Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI
CVPR 2025
2
citations
MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization
ICCV 2025
1
citation
IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution
ICCV 2025
1
citation
Defining Neural Network Architecture through Polytope Structures of Datasets
ICML 2024
0
citations
DynScene: Scalable Generation of Dynamic Robotic Manipulation Scenes for Embodied AI
CVPR 2025
0
citations
Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection
AAAI 2025
0
citations
LAMA-UT: Language Agnostic Multilingual ASR Through Orthography Unification and Language-Specific Transliteration
AAAI 2025
0
citations
Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations
CVPR 2024
0
citations
Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
CVPR 2025
0
citations