Tae-Hyun Oh
37
Papers
147
Total Citations
Papers (37)
Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering
CVPR 2024
75
citations
Noise Map Guidance: Inversion with Spatial Context for Real Image Editing
ICLR 2024
23
citations
BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models
ECCV 2024
10
citations
A Pseudo-Bayesian Algorithm for Robust PCA
NeurIPS 2016
9
citations
DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding
ICCV 2025
7
citations
Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild
CVPR 2025
6
citations
VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models
ICCV 2025
6
citations
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers
ICCV 2025
4
citations
SoundBrush: Sound as a Brush for Visual Scene Editing
AAAI 2025
3
citations
VSC: Visual Search Compositional Text-to-Image Diffusion Model
ICCV 2025
2
citations
Learning-based Axial Video Motion Magnification
ECCV 2024
2
citations
Variational Prototyping-Encoder: One-Shot Learning With Prototypical Images
CVPR 2019
0
citations
Listen to Look: Action Recognition by Previewing Audio
CVPR 2020
0
citations
Monocular Reconstruction of Neural Face Reflectance Fields
CVPR 2021
0
citations
Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment
CVPR 2023
0
citations
Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting
ICCV 2017
0
citations
Personalized Cinemagraphs Using Semantic Understanding and Collaborative Learning
ICCV 2017
0
citations
Distilling Global and Local Logits With Densely Connected Relations
ICCV 2021
0
citations
CDS: Cross-Domain Self-Supervised Pre-Training
ICCV 2021
0
citations
Sound Source Localization is All about Cross-Modal Alignment
ICCV 2023
0
citations
Scratching Visual Transformer's Back with Uniform Attention
ICCV 2023
0
citations
TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation
ICCV 2023
0
citations
Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers
ECCV 2022
0
citations
CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes
ECCV 2022
0
citations
HDR-Plenoxels: Self-Calibrating High Dynamic Range Radiance Fields
ECCV 2022
0
citations
Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics
CVPR 2025
0
citations
Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration
CVPR 2025
0
citations
Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior
AAAI 2025
0
citations
FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields
AAAI 2024
0
citations
Fast Randomized Singular Value Thresholding for Nuclear Norm Minimization
CVPR 2015
0
citations
Globally Optimal Manhattan Frame Estimation in Real-Time
CVPR 2016
0
citations
Video-Story Composition via Plot Analysis
CVPR 2016
0
citations
Learning to Localize Sound Source in Visual Scenes
CVPR 2018
0
citations
Globally Optimal Inlier Set Maximization for Atlanta Frame Estimation
CVPR 2018
0
citations
Dense Relational Captioning: Triple-Stream Networks for Relationship-Based Captioning
CVPR 2019
0
citations
Speech2Face: Learning the Face Behind a Voice
CVPR 2019
0
citations
Neural Inverse Knitting: From Images to Manufacturing Instructions
ICML 2019
0
citations