Tae-Hyun Oh

37
Papers
147
Total Citations

Papers (37)

Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering

CVPR 2024
75
citations

Noise Map Guidance: Inversion with Spatial Context for Real Image Editing

ICLR 2024
23
citations

BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models

ECCV 2024
10
citations

A Pseudo-Bayesian Algorithm for Robust PCA

NeurIPS 2016
9
citations

DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding

ICCV 2025
7
citations

Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild

CVPR 2025
6
citations

VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models

ICCV 2025
6
citations

JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers

ICCV 2025
4
citations

SoundBrush: Sound as a Brush for Visual Scene Editing

AAAI 2025
3
citations

VSC: Visual Search Compositional Text-to-Image Diffusion Model

ICCV 2025
2
citations

Learning-based Axial Video Motion Magnification

ECCV 2024
2
citations

Variational Prototyping-Encoder: One-Shot Learning With Prototypical Images

CVPR 2019
0
citations

Listen to Look: Action Recognition by Previewing Audio

CVPR 2020
0
citations

Monocular Reconstruction of Neural Face Reflectance Fields

CVPR 2021
0
citations

Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment

CVPR 2023
0
citations

Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting

ICCV 2017
0
citations

Personalized Cinemagraphs Using Semantic Understanding and Collaborative Learning

ICCV 2017
0
citations

Distilling Global and Local Logits With Densely Connected Relations

ICCV 2021
0
citations

CDS: Cross-Domain Self-Supervised Pre-Training

ICCV 2021
0
citations

Sound Source Localization is All about Cross-Modal Alignment

ICCV 2023
0
citations

Scratching Visual Transformer's Back with Uniform Attention

ICCV 2023
0
citations

TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation

ICCV 2023
0
citations

Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers

ECCV 2022
0
citations

CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes

ECCV 2022
0
citations

HDR-Plenoxels: Self-Calibrating High Dynamic Range Radiance Fields

ECCV 2022
0
citations

Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics

CVPR 2025
0
citations

Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration

CVPR 2025
0
citations

Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior

AAAI 2025
0
citations

FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields

AAAI 2024
0
citations

Fast Randomized Singular Value Thresholding for Nuclear Norm Minimization

CVPR 2015
0
citations

Globally Optimal Manhattan Frame Estimation in Real-Time

CVPR 2016
0
citations

Video-Story Composition via Plot Analysis

CVPR 2016
0
citations

Learning to Localize Sound Source in Visual Scenes

CVPR 2018
0
citations

Globally Optimal Inlier Set Maximization for Atlanta Frame Estimation

CVPR 2018
0
citations

Dense Relational Captioning: Triple-Stream Networks for Relationship-Based Captioning

CVPR 2019
0
citations

Speech2Face: Learning the Face Behind a Voice

CVPR 2019
0
citations

Neural Inverse Knitting: From Images to Manufacturing Instructions

ICML 2019
0
citations