Hsin-Ying Lee

33
Papers
658
Total Citations

Papers (33)

Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

CVPR 2024
341
citations

VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control

ICLR 2025arXiv
114
citations

RetrieveGAN: Image Synthesis via Differentiable Patch Retrieval

ECCV 2020
56
citations

Exploiting Diffusion Prior for Generalizable Dense Prediction

CVPR 2024
42
citations

SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors

CVPR 2024
40
citations

Controllable Image Synthesis via SegVAE

ECCV 2020
23
citations

4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion

CVPR 2025
18
citations

UpFusion: Novel View Diffusion from Unposed Sparse View Observations

ECCV 2024
12
citations

GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement

ICLR 2025
9
citations

UniPhy: Learning a Unified Constitutive Model for Inverse Physics Simulation

CVPR 2025
3
citations

3DAvatarGAN: Bridging Domains for Personalized Editable Avatars

CVPR 2023arXiv
0
citations

Unsupervised Volumetric Animation

CVPR 2023arXiv
0
citations

Unsupervised Representation Learning by Sorting Sequences

ICCV 2017arXiv
0
citations

ReDAL: Region-Based and Diversity-Aware Active Learning for Point Cloud Semantic Segmentation

ICCV 2021arXiv
0
citations

Text2Tex: Text-driven Texture Synthesis via Diffusion Models

ICCV 2023arXiv
0
citations

InfiniCity: Infinite-Scale City Synthesis

ICCV 2023arXiv
0
citations

Neural Design Network: Graphic Layout Generation with Constraints

ECCV 2020
0
citations

Semantic View Synthesis

ECCV 2020
0
citations

Cross-Modal 3D Shape Generation and Manipulation

ECCV 2022
0
citations

Vector Quantized Image-to-Image Translation

ECCV 2022
0
citations

D2ADA: Dynamic Density-Aware Active Domain Adaptation for Semantic Segmentation

ECCV 2022
0
citations

Make-a-Story: Visual Memory Conditioned Consistent Story Generation

CVPR 2023
0
citations

PrEditor3D: Fast and Precise 3D Shape Editing

CVPR 2025
0
citations

T2Bs: Text-to-Character Blendshapes via Video Generation

ICCV 2025
0
citations

Towards Text-guided 3D Scene Composition

CVPR 2024
0
citations

Soft-Segmentation Guided Object Motion Deblurring

CVPR 2016
0
citations

Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis

CVPR 2019
0
citations

InOut: Diverse Image Outpainting via GAN Inversion

CVPR 2022
0
citations

Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning

CVPR 2022arXiv
0
citations

DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-Aware Scene Synthesis

CVPR 2023arXiv
0
citations

SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation

CVPR 2023arXiv
0
citations

Dancing to Music

NeurIPS 2019
0
citations

Exploring Cross-Video and Cross-Modality Signals for Weakly-Supervised Audio-Visual Video Parsing

NeurIPS 2021
0
citations