Kwan-Yee K. Wong

21 Papers
45 Total Citations

Papers (21)

PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis

CVPR 2024, arXiv
23 citations

DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving

CVPR 2025
17 citations

Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

ICCV 2025
5 citations

ArtiFade: Learning to Generate High-quality Subject from Blemished Images

CVPR 2025, arXiv
0 citations

Dual-Expert Consistency Model for Efficient and High-Quality Video Generation

ICCV 2025
0 citations

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

ICLR 2025, arXiv
0 citations

Affordances-Oriented Planning Using Foundation Models for Continuous Vision-Language Navigation

AAAI 2025
0 citations

DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models

CVPR 2024, arXiv
0 citations

Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension

CVPR 2020
0 citations

Progressive Semantic-Aware Style Transformation for Blind Face Restoration

CVPR 2021, arXiv
0 citations

Blind Image Super-Resolution With Elaborate Degradation Modeling on Noise and Kernel

CVPR 2022, arXiv
0 citations

JIFF: Jointly-Aligned Implicit Face Function for High Quality Single View Clothed Human Reconstruction

CVPR 2022, arXiv
0 citations

Learning Attention As Disentangler for Compositional Zero-Shot Learning

CVPR 2023, arXiv
0 citations

SeSDF: Self-Evolved Signed Distance Field for Implicit 3D Clothed Human Reconstruction

CVPR 2023, arXiv
0 citations

HDR Video Reconstruction: A Coarse-To-Fine Network and a Real-World Benchmark Dataset

ICCV 2021, arXiv
0 citations

RIGID: Recurrent GAN Inversion and Editing of Real Face Videos

ICCV 2023, arXiv
0 citations

What is Learned in Deep Uncalibrated Photometric Stereo?

ECCV 2020
0 citations

PS-NeRF: Neural Inverse Rendering for Multi-View Photometric Stereo

ECCV 2022
0 citations

S$^3$-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint

NeurIPS 2022, arXiv
0 citations

HeadSculpt: Crafting 3D Head Avatars with Text

NeurIPS 2023, arXiv
0 citations

Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

NeurIPS 2023, arXiv
0 citations