Kwan-Yee K. Wong

Papers: 21

Total Citations: 45

Papers (21)

PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis

DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving

Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

ArtiFade: Learning to Generate High-quality Subject from Blemished Images

Dual-Expert Consistency Model for Efficient and High-Quality Video Generation

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Affordances-Oriented Planning Using Foundation Models for Continuous Vision-Language Navigation

DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models

Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension

Progressive Semantic-Aware Style Transformation for Blind Face Restoration

Blind Image Super-Resolution With Elaborate Degradation Modeling on Noise and Kernel

JIFF: Jointly-Aligned Implicit Face Function for High Quality Single View Clothed Human Reconstruction

Learning Attention As Disentangler for Compositional Zero-Shot Learning

SeSDF: Self-Evolved Signed Distance Field for Implicit 3D Clothed Human Reconstruction

HDR Video Reconstruction: A Coarse-To-Fine Network and a Real-World Benchmark Dataset

RIGID: Recurrent GAN Inversion and Editing of Real Face Videos

What is Learned in Deep Uncalibrated Photometric Stereo?

PS-NeRF: Neural Inverse Rendering for Multi-View Photometric Stereo

S$^3$-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint

NeurIPS 2022, arXiv

HeadSculpt: Crafting 3D Head Avatars with Text

NeurIPS 2023, arXiv

Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

NeurIPS 2023, arXiv