Kwan-Yee K. Wong
21 Papers
45 Total Citations
Papers (21)
PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis
CVPR 2024 (arXiv)
23 citations
DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving
CVPR 2025
17 citations
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
ICCV 2025
5 citations
ArtiFade: Learning to Generate High-quality Subject from Blemished Images
CVPR 2025 (arXiv)
0 citations
Dual-Expert Consistency Model for Efficient and High-Quality Video Generation
ICCV 2025
0 citations
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality
ICLR 2025 (arXiv)
0 citations
Affordances-Oriented Planning Using Foundation Models for Continuous Vision-Language Navigation
AAAI 2025
0 citations
DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models
CVPR 2024 (arXiv)
0 citations
Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension
CVPR 2020
0 citations
Progressive Semantic-Aware Style Transformation for Blind Face Restoration
CVPR 2021 (arXiv)
0 citations
Blind Image Super-Resolution With Elaborate Degradation Modeling on Noise and Kernel
CVPR 2022 (arXiv)
0 citations
JIFF: Jointly-Aligned Implicit Face Function for High Quality Single View Clothed Human Reconstruction
CVPR 2022 (arXiv)
0 citations
Learning Attention As Disentangler for Compositional Zero-Shot Learning
CVPR 2023 (arXiv)
0 citations
SeSDF: Self-Evolved Signed Distance Field for Implicit 3D Clothed Human Reconstruction
CVPR 2023 (arXiv)
0 citations
HDR Video Reconstruction: A Coarse-To-Fine Network and a Real-World Benchmark Dataset
ICCV 2021 (arXiv)
0 citations
RIGID: Recurrent GAN Inversion and Editing of Real Face Videos
ICCV 2023 (arXiv)
0 citations
What is Learned in Deep Uncalibrated Photometric Stereo?
ECCV 2020
0 citations
PS-NeRF: Neural Inverse Rendering for Multi-View Photometric Stereo
ECCV 2022
0 citations
S$^3$-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint
NeurIPS 2022 (arXiv)
0 citations
HeadSculpt: Crafting 3D Head Avatars with Text
NeurIPS 2023 (arXiv)
0 citations
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
NeurIPS 2023 (arXiv)
0 citations