Wei-Shi Zheng
38
Papers
122
Total Citations
Papers (38)
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models
CVPR 2025
33
citations
Dexterous Grasp Transformer
CVPR 2024
19
citations
Single-View Scene Point Cloud Human Grasp Generation
CVPR 2024
13
citations
ViSpeak: Visual Instruction Feedback in Streaming Videos
ICCV 2025
11
citations
Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation
AAAI 2025
10
citations
Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning
AAAI 2024
9
citations
DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation
ECCV 2024
6
citations
Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework
ICCV 2025
6
citations
NECA: Neural Customizable Human Avatar
CVPR 2024
5
citations
DNF-Intrinsic: Deterministic Noise-Free Diffusion for Indoor Inverse Rendering
ICCV 2025arXiv
2
citations
Person De-reidentification: A Variation-guided Identity Shift Modeling
CVPR 2025
2
citations
EntityErasure: Erasing Entity Cleanly via Amodal Entity Segmentation and Completion
CVPR 2025
2
citations
Learning Implicit Features with Flow-Infused Transformations for Realistic Virtual Try-On
ICCV 2025
1
citations
FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection
ICCV 2025
1
citations
Ranking Distillation for Open-Ended Video Question Answering with Insufficient Labels
CVPR 2024
1
citations
Domain Generalizable Portrait Style Transfer
ICCV 2025
1
citations
monoVLN: Bridging the Observation Gap between Monocular and Panoramic Vision and Language Navigation
ICCV 2025
0
citations
Less Static, More Private: Towards Transferable Privacy-Preserving Action Recognition by Generative Decoupled Learning
ICCV 2025
0
citations
AffordDexGrasp: Open-set Language-guided Dexterous Grasp with Generalizable-Instructive Affordance
ICCV 2025
0
citations
Diffusion-based Event Generation for High-Quality Image Deblurring
CVPR 2025
0
citations
Distilling LLM Prior to Flow Model for Generalizable Agent’s Imagination in Object Goal Navigation
NeurIPS 2025
0
citations
MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning
AAAI 2025
0
citations
CLIP-RestoreX: Restore Image Structure and Perception in Exposure Correction
AAAI 2025
0
citations
ParGo: Bridging Vision-Language with Partial and Global Views
AAAI 2025
0
citations
Modeling Multiple Normal Action Representations for Error Detection in Procedural Tasks
CVPR 2025
0
citations
When Shadow Removal Meets Intrinsic Image Decomposition: A Joint Learning Framework Using Unpaired Data
AAAI 2025
0
citations
Panorama Generation From NFoV Image Done Right
CVPR 2025
0
citations
RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images
CVPR 2025
0
citations
ChainHOI: Joint-based Kinematic Chain Modeling for Human-Object Interaction Generation
CVPR 2025
0
citations
Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model
CVPR 2024
0
citations
Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding
CVPR 2024
0
citations
Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training
CVPR 2024
0
citations
Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks
CVPR 2025
0
citations
iManip: Skill-Incremental Learning for Robotic Manipulation
ICCV 2025
0
citations
Efficient and Effective Weakly-Supervised Action Segmentation via Action-Transition-Aware Boundary Alignment
CVPR 2024
0
citations
ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations
ICCV 2025
0
citations
VIPerson: Flexibly Generating Virtual Identity for Person Re-Identification
ICCV 2025
0
citations
Structure-Guided Diffusion Models for High-Fidelity Portrait Shadow Removal
ICCV 2025
0
citations