Wei-Shi Zheng

38
Papers
122
Total Citations

Papers (38)

LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models

CVPR 2025
33
citations

Dexterous Grasp Transformer

CVPR 2024
19
citations

Single-View Scene Point Cloud Human Grasp Generation

CVPR 2024
13
citations

ViSpeak: Visual Instruction Feedback in Streaming Videos

ICCV 2025
11
citations

Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation

AAAI 2025
10
citations

Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning

AAAI 2024
9
citations

DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation

ECCV 2024
6
citations

Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework

ICCV 2025
6
citations

NECA: Neural Customizable Human Avatar

CVPR 2024
5
citations

DNF-Intrinsic: Deterministic Noise-Free Diffusion for Indoor Inverse Rendering

ICCV 2025arXiv
2
citations

Person De-reidentification: A Variation-guided Identity Shift Modeling

CVPR 2025
2
citations

EntityErasure: Erasing Entity Cleanly via Amodal Entity Segmentation and Completion

CVPR 2025
2
citations

Learning Implicit Features with Flow-Infused Transformations for Realistic Virtual Try-On

ICCV 2025
1
citations

FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection

ICCV 2025
1
citations

Ranking Distillation for Open-Ended Video Question Answering with Insufficient Labels

CVPR 2024
1
citations

Domain Generalizable Portrait Style Transfer

ICCV 2025
1
citations

monoVLN: Bridging the Observation Gap between Monocular and Panoramic Vision and Language Navigation

ICCV 2025
0
citations

Less Static, More Private: Towards Transferable Privacy-Preserving Action Recognition by Generative Decoupled Learning

ICCV 2025
0
citations

AffordDexGrasp: Open-set Language-guided Dexterous Grasp with Generalizable-Instructive Affordance

ICCV 2025
0
citations

Diffusion-based Event Generation for High-Quality Image Deblurring

CVPR 2025
0
citations

Distilling LLM Prior to Flow Model for Generalizable Agent’s Imagination in Object Goal Navigation

NeurIPS 2025
0
citations

MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning

AAAI 2025
0
citations

CLIP-RestoreX: Restore Image Structure and Perception in Exposure Correction

AAAI 2025
0
citations

ParGo: Bridging Vision-Language with Partial and Global Views

AAAI 2025
0
citations

Modeling Multiple Normal Action Representations for Error Detection in Procedural Tasks

CVPR 2025
0
citations

When Shadow Removal Meets Intrinsic Image Decomposition: A Joint Learning Framework Using Unpaired Data

AAAI 2025
0
citations

Panorama Generation From NFoV Image Done Right

CVPR 2025
0
citations

RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images

CVPR 2025
0
citations

ChainHOI: Joint-based Kinematic Chain Modeling for Human-Object Interaction Generation

CVPR 2025
0
citations

Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model

CVPR 2024
0
citations

Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding

CVPR 2024
0
citations

Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training

CVPR 2024
0
citations

Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks

CVPR 2025
0
citations

iManip: Skill-Incremental Learning for Robotic Manipulation

ICCV 2025
0
citations

Efficient and Effective Weakly-Supervised Action Segmentation via Action-Transition-Aware Boundary Alignment

CVPR 2024
0
citations

ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations

ICCV 2025
0
citations

VIPerson: Flexibly Generating Virtual Identity for Person Re-Identification

ICCV 2025
0
citations

Structure-Guided Diffusion Models for High-Fidelity Portrait Shadow Removal

ICCV 2025
0
citations