Wei-Shi Zheng

93
Papers
122
Total Citations

Papers (93)

LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models

CVPR 2025
33
citations

Dexterous Grasp Transformer

CVPR 2024
19
citations

Single-View Scene Point Cloud Human Grasp Generation

CVPR 2024
13
citations

ViSpeak: Visual Instruction Feedback in Streaming Videos

ICCV 2025
11
citations

Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation

AAAI 2025
10
citations

Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning

AAAI 2024
9
citations

DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation

ECCV 2024
6
citations

Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework

ICCV 2025
6
citations

NECA: Neural Customizable Human Avatar

CVPR 2024
5
citations

Person De-reidentification: A Variation-guided Identity Shift Modeling

CVPR 2025
2
citations

DNF-Intrinsic: Deterministic Noise-Free Diffusion for Indoor Inverse Rendering

ICCV 2025arXiv
2
citations

EntityErasure: Erasing Entity Cleanly via Amodal Entity Segmentation and Completion

CVPR 2025
2
citations

FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection

ICCV 2025
1
citations

Domain Generalizable Portrait Style Transfer

ICCV 2025
1
citations

Learning Implicit Features with Flow-Infused Transformations for Realistic Virtual Try-On

ICCV 2025
1
citations

Ranking Distillation for Open-Ended Video Question Answering with Insufficient Labels

CVPR 2024
1
citations

CLIP-RestoreX: Restore Image Structure and Perception in Exposure Correction

AAAI 2025
0
citations

ParGo: Bridging Vision-Language with Partial and Global Views

AAAI 2025
0
citations

When Shadow Removal Meets Intrinsic Image Decomposition: A Joint Learning Framework Using Unpaired Data

AAAI 2025
0
citations

Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model

CVPR 2024
0
citations

Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding

CVPR 2024
0
citations

Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training

CVPR 2024
0
citations

Efficient and Effective Weakly-Supervised Action Segmentation via Action-Transition-Aware Boundary Alignment

CVPR 2024
0
citations

Jointly Learning Heterogeneous Features for RGB-D Activity Recognition

CVPR 2015
0
citations

Top-Push Video-Based Person Re-Identification

CVPR 2016
0
citations

A Matrix Splitting Method for Composite Function Minimization

CVPR 2017arXiv
0
citations

Weakly Supervised Person Re-Identification

CVPR 2019
0
citations

Distilled Person Re-Identification: Towards a More Scalable System

CVPR 2019
0
citations

Unsupervised Person Re-Identification by Soft Multilabel Learning

CVPR 2019
0
citations

Progressive Teacher-Student Learning for Early Action Prediction

CVPR 2019
0
citations

Patch-Based Discriminative Feature Learning for Unsupervised Person Re-Identification

CVPR 2019
0
citations

Learning to Learn Relation for Important People Detection in Still Images

CVPR 2019
0
citations

Weakly Supervised Open-Set Domain Adaptation by Dual-Domain Collaboration

CVPR 2019
0
citations

A Decomposition Algorithm for the Sparse Generalized Eigenvalue Problem

CVPR 2019
0
citations

Underexposed Photo Enhancement Using Deep Illumination Estimation

CVPR 2019
0
citations

Deep Dual Relation Modeling for Egocentric Interaction Recognition

CVPR 2019
0
citations

Learning to Detect Important People in Unlabelled Images for Semi-Supervised Important People Detection

CVPR 2020arXiv
0
citations

Adaptive Interaction Modeling via Graph Operations Search

CVPR 2020arXiv
0
citations

Spatial-Temporal Graph Convolutional Network for Video-Based Person Re-Identification

CVPR 2020
0
citations

Weakly Supervised Discriminative Feature Learning With State Information for Person Identification

CVPR 2020arXiv
0
citations

Squeeze-and-Attention Networks for Semantic Segmentation

CVPR 2020arXiv
0
citations

MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection

CVPR 2021arXiv
0
citations

Graph-Based High-Order Relation Modeling for Long-Term Action Recognition

CVPR 2021
0
citations

Combined Depth Space Based Architecture Search for Person Re-Identification

CVPR 2021arXiv
0
citations

Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification

CVPR 2021
0
citations

Fine-Grained Shape-Appearance Mutual Learning for Cloth-Changing Person Re-Identification

CVPR 2021
0
citations

SIOD: Single Instance Annotated per Category per Image for Object Detection

CVPR 2022arXiv
0
citations

Learning To Imagine: Diversify Memory for Incremental Learning Using Unlabeled Data

CVPR 2022arXiv
0
citations

Likert Scoring With Grade Decoupling for Long-Term Action Assessment

CVPR 2022
0
citations

Hierarchical Semantic Correspondence Networks for Video Paragraph Grounding

CVPR 2023
0
citations

Shape-Erased Feature Learning for Visible-Infrared Person Re-Identification

CVPR 2023arXiv
0
citations

Collaborative Static and Dynamic Vision-Language Streams for Spatio-Temporal Video Grounding

CVPR 2023
0
citations

Generating Anomalies for Video Anomaly Detection With Prompt-Based Feature Mapping

CVPR 2023
0
citations

AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object Detection

CVPR 2023
0
citations

Multi-Scale Learning for Low-Resolution Person Re-Identification

ICCV 2015
0
citations

Cross-View Asymmetric Metric Learning for Unsupervised Person Re-Identification

ICCV 2017arXiv
0
citations

RGB-Infrared Cross-Modality Person Re-Identification

ICCV 2017
0
citations

Action Assessment by Joint Relation Graphs

ICCV 2019
0
citations

Unsupervised Person Re-Identification by Camera-Aware Similarity Consistency Learning

ICCV 2019
0
citations

Learning To Know Where To See: A Visibility-Aware Approach for Occluded Person Re-Identification

ICCV 2021
0
citations

Predictive Feature Learning for Future Segmentation Prediction

ICCV 2021
0
citations

Weakly Supervised Text-Based Person Re-Identification

ICCV 2021
0
citations

Estimator Meets Equilibrium Perspective: A Rectified Straight Through Estimator for Binary Neural Networks Training

ICCV 2023arXiv
0
citations

ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation

ICCV 2023arXiv
0
citations

Event-Guided Procedure Planning from Instructional Videos with Text Supervision

ICCV 2023arXiv
0
citations

Revisit PCA-based Technique for Out-of-Distribution Detection

ICCV 2023
0
citations

When Prompt-based Incremental Learning Does Not Meet Strong Pretraining

ICCV 2023arXiv
0
citations

Do Not Disturb Me: Person Re-identification Under the Interference of Other Pedestrians

ECCV 2020
0
citations

MINI-Net: Multiple Instance Ranking Network for Video Highlight Detection

ECCV 2020
0
citations

An Asymmetric Modeling for Action Assessment

ECCV 2020
0
citations

Adversarial Partial Domain Adaptation by Cycle Inconsistency

ECCV 2022
0
citations

AcroFOD: An Adaptive Method for Cross-Domain Few-Shot Object Detection

ECCV 2022
0
citations

Partial Person Re-Identification

ICCV 2015
0
citations

Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks

CVPR 2025
0
citations

ChainHOI: Joint-based Kinematic Chain Modeling for Human-Object Interaction Generation

CVPR 2025
0
citations

RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images

CVPR 2025
0
citations

Panorama Generation From NFoV Image Done Right

CVPR 2025
0
citations

Modeling Multiple Normal Action Representations for Error Detection in Procedural Tasks

CVPR 2025
0
citations

Diffusion-based Event Generation for High-Quality Image Deblurring

CVPR 2025
0
citations

AffordDexGrasp: Open-set Language-guided Dexterous Grasp with Generalizable-Instructive Affordance

ICCV 2025
0
citations

Less Static, More Private: Towards Transferable Privacy-Preserving Action Recognition by Generative Decoupled Learning

ICCV 2025
0
citations

iManip: Skill-Incremental Learning for Robotic Manipulation

ICCV 2025
0
citations

ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations

ICCV 2025
0
citations

VIPerson: Flexibly Generating Virtual Identity for Person Re-Identification

ICCV 2025
0
citations

Structure-Guided Diffusion Models for High-Fidelity Portrait Shadow Removal

ICCV 2025
0
citations

monoVLN: Bridging the Observation Gap between Monocular and Panoramic Vision and Language Navigation

ICCV 2025
0
citations

Distilling LLM Prior to Flow Model for Generalizable Agent’s Imagination in Object Goal Navigation

NeurIPS 2025
0
citations

MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning

AAAI 2025
0
citations

Action-guided 3D Human Motion Prediction

NeurIPS 2021
0
citations

Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval

NeurIPS 2022
0
citations

Inner-Outer Aware Reconstruction Model for Monocular 3D Scene Reconstruction

NeurIPS 2023
0
citations

Diversifying Spatial-Temporal Perception for Video Domain Generalization

NeurIPS 2023
0
citations

Temporal Continual Learning with Prior Compensation for Human Motion Prediction

NeurIPS 2023
0
citations