Wei-Shi Zheng
93
Papers
122
Total Citations
Papers (93)
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models
CVPR 2025
33
citations
Dexterous Grasp Transformer
CVPR 2024
19
citations
Single-View Scene Point Cloud Human Grasp Generation
CVPR 2024
13
citations
ViSpeak: Visual Instruction Feedback in Streaming Videos
ICCV 2025
11
citations
Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation
AAAI 2025
10
citations
Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning
AAAI 2024
9
citations
DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation
ECCV 2024
6
citations
Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework
ICCV 2025
6
citations
NECA: Neural Customizable Human Avatar
CVPR 2024
5
citations
Person De-reidentification: A Variation-guided Identity Shift Modeling
CVPR 2025
2
citations
DNF-Intrinsic: Deterministic Noise-Free Diffusion for Indoor Inverse Rendering
ICCV 2025arXiv
2
citations
EntityErasure: Erasing Entity Cleanly via Amodal Entity Segmentation and Completion
CVPR 2025
2
citations
FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection
ICCV 2025
1
citations
Domain Generalizable Portrait Style Transfer
ICCV 2025
1
citations
Learning Implicit Features with Flow-Infused Transformations for Realistic Virtual Try-On
ICCV 2025
1
citations
Ranking Distillation for Open-Ended Video Question Answering with Insufficient Labels
CVPR 2024
1
citations
CLIP-RestoreX: Restore Image Structure and Perception in Exposure Correction
AAAI 2025
0
citations
ParGo: Bridging Vision-Language with Partial and Global Views
AAAI 2025
0
citations
When Shadow Removal Meets Intrinsic Image Decomposition: A Joint Learning Framework Using Unpaired Data
AAAI 2025
0
citations
Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model
CVPR 2024
0
citations
Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding
CVPR 2024
0
citations
Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training
CVPR 2024
0
citations
Efficient and Effective Weakly-Supervised Action Segmentation via Action-Transition-Aware Boundary Alignment
CVPR 2024
0
citations
Jointly Learning Heterogeneous Features for RGB-D Activity Recognition
CVPR 2015
0
citations
Top-Push Video-Based Person Re-Identification
CVPR 2016
0
citations
A Matrix Splitting Method for Composite Function Minimization
CVPR 2017arXiv
0
citations
Weakly Supervised Person Re-Identification
CVPR 2019
0
citations
Distilled Person Re-Identification: Towards a More Scalable System
CVPR 2019
0
citations
Unsupervised Person Re-Identification by Soft Multilabel Learning
CVPR 2019
0
citations
Progressive Teacher-Student Learning for Early Action Prediction
CVPR 2019
0
citations
Patch-Based Discriminative Feature Learning for Unsupervised Person Re-Identification
CVPR 2019
0
citations
Learning to Learn Relation for Important People Detection in Still Images
CVPR 2019
0
citations
Weakly Supervised Open-Set Domain Adaptation by Dual-Domain Collaboration
CVPR 2019
0
citations
A Decomposition Algorithm for the Sparse Generalized Eigenvalue Problem
CVPR 2019
0
citations
Underexposed Photo Enhancement Using Deep Illumination Estimation
CVPR 2019
0
citations
Deep Dual Relation Modeling for Egocentric Interaction Recognition
CVPR 2019
0
citations
Learning to Detect Important People in Unlabelled Images for Semi-Supervised Important People Detection
CVPR 2020arXiv
0
citations
Adaptive Interaction Modeling via Graph Operations Search
CVPR 2020arXiv
0
citations
Spatial-Temporal Graph Convolutional Network for Video-Based Person Re-Identification
CVPR 2020
0
citations
Weakly Supervised Discriminative Feature Learning With State Information for Person Identification
CVPR 2020arXiv
0
citations
Squeeze-and-Attention Networks for Semantic Segmentation
CVPR 2020arXiv
0
citations
MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection
CVPR 2021arXiv
0
citations
Graph-Based High-Order Relation Modeling for Long-Term Action Recognition
CVPR 2021
0
citations
Combined Depth Space Based Architecture Search for Person Re-Identification
CVPR 2021arXiv
0
citations
Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification
CVPR 2021
0
citations
Fine-Grained Shape-Appearance Mutual Learning for Cloth-Changing Person Re-Identification
CVPR 2021
0
citations
SIOD: Single Instance Annotated per Category per Image for Object Detection
CVPR 2022arXiv
0
citations
Learning To Imagine: Diversify Memory for Incremental Learning Using Unlabeled Data
CVPR 2022arXiv
0
citations
Likert Scoring With Grade Decoupling for Long-Term Action Assessment
CVPR 2022
0
citations
Hierarchical Semantic Correspondence Networks for Video Paragraph Grounding
CVPR 2023
0
citations
Shape-Erased Feature Learning for Visible-Infrared Person Re-Identification
CVPR 2023arXiv
0
citations
Collaborative Static and Dynamic Vision-Language Streams for Spatio-Temporal Video Grounding
CVPR 2023
0
citations
Generating Anomalies for Video Anomaly Detection With Prompt-Based Feature Mapping
CVPR 2023
0
citations
AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object Detection
CVPR 2023
0
citations
Multi-Scale Learning for Low-Resolution Person Re-Identification
ICCV 2015
0
citations
Cross-View Asymmetric Metric Learning for Unsupervised Person Re-Identification
ICCV 2017arXiv
0
citations
RGB-Infrared Cross-Modality Person Re-Identification
ICCV 2017
0
citations
Action Assessment by Joint Relation Graphs
ICCV 2019
0
citations
Unsupervised Person Re-Identification by Camera-Aware Similarity Consistency Learning
ICCV 2019
0
citations
Learning To Know Where To See: A Visibility-Aware Approach for Occluded Person Re-Identification
ICCV 2021
0
citations
Predictive Feature Learning for Future Segmentation Prediction
ICCV 2021
0
citations
Weakly Supervised Text-Based Person Re-Identification
ICCV 2021
0
citations
Estimator Meets Equilibrium Perspective: A Rectified Straight Through Estimator for Binary Neural Networks Training
ICCV 2023arXiv
0
citations
ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation
ICCV 2023arXiv
0
citations
Event-Guided Procedure Planning from Instructional Videos with Text Supervision
ICCV 2023arXiv
0
citations
Revisit PCA-based Technique for Out-of-Distribution Detection
ICCV 2023
0
citations
When Prompt-based Incremental Learning Does Not Meet Strong Pretraining
ICCV 2023arXiv
0
citations
Do Not Disturb Me: Person Re-identification Under the Interference of Other Pedestrians
ECCV 2020
0
citations
MINI-Net: Multiple Instance Ranking Network for Video Highlight Detection
ECCV 2020
0
citations
An Asymmetric Modeling for Action Assessment
ECCV 2020
0
citations
Adversarial Partial Domain Adaptation by Cycle Inconsistency
ECCV 2022
0
citations
AcroFOD: An Adaptive Method for Cross-Domain Few-Shot Object Detection
ECCV 2022
0
citations
Partial Person Re-Identification
ICCV 2015
0
citations
Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks
CVPR 2025
0
citations
ChainHOI: Joint-based Kinematic Chain Modeling for Human-Object Interaction Generation
CVPR 2025
0
citations
RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images
CVPR 2025
0
citations
Panorama Generation From NFoV Image Done Right
CVPR 2025
0
citations
Modeling Multiple Normal Action Representations for Error Detection in Procedural Tasks
CVPR 2025
0
citations
Diffusion-based Event Generation for High-Quality Image Deblurring
CVPR 2025
0
citations
AffordDexGrasp: Open-set Language-guided Dexterous Grasp with Generalizable-Instructive Affordance
ICCV 2025
0
citations
Less Static, More Private: Towards Transferable Privacy-Preserving Action Recognition by Generative Decoupled Learning
ICCV 2025
0
citations
iManip: Skill-Incremental Learning for Robotic Manipulation
ICCV 2025
0
citations
ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations
ICCV 2025
0
citations
VIPerson: Flexibly Generating Virtual Identity for Person Re-Identification
ICCV 2025
0
citations
Structure-Guided Diffusion Models for High-Fidelity Portrait Shadow Removal
ICCV 2025
0
citations
monoVLN: Bridging the Observation Gap between Monocular and Panoramic Vision and Language Navigation
ICCV 2025
0
citations
Distilling LLM Prior to Flow Model for Generalizable Agent’s Imagination in Object Goal Navigation
NeurIPS 2025
0
citations
MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning
AAAI 2025
0
citations
Action-guided 3D Human Motion Prediction
NeurIPS 2021
0
citations
Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval
NeurIPS 2022
0
citations
Inner-Outer Aware Reconstruction Model for Monocular 3D Scene Reconstruction
NeurIPS 2023
0
citations
Diversifying Spatial-Temporal Perception for Video Domain Generalization
NeurIPS 2023
0
citations
Temporal Continual Learning with Prior Compensation for Human Motion Prediction
NeurIPS 2023
0
citations