Shu Zhang
11
Papers
356
Total Citations
Papers (11)
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
CVPR 2024
192
citations
HIVE: Harnessing Human Feedback for Instructional Visual Editing
CVPR 2024
164
citations
Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering
CVPR 2019
0
citations
Deep Homography Estimation for Dynamic Scenes
CVPR 2020arXiv
0
citations
Use All the Labels: A Hierarchical Multi-Label Contrastive Learning Framework
CVPR 2022arXiv
0
citations
Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis
ICCV 2017arXiv
0
citations
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
ICCV 2023
0
citations
MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents
AAAI 2025
0
citations
Towards Rich Feature Discovery With Class Activation Maps Augmentation for Person Re-Identification
CVPR 2019
0
citations
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
NeurIPS 2023
0
citations
Fairness-guided Few-shot Prompting for Large Language Models
NeurIPS 2023
0
citations