Bin Zhu
8
Papers
363
Total Citations
Papers (8)
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
ICLR 2024
343
citations
DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses
ICCV 2025arXiv
12
citations
Hand1000: Generating Realistic Hands from Text with Only 1,000 Images
AAAI 2025
7
citations
From Holistic to Localized: Local Enhanced Adapters for Efficient Visual Instruction Fine-Tuning
ICCV 2025
1
citations
Intersecting-Boundary-Sensitive Fingerprinting for Tampering Detection of DNN Models
ICML 2024
0
citations
PolarNeXt: Rethink Instance Segmentation with Polar Representation
CVPR 2025
0
citations
RAGG: Retrieval-Augmented Grasp Generation Model
AAAI 2025
0
citations
HD-EPIC: A Highly-Detailed Egocentric Video Dataset
CVPR 2025
0
citations