Jun Zhang
29
Papers
322
Total Citations
1
Affiliations
Affiliations
Zhejiang University
Papers (29)
Generalized Predictive Model for Autonomous Driving
CVPR 2024
122
citations
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Decoupled Video Diffusion
ICCV 2025
103
citations
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
AAAI 2025
62
citations
MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes
ICCV 2025
21
citations
Task-Aware Encoder Control for Deep Video Compression
CVPR 2024
8
citations
FloE: On-the-Fly MoE Inference on Memory-constrained GPU
ICML 2025
3
citations
FinMMR: Make Financial Numerical Reasoning More Multimodal, Comprehensive, and Challenging
ICCV 2025 (arXiv)
2
citations
Learn How to Query from Unlabeled Data Streams in Federated Learning
AAAI 2025
1
citations
Predicting Lymph Node Metastasis Using Histopathological Images Based on Multiple Instance Learning With Deep Graph Convolution
CVPR 2020
0
citations
Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification
CVPR 2021
0
citations
Node-Aligned Graph Convolutional Network for Whole-Slide Image Representation and Classification
CVPR 2022
0
citations
Generalized Relation Modeling for Transformer Tracking
CVPR 2023 (arXiv)
0
citations
Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval With Partial Query
ICCV 2021
0
citations
Attentional Pyramid Pooling of Salient Visual Residuals for Place Recognition
ICCV 2021
0
citations
Do Not Disturb Me: Person Re-identification Under the Interference of Other Pedestrians
ECCV 2020
0
citations
GATCluster: Self-Supervised Gaussian-Attention Network for Image Clustering
ECCV 2020
0
citations
Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning
ICML 2024
0
citations
p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
ICCV 2025
0
citations
Semi-Supervised Clustering Framework for Fine-grained Scene Graph Generation
AAAI 2025
0
citations
CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression
AAAI 2025
0
citations
On the Convergence of an Adaptive Momentum Method for Adversarial Attacks
AAAI 2024
0
citations
TransLoc4D: Transformer-based 4D Radar Place Recognition
CVPR 2024
0
citations
Boosting Neural Representations for Videos with a Conditional Decoder
CVPR 2024
0
citations
Training-Free Long-Context Scaling of Large Language Models
ICML 2024
0
citations
DReS-FL: Dropout-Resilient Secure Federated Learning for Non-IID Clients via Secret Data Sharing
NeurIPS 2022
0
citations
Multi-dataset Training of Transformers for Robust Action Recognition
NeurIPS 2022
0
citations
SCL-WC: Cross-Slide Contrastive Learning for Weakly-Supervised Whole-Slide Image Classification
NeurIPS 2022
0
citations
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining
NeurIPS 2022
0
citations
Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval
NeurIPS 2022
0
citations