Jun Zhang

29
Papers
322
Total Citations
1
Affiliations

Affiliations

Zhejiang University

Papers (29)

Generalized Predictive Model for Autonomous Driving

CVPR 2024
122
citations

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Decoupled Video Diffusion

ICCV 2025
103
citations

Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models

AAAI 2025
62
citations

MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes

ICCV 2025
21
citations

Task-Aware Encoder Control for Deep Video Compression

CVPR 2024
8
citations

FloE: On-the-Fly MoE Inference on Memory-constrained GPU

ICML 2025
3
citations

FinMMR: Make Financial Numerical Reasoning More Multimodal, Comprehensive, and Challenging

ICCV 2025arXiv
2
citations

Learn How to Query from Unlabeled Data Streams in Federated Learning

AAAI 2025
1
citations

Predicting Lymph Node Metastasis Using Histopathological Images Based on Multiple Instance Learning With Deep Graph Convolution

CVPR 2020
0
citations

Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification

CVPR 2021
0
citations

Node-Aligned Graph Convolutional Network for Whole-Slide Image Representation and Classification

CVPR 2022
0
citations

Generalized Relation Modeling for Transformer Tracking

CVPR 2023arXiv
0
citations

Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval With Partial Query

ICCV 2021
0
citations

Attentional Pyramid Pooling of Salient Visual Residuals for Place Recognition

ICCV 2021
0
citations

Do Not Disturb Me: Person Re-identification Under the Interference of Other Pedestrians

ECCV 2020
0
citations

GATCluster: Self-Supervised Gaussian-Attention Network for Image Clustering

ECCV 2020
0
citations

Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning

ICML 2024
0
citations

p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay

ICCV 2025
0
citations

Semi-Supervised Clustering Framework for Fine-grained Scene Graph Generation

AAAI 2025
0
citations

CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression

AAAI 2025
0
citations

On the Convergence of an Adaptive Momentum Method for Adversarial Attacks

AAAI 2024
0
citations

TransLoc4D: Transformer-based 4D Radar Place Recognition

CVPR 2024
0
citations

Boosting Neural Representations for Videos with a Conditional Decoder

CVPR 2024
0
citations

Training-Free Long-Context Scaling of Large Language Models

ICML 2024
0
citations

DReS-FL: Dropout-Resilient Secure Federated Learning for Non-IID Clients via Secret Data Sharing

NeurIPS 2022
0
citations

Multi-dataset Training of Transformers for Robust Action Recognition

NeurIPS 2022
0
citations

SCL-WC: Cross-Slide Contrastive Learning for Weakly-Supervised Whole-Slide Image Classification

NeurIPS 2022
0
citations

PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining

NeurIPS 2022
0
citations

Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval

NeurIPS 2022
0
citations