Tong Zhang
28
Papers
157
Total Citations
Papers (28)
PerceptionGPT: Effectively Fusing Visual Perception into LLM
CVPR 2024arXiv
59
citations
ASGO: Adaptive Structured Gradient Optimization
NeurIPS 2025arXiv
26
citations
AdaGrad under Anisotropic Smoothness
ICLR 2025arXiv
14
citations
MergeBench: A Benchmark for Merging Domain-Specialized LLMs
NeurIPS 2025arXiv
8
citations
Data Augmentation via Latent Diffusion for Saliency Prediction
ECCV 2024arXiv
7
citations
Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise
ICLR 2024arXiv
7
citations
Refining CLIP's Spatial Awareness: A Visual-Centric Perspective
ICLR 2025arXiv
6
citations
Generating Multimodal Driving Scenes via Next-Scene Prediction
CVPR 2025arXiv
6
citations
TagFog: Textual Anchor Guidance and Fake Outlier Generation for Visual Out-of-Distribution Detection
AAAI 2024arXiv
5
citations
FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing
CVPR 2025arXiv
5
citations
Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference
ICLR 2025arXiv
4
citations
Faster Sampling via Stochastic Gradient Proximal Sampler
ICML 2024
3
citations
Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis
ICCV 2025arXiv
3
citations
Controlling the Fidelity and Diversity of Deep Generative Models via Pseudo Density
ICLR 2025arXiv
2
citations
Understanding Overadaptation in Supervised Fine-Tuning: The Role of Ensemble Methods
ICML 2025arXiv
2
citations
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraint
ICML 2024
0
citations
Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection
CVPR 2025arXiv
0
citations
TimeBooth: Disentangled Facial Invariant Representation for Diverse and Personalized Face Aging
ICCV 2025
0
citations
MatchDiffusion: Training-free Generation of Match-Cuts
ICCV 2025
0
citations
Scene Graph-Grounded Image Generation
AAAI 2025
0
citations
Desigen: A Pipeline for Controllable Design Template Generation
CVPR 2024
0
citations
InNeRF360: Text-Guided 3D-Consistent Object Inpainting on 360-degree Neural Radiance Fields
CVPR 2024
0
citations
DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses
CVPR 2024
0
citations
Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange
CVPR 2024arXiv
0
citations
The Non-linear $F$-Design and Applications to Interactive Learning
ICML 2024
0
citations
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption
ICML 2024
0
citations
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
ICML 2024
0
citations
Scaling Mesh Generation via Compressive Tokenization
CVPR 2025
0
citations