Tong Zhang

28
Papers
157
Total Citations

Papers (28)

PerceptionGPT: Effectively Fusing Visual Perception into LLM

CVPR 2024arXiv
59
citations

ASGO: Adaptive Structured Gradient Optimization

NeurIPS 2025arXiv
26
citations

AdaGrad under Anisotropic Smoothness

ICLR 2025arXiv
14
citations

MergeBench: A Benchmark for Merging Domain-Specialized LLMs

NeurIPS 2025arXiv
8
citations

Data Augmentation via Latent Diffusion for Saliency Prediction

ECCV 2024arXiv
7
citations

Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise

ICLR 2024arXiv
7
citations

Refining CLIP's Spatial Awareness: A Visual-Centric Perspective

ICLR 2025arXiv
6
citations

Generating Multimodal Driving Scenes via Next-Scene Prediction

CVPR 2025arXiv
6
citations

TagFog: Textual Anchor Guidance and Fake Outlier Generation for Visual Out-of-Distribution Detection

AAAI 2024arXiv
5
citations

FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing

CVPR 2025arXiv
5
citations

Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference

ICLR 2025arXiv
4
citations

Faster Sampling via Stochastic Gradient Proximal Sampler

ICML 2024
3
citations

Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis

ICCV 2025arXiv
3
citations

Controlling the Fidelity and Diversity of Deep Generative Models via Pseudo Density

ICLR 2025arXiv
2
citations

Understanding Overadaptation in Supervised Fine-Tuning: The Role of Ensemble Methods

ICML 2025arXiv
2
citations

Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraint

ICML 2024
0
citations

Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection

CVPR 2025arXiv
0
citations

TimeBooth: Disentangled Facial Invariant Representation for Diverse and Personalized Face Aging

ICCV 2025
0
citations

MatchDiffusion: Training-free Generation of Match-Cuts

ICCV 2025
0
citations

Scene Graph-Grounded Image Generation

AAAI 2025
0
citations

Desigen: A Pipeline for Controllable Design Template Generation

CVPR 2024
0
citations

InNeRF360: Text-Guided 3D-Consistent Object Inpainting on 360-degree Neural Radiance Fields

CVPR 2024
0
citations

DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses

CVPR 2024
0
citations

Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange

CVPR 2024arXiv
0
citations

The Non-linear $F$-Design and Applications to Interactive Learning

ICML 2024
0
citations

Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption

ICML 2024
0
citations

Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning

ICML 2024
0
citations

Scaling Mesh Generation via Compressive Tokenization

CVPR 2025
0
citations