Han Zhang

46
Papers
1,436
Total Citations
1
Affiliation

Affiliations

Tsinghua University

Papers (46)

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

ICLR 2024
1,366
citations

CPPO: Continual Learning for Reinforcement Learning with Human Feedback

ICLR 2024
32
citations

Lipschitz Singularities in Diffusion Models

ICLR 2024
21
citations

BatteryML: An Open-source Platform for Machine Learning on Battery Degradation

ICLR 2024
11
citations

BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs

CVPR 2025
3
citations

Correcting Large Language Model Behavior via Influence Function

AAAI 2025
3
citations

BeyondGender: A Multifaceted Bilingual Dataset for Practical Sexism Detection

AAAI 2025
0
citations

CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models

ICML 2024
0
citations

SPDA-CNN: Unifying Semantic Part Detection and Abstraction for Fine-Grained Recognition

CVPR 2016
0
citations

Link the Head to the "Beak": Zero Shot Learning From Noisy Text Description at Part Precision

CVPR 2017 (arXiv)
0
citations

AttnGAN: Fine-Grained Text to Image Generation With Attentional Generative Adversarial Networks

CVPR 2018 (arXiv)
0
citations

Co-Occurrent Features in Semantic Segmentation

CVPR 2019
0
citations

Distilling Effective Supervision From Severe Label Noise

CVPR 2020 (arXiv)
0
citations

Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models

CVPR 2020 (arXiv)
0
citations

Cross-Modal Contrastive Learning for Text-to-Image Generation

CVPR 2021 (arXiv)
0
citations

Learning To Prompt for Continual Learning

CVPR 2022 (arXiv)
0
citations

MAXIM: Multi-Axis MLP for Image Processing

CVPR 2022 (arXiv)
0
citations

MaskGIT: Masked Generative Image Transformer

CVPR 2022 (arXiv)
0
citations

Visual Prompt Tuning for Generative Transfer Learning

CVPR 2023 (arXiv)
0
citations

MAGVIT: Masked Generative Video Transformer

CVPR 2023 (arXiv)
0
citations

MAGE: MAsked Generative Encoder To Unify Representation Learning and Image Synthesis

CVPR 2023 (arXiv)
0
citations

Dimensionality-Varying Diffusion Process

CVPR 2023 (arXiv)
0
citations

Enhanced Training of Query-Based Object Detection via Selective Query Recollection

CVPR 2023 (arXiv)
0
citations

StackGAN: Text to Photo-Realistic Image Synthesis With Stacked Generative Adversarial Networks

ICCV 2017 (arXiv)
0
citations

SVDiff: Compact Parameter Space for Diffusion Fine-Tuning

ICCV 2023 (arXiv)
0
citations

FineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance Generation

ICCV 2023 (arXiv)
0
citations

VQ3D: Learning a 3D-Aware Generative Model on ImageNet

ICCV 2023 (arXiv)
0
citations

Unitail: Detecting, Reading, and Matching in Retail Scene

ECCV 2022
0
citations

BLT: Bidirectional Layout Transformer for Controllable Layout Generation

ECCV 2022
0
citations

MaxViT: Multi-axis Vision Transformer

ECCV 2022
0
citations

DualPrompt: Complementary Prompting for Rehearsal-Free Continual Learning

ECCV 2022
0
citations

Learning Instance-Specific Adaptation for Cross-Domain Segmentation

ECCV 2022
0
citations

Lane Detection Transformer Based on Multi-Frame Horizontal and Vertical Attention and Visual Transformer Module

ECCV 2022
0
citations

MUC: Mixture of Uncalibrated Cameras for Robust 3D Human Body Reconstruction

AAAI 2025
0
citations

Accelerating Diffusion Sampling via Exploiting Local Transition Coherence

ICCV 2025 (arXiv)
0
citations

Understanding the Generalization of Stochastic Gradient Adam in Learning Neural Networks

NeurIPS 2025
0
citations

MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement

AAAI 2025
0
citations

MITracker: Multi-View Integration for Visual Object Tracking

CVPR 2025
0
citations

Inheriting Generalized Learngene for Efficient Knowledge Transfer across Multiple Tasks

AAAI 2025
0
citations

FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence

NeurIPS 2020
0
citations

Improved Transformer for High-Resolution GANs

NeurIPS 2021
0
citations

GLOBEM Dataset: Multi-Year Datasets for Longitudinal Human Behavior Modeling Generalization

NeurIPS 2022
0
citations

Decision Tree for Locally Private Estimation with Public Data

NeurIPS 2023
0
citations

StoryBench: A Multifaceted Benchmark for Continuous Story Visualization

NeurIPS 2023
0
citations

Diversify Your Vision Datasets with Automatic Diffusion-based Augmentation

NeurIPS 2023
0
citations

Self-Attention Generative Adversarial Networks

ICML 2019
0
citations