Han Zhang
46
Papers
1,436
Total Citations
1
Affiliations
Affiliations
Tsinghua University
Papers (46)
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
ICLR 2024
1,366
citations
CPPO: Continual Learning for Reinforcement Learning with Human Feedback
ICLR 2024
32
citations
Lipschitz Singularities in Diffusion Models
ICLR 2024
21
citations
BatteryML: An Open-source Platform for Machine Learning on Battery Degradation
ICLR 2024
11
citations
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs
CVPR 2025
3
citations
Correcting Large Language Model Behavior via Influence Function
AAAI 2025
3
citations
BeyondGender: A Multifaceted Bilingual Dataset for Practical Sexism Detection
AAAI 2025
0
citations
CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models
ICML 2024
0
citations
SPDA-CNN: Unifying Semantic Part Detection and Abstraction for Fine-Grained Recognition
CVPR 2016
0
citations
Link the Head to the "Beak": Zero Shot Learning From Noisy Text Description at Part Precision
CVPR 2017arXiv
0
citations
AttnGAN: Fine-Grained Text to Image Generation With Attentional Generative Adversarial Networks
CVPR 2018arXiv
0
citations
Co-Occurrent Features in Semantic Segmentation
CVPR 2019
0
citations
Distilling Effective Supervision From Severe Label Noise
CVPR 2020arXiv
0
citations
Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models
CVPR 2020arXiv
0
citations
Cross-Modal Contrastive Learning for Text-to-Image Generation
CVPR 2021arXiv
0
citations
Learning To Prompt for Continual Learning
CVPR 2022arXiv
0
citations
MAXIM: Multi-Axis MLP for Image Processing
CVPR 2022arXiv
0
citations
MaskGIT: Masked Generative Image Transformer
CVPR 2022arXiv
0
citations
Visual Prompt Tuning for Generative Transfer Learning
CVPR 2023arXiv
0
citations
MAGVIT: Masked Generative Video Transformer
CVPR 2023arXiv
0
citations
MAGE: MAsked Generative Encoder To Unify Representation Learning and Image Synthesis
CVPR 2023arXiv
0
citations
Dimensionality-Varying Diffusion Process
CVPR 2023arXiv
0
citations
Enhanced Training of Query-Based Object Detection via Selective Query Recollection
CVPR 2023arXiv
0
citations
StackGAN: Text to Photo-Realistic Image Synthesis With Stacked Generative Adversarial Networks
ICCV 2017arXiv
0
citations
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
ICCV 2023arXiv
0
citations
FineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance Generation
ICCV 2023arXiv
0
citations
VQ3D: Learning a 3D-Aware Generative Model on ImageNet
ICCV 2023arXiv
0
citations
"Unitail: Detecting, Reading, and Matching in Retail Scene"
ECCV 2022
0
citations
BLT: Bidirectional Layout Transformer for Controllable Layout Generation
ECCV 2022
0
citations
MaxViT: Multi-axis Vision Transformer
ECCV 2022
0
citations
DualPrompt: Complementary Prompting for Rehearsal-Free Continual Learning
ECCV 2022
0
citations
Learning Instance-Specific Adaptation for Cross-Domain Segmentation
ECCV 2022
0
citations
Lane Detection Transformer Based on Multi-Frame Horizontal and Vertical Attention and Visual Transformer Module
ECCV 2022
0
citations
MUC: Mixture of Uncalibrated Cameras for Robust 3D Human Body Reconstruction
AAAI 2025
0
citations
Accelerating Diffusion Sampling via Exploiting Local Transition Coherence
ICCV 2025arXiv
0
citations
Understanding the Generalization of Stochastic Gradient Adam in Learning Neural Networks
NeurIPS 2025
0
citations
MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement
AAAI 2025
0
citations
MITracker: Multi-View Integration for Visual Object Tracking
CVPR 2025
0
citations
Inheriting Generalized Learngene for Efficient Knowledge Transfer across Multiple Tasks
AAAI 2025
0
citations
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence
NeurIPS 2020
0
citations
Improved Transformer for High-Resolution GANs
NeurIPS 2021
0
citations
GLOBEM Dataset: Multi-Year Datasets for Longitudinal Human Behavior Modeling Generalization
NeurIPS 2022
0
citations
Decision Tree for Locally Private Estimation with Public Data
NeurIPS 2023
0
citations
StoryBench: A Multifaceted Benchmark for Continuous Story Visualization
NeurIPS 2023
0
citations
Diversify Your Vision Datasets with Automatic Diffusion-based Augmentation
NeurIPS 2023
0
citations
Self-Attention Generative Adversarial Networks
ICML 2019
0
citations