Han Lin
4
Papers
61
Total Citations
Papers (4)
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
ICLR 2025arXiv
48
citations
VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning
ICLR 2025
7
citations
Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents
NeurIPS 2025arXiv
6
citations
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
ICML 2024
0
citations