Humphrey Shi
16
Papers
446
Total Citations
3
Affiliations
Affiliations
Oregon, Georgia Tech, UIUC
Papers (16)
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
CVPR 2025
154
citations
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
ICLR 2025, arXiv
116
citations
OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
CVPR 2024
69
citations
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
CVPR 2024
48
citations
Benchmarking Object Detectors with COCO: A New Path Forward
ECCV 2024
23
citations
PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor
CVPR 2024
17
citations
Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis
CVPR 2024
10
citations
Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community
ICLR 2024
8
citations
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance
ICCV 2025, arXiv
1
citations
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation
ICCV 2025
0
citations
Prompt-Free Diffusion: Taking “Text” out of Text-to-Image Diffusion Models
CVPR 2024
0
citations
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models
CVPR 2024
0
citations
CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting
ICCV 2025
0
citations
Brush2Prompt: Contextual Prompt Generator for Object Inpainting
CVPR 2024
0
citations
HyPiDecoder: Hybrid Pixel Decoder for Efficient Segmentation and Detection
ICCV 2025
0
citations
Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment
CVPR 2025, arXiv
0
citations