Humphrey Shi

16
Papers
446
Total Citations
3
Affiliations

Affiliations

Oregon, Georgia Tech, UIUC

Papers (16)

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

CVPR 2025
154
citations

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

ICLR 2025, arXiv
116
citations

OpenBias: Open-set Bias Detection in Text-to-Image Generative Models

CVPR 2024
69
citations

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

CVPR 2024
48
citations

Benchmarking Object Detectors with COCO: A New Path Forward

ECCV 2024
23
citations

PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor

CVPR 2024
17
citations

Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis

CVPR 2024
10
citations

Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community

ICLR 2024
8
citations

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance

ICCV 2025, arXiv
1
citation

T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation

ICCV 2025
0
citations

Prompt-Free Diffusion: Taking “Text” out of Text-to-Image Diffusion Models

CVPR 2024
0
citations

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

CVPR 2024
0
citations

CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting

ICCV 2025
0
citations

Brush2Prompt: Contextual Prompt Generator for Object Inpainting

CVPR 2024
0
citations

HyPiDecoder: Hybrid Pixel Decoder for Efficient Segmentation and Detection

ICCV 2025
0
citations

Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment

CVPR 2025, arXiv
0
citations