Humphrey Shi

16

Papers

446

Total Citations

3

Affiliations

Oregon, Georgia Tech, UIUC

Papers (16)

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

OpenBias: Open-set Bias Detection in Text-to-Image Generative Models

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Benchmarking Object Detectors with COCO: A New Path Forward

PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor

Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis

Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance

T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation

Prompt-Free Diffusion: Taking “Text” out of Text-to-Image Diffusion Models

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting

Brush2Prompt: Contextual Prompt Generator for Object Inpainting

HyPiDecoder: Hybrid Pixel Decoder for Efficient Segmentation and Detection

Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment