Humphrey Shi
40
Papers
446
Total Citations
3
Affiliations
Affiliations
Oregon, Georgia Tech, UIUC
Papers (40)
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
CVPR 2025
154
citations
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
ICLR 2025, arXiv
116
citations
OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
CVPR 2024
69
citations
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
CVPR 2024
48
citations
Benchmarking Object Detectors with COCO: A New Path Forward
ECCV 2024
23
citations
PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor
CVPR 2024
17
citations
Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis
CVPR 2024
10
citations
Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community
ICLR 2024
8
citations
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance
ICCV 2025, arXiv
1
citations
Learning to Track Instances without Video Annotations
CVPR 2021, arXiv
0
citations
DiSparse: Disentangled Sparsification for Multitask Model Compression
CVPR 2022
0
citations
VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution
CVPR 2022
0
citations
Object Localization Under Single Coarse Point Supervision
CVPR 2022, arXiv
0
citations
Towards Layer-Wise Image Vectorization
CVPR 2022
0
citations
AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition
CVPR 2022, arXiv
0
citations
OneFormer: One Transformer To Rule Universal Image Segmentation
CVPR 2023, arXiv
0
citations
Graph Transformer GANs for Graph-Constrained House Generation
CVPR 2023, arXiv
0
citations
Automatic High Resolution Wire Segmentation and Removal
CVPR 2023, arXiv
0
citations
Zero-Shot Generative Model Adaptation via Image-Specific Prompt Learning
CVPR 2023, arXiv
0
citations
Neighborhood Attention Transformer
CVPR 2023, arXiv
0
citations
Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models To Learn Any Unseen Style
CVPR 2023
0
citations
A Multi-Mode Modulator for Multi-Domain Few-Shot Classification
ICCV 2021
0
citations
Interpretable Visual Reasoning via Induced Symbolic Space
ICCV 2021, arXiv
0
citations
MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices
ICCV 2023
0
citations
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
ICCV 2023, arXiv
0
citations
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
ICCV 2023
0
citations
AdaFocusV3: On Unified Spatial-Temporal Dynamic Video Recognition
ECCV 2022
0
citations
Point-to-Box Network for Accurate Object Detection via Single Point Supervision
ECCV 2022
0
citations
SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image
ECCV 2022
0
citations
Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment
CVPR 2025, arXiv
0
citations
CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting
ICCV 2025
0
citations
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation
ICCV 2025
0
citations
HyPiDecoder: Hybrid Pixel Decoder for Efficient Segmentation and Detection
ICCV 2025
0
citations
Prompt-Free Diffusion: Taking “Text” out of Text-to-Image Diffusion Models
CVPR 2024
0
citations
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models
CVPR 2024
0
citations
Brush2Prompt: Contextual Prompt Generator for Object Inpainting
CVPR 2024
0
citations
Rethinking Text Segmentation: A Novel Dataset and a Text-Specific Refinement Approach
CVPR 2021, arXiv
0
citations
Adaptive Consistency Regularization for Semi-Supervised Transfer Learning
CVPR 2021, arXiv
0
citations
Mask Matching Transformer for Few-Shot Segmentation
NeurIPS 2022
0
citations
Learning Mask-aware CLIP Representations for Zero-Shot Segmentation
NeurIPS 2023
0
citations