Humphrey Shi

40
Papers
446
Total Citations
3
Affiliations

Affiliations

OregonGeorgia TechUIUC

Papers (40)

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

CVPR 2025
154
citations

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

ICLR 2025arXiv
116
citations

OpenBias: Open-set Bias Detection in Text-to-Image Generative Models

CVPR 2024
69
citations

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

CVPR 2024
48
citations

Benchmarking Object Detectors with COCO: A New Path Forward

ECCV 2024
23
citations

PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor

CVPR 2024
17
citations

Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis

CVPR 2024
10
citations

Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community

ICLR 2024
8
citations

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance

ICCV 2025arXiv
1
citations

Learning to Track Instances without Video Annotations

CVPR 2021arXiv
0
citations

DiSparse: Disentangled Sparsification for Multitask Model Compression

CVPR 2022
0
citations

VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution

CVPR 2022
0
citations

Object Localization Under Single Coarse Point Supervision

CVPR 2022arXiv
0
citations

Towards Layer-Wise Image Vectorization

CVPR 2022
0
citations

AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition

CVPR 2022arXiv
0
citations

OneFormer: One Transformer To Rule Universal Image Segmentation

CVPR 2023arXiv
0
citations

Graph Transformer GANs for Graph-Constrained House Generation

CVPR 2023arXiv
0
citations

Automatic High Resolution Wire Segmentation and Removal

CVPR 2023arXiv
0
citations

Zero-Shot Generative Model Adaptation via Image-Specific Prompt Learning

CVPR 2023arXiv
0
citations

Neighborhood Attention Transformer

CVPR 2023arXiv
0
citations

Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models To Learn Any Unseen Style

CVPR 2023
0
citations

A Multi-Mode Modulator for Multi-Domain Few-Shot Classification

ICCV 2021
0
citations

Interpretable Visual Reasoning via Induced Symbolic Space

ICCV 2021arXiv
0
citations

MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices

ICCV 2023
0
citations

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model

ICCV 2023arXiv
0
citations

Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators

ICCV 2023
0
citations

AdaFocusV3: On Unified Spatial-Temporal Dynamic Video Recognition

ECCV 2022
0
citations

Point-to-Box Network for Accurate Object Detection via Single Point Supervision

ECCV 2022
0
citations

SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image

ECCV 2022
0
citations

Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment

CVPR 2025arXiv
0
citations

CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting

ICCV 2025
0
citations

T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation

ICCV 2025
0
citations

HyPiDecoder: Hybrid Pixel Decoder for Efficient Segmentation and Detection

ICCV 2025
0
citations

Prompt-Free Diffusion: Taking “Text” out of Text-to-Image Diffusion Models

CVPR 2024
0
citations

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

CVPR 2024
0
citations

Brush2Prompt: Contextual Prompt Generator for Object Inpainting

CVPR 2024
0
citations

Rethinking Text Segmentation: A Novel Dataset and a Text-Specific Refinement Approach

CVPR 2021arXiv
0
citations

Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

CVPR 2021arXiv
0
citations

Mask Matching Transformer for Few-Shot Segmentation

NeurIPS 2022
0
citations

Learning Mask-aware CLIP Representations for Zero-Shot Segmentation

NeurIPS 2023
0
citations