Hao Zhang

44
Papers
232
Total Citations
1
Affiliation

Affiliations

UIUC

Papers (44)

Visual In-Context Prompting

CVPR 2024
52
citations

Revisiting Single Image Reflection Removal In the Wild

CVPR 2024
37
citations

Explaining Generalization Power of a DNN Using Interactive Concepts

AAAI 2024 (arXiv)
33
citations

OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

CVPR 2025
18
citations

CSL: Class-Agnostic Structure-Constrained Learning for Segmentation including the Unseen

AAAI 2024 (arXiv)
15
citations

Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction

CVPR 2024
14
citations

Learning Implicit Representation for Reconstructing Articulated Objects

ICLR 2024
11
citations

Non-parametric Representation Learning with Kernels

AAAI 2024 (arXiv)
11
citations

Learning Adaptive Lighting via Channel-Aware Guidance

ICML 2025
7
citations

PalmBench: A Comprehensive Benchmark of Compressed Large Language Models on Mobile Platforms

ICLR 2025
6
citations

Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images

ECCV 2024
5
citations

PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling

ICCV 2025
5
citations

High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity

ICLR 2025 (arXiv)
5
citations

Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification

AAAI 2024 (arXiv)
4
citations

Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation

NeurIPS 2025 (arXiv)
3
citations

Cross-Modal Stealth: A Coarse-to-Fine Attack Framework for RGB-T Tracker

AAAI 2025
3
citations

MoPFormer: Motion-Primitive Transformer for Wearable-Sensor Activity Recognition

NeurIPS 2025
2
citations

GALA: Geometry-Aware Local Adaptive Grids for Detailed 3D Generation

ICLR 2025
1
citations

MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving

ICML 2024
0
citations

CLLMs: Consistency Large Language Models

ICML 2024
0
citations

Online Speculative Decoding

ICML 2024
0
citations

S3O: A Dual-Phase Approach for Reconstructing Dynamic Shape and Skeleton of Articulated Objects from Single Monocular Video

ICML 2024
0
citations

MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

ICML 2024
0
citations

Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

ICML 2024
0
citations

InferCept: Efficient Intercept Support for Augmented Large Language Model Inference

ICML 2024
0
citations

ArcPro: Architectural Programs for Structured 3D Abstraction of Sparse Points

CVPR 2025
0
citations

When Will Gradient Regularization Be Harmful?

ICML 2024
0
citations

Explaining Domain Shifts in Language: Concept Erasing for Interpretable Image Classification

CVPR 2025
0
citations

Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models

CVPR 2025
0
citations

IMoRe: Implicit Program-Guided Reasoning for Human Motion Q&A

ICCV 2025
0
citations

SDMatte: Grafting Diffusion Models for Interactive Matting

ICCV 2025
0
citations

TemCoCo: Temporally Consistent Multi-modal Video Fusion with Visual-Semantic Collaboration

ICCV 2025
0
citations

RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes

NeurIPS 2025 (arXiv)
0
citations

Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning

AAAI 2025
0
citations

Boosting Vision State Space Model with Fractal Scanning

AAAI 2025
0
citations

Scalable Trajectory-User Linking with Dual-Stream Representation Networks

AAAI 2025
0
citations

A Robust Mutual-Reinforcing Framework for 3D Multi-Modal Medical Image Fusion Based on Visual-Semantic Consistency

AAAI 2024
0
citations

Clarifying the Behavior and the Difficulty of Adversarial Training

AAAI 2024
0
citations

Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly

CVPR 2024
0
citations

Multi-Task Dense Prediction via Mixture of Low-Rank Experts

CVPR 2024
0
citations

MeaCap: Memory-Augmented Zero-shot Image Captioning

CVPR 2024
0
citations

LBI-FL: Low-Bit Integerized Federated Learning with Temporally Dynamic Bit-Width Allocation

ICML 2025
0
citations

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

ICML 2024
0
citations

Improving Adversarial Energy-Based Model via Diffusion Process

ICML 2024
0
citations