Hao Zhang

44

Papers

232

Total Citations

1

Affiliations

Affiliations

UIUC

Papers (44)

Visual In-Context Prompting

Revisiting Single Image Reflection Removal In the Wild

Explaining Generalization Power of a DNN Using Interactive Concepts

OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

CSL: Class-Agnostic Structure-Constrained Learning for Segmentation including the Unseen

Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction

Learning Implicit Representation for Reconstructing Articulated Objects

Non-parametric Representation Learning with Kernels

Learning Adaptive Lighting via Channel-Aware Guidance

PALMBENCH: A COMPREHENSIVE BENCHMARK OF COMPRESSED LARGE LANGUAGE MODELS ON MOBILE PLATFORMS

Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images

PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling

High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity

Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification

Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation

NeurIPS 2025arXiv

Cross-Modal Stealth: A Coarse-to-Fine Attack Framework for RGB-T Tracker

MoPFormer: Motion-Primitive Transformer for Wearable-Sensor Activity Recognition

GALA: Geometry-Aware Local Adaptive Grids for Detailed 3D Generation

MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving

CLLMs: Consistency Large Language Models

Online Speculative Decoding

S3O: A Dual-Phase Approach for Reconstructing Dynamic Shape and Skeleton of Articulated Objects from Single Monocular Video

MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

InferCept: Efficient Intercept Support for Augmented Large Language Model Inference

ArcPro: Architectural Programs for Structured 3D Abstraction of Sparse Points

When Will Gradient Regularization Be Harmful?

Explaining Domain Shifts in Language: Concept Erasing for Interpretable Image Classification

Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models

IMoRe: Implicit Program-Guided Reasoning for Human Motion Q&A

SDMatte: Grafting Diffusion Models for Interactive Matting

TemCoCo: Temporally Consistent Multi-modal Video Fusion with Visual-Semantic Collaboration

RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes

NeurIPS 2025arXiv

Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning

Boosting Vision State Space Model with Fractal Scanning

Scalable Trajectory-User Linking with Dual-Stream Representation Networks

A Robust Mutual-Reinforcing Framework for 3D Multi-Modal Medical Image Fusion Based on Visual-Semantic Consistency

Clarifying the Behavior and the Difficulty of Adversarial Training

Uncovering What Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly

Multi-Task Dense Prediction via Mixture of Low-Rank Experts

MeaCap: Memory-Augmented Zero-shot Image Captioning

LBI-FL: Low-Bit Integerized Federated Learning with Temporally Dynamic Bit-Width Allocation

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

Improving Adversarial Energy-Based Model via Diffusion Process