Han

30
Papers
1,699
Total Citations

Papers (30)

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

ICLR 2025arXiv
1,016
citations

Agent Attention: On the Integration of Softmax and Linear Attention

ECCV 2024arXiv
206
citations

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

ICLR 2025arXiv
165
citations

SVDQuant: Absorbing Outliers by Low-Rank Component for 4-Bit Diffusion Models

ICLR 2025arXiv
90
citations

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

NeurIPS 2025arXiv
31
citations

KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills

NeurIPS 2025arXiv
31
citations

Unveiling the Secret Recipe: A Guide For Supervised Fine-Tuning Small LLMs

ICLR 2025arXiv
30
citations

MLLMs Need 3D-Aware Representation Supervision for Scene Understanding

NeurIPS 2025arXiv
17
citations

Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search

NeurIPS 2025arXiv
15
citations

On the Feature Learning in Diffusion Models

ICLR 2025arXiv
13
citations

ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments

ICLR 2025arXiv
12
citations

Learn from the Learnt: Source-Free Active Domain Adaptation via Contrastive Sampling and Visual Persistence

ECCV 2024arXiv
10
citations

Breach By A Thousand Leaks: Unsafe Information Leakage in 'Safe' AI Responses

ICLR 2025arXiv
10
citations

ADIFF: Explaining audio difference using natural language

ICLR 2025arXiv
9
citations

BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems

NeurIPS 2025arXiv
9
citations

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

NeurIPS 2025arXiv
8
citations

LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment

ECCV 2024arXiv
5
citations

Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

ICLR 2025arXiv
5
citations

Non-parametric Sensor Noise Modeling and Synthesis

ECCV 2024
4
citations

Physics-aligned field reconstruction with diffusion bridge

ICLR 2025
3
citations

Distribution-Aligned Decoding for Efficient LLM Task Adaptation

NeurIPS 2025arXiv
3
citations

Online Statistical Inference in Decision Making with Matrix Context

NeurIPS 2025arXiv
2
citations

Streaming Attention Approximation via Discrepancy Theory

NeurIPS 2025arXiv
2
citations

Creativity or Brute Force? Using Brainteasers as a Window into the Problem-Solving Abilities of Large Language Models

NeurIPS 2025arXiv
1
citations

HyPlaneHead: Rethinking Tri-plane-like Representations in Full-Head Image Synthesis

NeurIPS 2025arXiv
1
citations

Structured Temporal Causality for Interpretable Multivariate Time Series Anomaly Detection

NeurIPS 2025arXiv
1
citations

Differentially Private Federated Low Rank Adaptation Beyond Fixed-Matrix

NeurIPS 2025arXiv
0
citations

Diffusing to the Top: Boost Graph Neural Networks with Minimal Hyperparameter Tuning

ICLR 2025arXiv
0
citations

Minimal Semantic Sufficiency Meets Unsupervised Domain Generalization

NeurIPS 2025arXiv
0
citations

Fairness-Regularized Online Optimization with Switching Costs

NeurIPS 2025arXiv
0
citations