Fu

26 Papers
388 Total Citations

Papers (26)

UMA: A Family of Universal Models for Atoms

NeurIPS 2025 | arXiv
62 citations

Hymba: A Hybrid-head Architecture for Small Language Models

ICLR 2025 | arXiv
55 citations

Vision Language Models are In-Context Value Learners

ICLR 2025 | arXiv
43 citations

3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

ICLR 2025 | arXiv
40 citations

NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation

ECCV 2024 | arXiv
33 citations

Fast-in-Slow: A Dual-System VLA Model Unifying Fast Manipulation within Slow Reasoning

NeurIPS 2025
27 citations

CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery

ICLR 2025 | arXiv
27 citations

SWE-bench Goes Live!

NeurIPS 2025 | arXiv
22 citations

Nemotron-CLIMB: Clustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

NeurIPS 2025 | arXiv
19 citations

Is Artificial Intelligence Generated Image Detection a Solved Problem?

NeurIPS 2025 | arXiv
15 citations

Sports-Traj: A Unified Trajectory Generation Model for Multi-Agent Movement in Sports

ICLR 2025 | arXiv
10 citations

Learning-Augmented Search Data Structures

ICLR 2025 | arXiv
6 citations

Towards Doctor-Like Reasoning: Medical RAG Fusing Knowledge with Patient Analogy through Textual Gradients

NeurIPS 2025
6 citations

Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence

NeurIPS 2025 | arXiv
6 citations

Not-So-Optimal Transport Flows for 3D Point Cloud Generation

ICLR 2025 | arXiv
5 citations

Exploring Diffusion Transformer Designs via Grafting

NeurIPS 2025 | arXiv
4 citations

ThunderKittens: Simple, Fast, and $\textit{Adorable}$ Kernels

ICLR 2025
3 citations

Hamiltonian Descent Algorithms for Optimization: Accelerated Rates via Randomized Integration Time

NeurIPS 2025 | arXiv
2 citations

COS3D: Collaborative Open-Vocabulary 3D Segmentation

NeurIPS 2025 | arXiv
1 citation

From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction

NeurIPS 2025 | arXiv
1 citation

KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems

NeurIPS 2025 | arXiv
1 citation

Towards Reliable and Holistic Visual In-Context Learning Prompt Selection

NeurIPS 2025 | arXiv
0 citations

Rainbow Delay Compensation: A Multi-Agent Reinforcement Learning Framework for Mitigating Observation Delays

NeurIPS 2025
0 citations

ScatterAD: Temporal-Topological Scattering Mechanism for Time Series Anomaly Detection

NeurIPS 2025 | arXiv
0 citations

VisualLens: Personalization through Task-Agnostic Visual History

NeurIPS 2025 | arXiv
0 citations

Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

NeurIPS 2025 | arXiv
0 citations