Yu

25
Papers
396
Total Citations

Papers (25)

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

ICLR 2025arXiv
121
citations

DSBench: How Far Are Data Science Agents from Becoming Data Science Experts?

ICLR 2025arXiv
62
citations

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

NeurIPS 2025arXiv
52
citations

RRM: Robust Reward Model Training Mitigates Reward Hacking

ICLR 2025arXiv
44
citations

Can LLMs Understand Time Series Anomalies?

ICLR 2025arXiv
32
citations

KGGen: Extracting Knowledge Graphs from Plain Text with Language Models

NeurIPS 2025arXiv
25
citations

MoonCast: High-Quality Zero-Shot Podcast Generation

NeurIPS 2025arXiv
18
citations

SimulPL: Aligning Human Preferences in Simultaneous Machine Translation

ICLR 2025
8
citations

Pursuing Feature Separation based on Neural Collapse for Out-of-Distribution Detection

ICLR 2025arXiv
7
citations

CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching

NeurIPS 2025arXiv
6
citations

Pose-Aware Self-Supervised Learning with Viewpoint Trajectory Regularization

ECCV 2024arXiv
4
citations

Discovering Influential Neuron Path in Vision Transformers

ICLR 2025arXiv
4
citations

Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control

NeurIPS 2025arXiv
3
citations

Elucidated Rolling Diffusion Models for Probabilistic Forecasting of Complex Dynamics

NeurIPS 2025arXiv
2
citations

OAT: Object-Level Attention Transformer for Gaze Scanpath Prediction

ECCV 2024arXiv
2
citations

Rethinking Residual Distribution in Locate-then-Edit Model Editing

NeurIPS 2025arXiv
2
citations

PolyhedronNet: Representation Learning for Polyhedra with Surface-attributed Graph

ICLR 2025arXiv
1
citations

UniGist: Towards General and Hardware-aligned Sequence-level Long Context Compression

NeurIPS 2025arXiv
1
citations

ViewCraft3D: High-fidelity and View-Consistent 3D Vector Graphics Synthesis

NeurIPS 2025arXiv
1
citations

Unifying Proportional Fairness in Centroid and Non-Centroid Clustering

NeurIPS 2025arXiv
1
citations

Empowering Resampling Operation for Ultra-High-Definition Image Enhancement with Model-Aware Guidance

CVPR 2024
0
citations

Revealing Multimodal Causality with Large Language Models

NeurIPS 2025arXiv
0
citations

Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery

NeurIPS 2025arXiv
0
citations

Simulating Society Requires Simulating Thought

NeurIPS 2025arXiv
0
citations

Resolution Attack: Exploiting Image Compression to Deceive Deep Neural Networks

ICLR 2025
0
citations