Yu
25
Papers
396
Total Citations
Papers (25)
VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
ICLR 2025 · arXiv
121
citations
DSBench: How Far Are Data Science Agents from Becoming Data Science Experts?
ICLR 2025 · arXiv
62
citations
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
NeurIPS 2025 · arXiv
52
citations
RRM: Robust Reward Model Training Mitigates Reward Hacking
ICLR 2025 · arXiv
44
citations
Can LLMs Understand Time Series Anomalies?
ICLR 2025 · arXiv
32
citations
KGGen: Extracting Knowledge Graphs from Plain Text with Language Models
NeurIPS 2025 · arXiv
25
citations
MoonCast: High-Quality Zero-Shot Podcast Generation
NeurIPS 2025 · arXiv
18
citations
SimulPL: Aligning Human Preferences in Simultaneous Machine Translation
ICLR 2025
8
citations
Pursuing Feature Separation based on Neural Collapse for Out-of-Distribution Detection
ICLR 2025 · arXiv
7
citations
CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching
NeurIPS 2025 · arXiv
6
citations
Pose-Aware Self-Supervised Learning with Viewpoint Trajectory Regularization
ECCV 2024 · arXiv
4
citations
Discovering Influential Neuron Path in Vision Transformers
ICLR 2025 · arXiv
4
citations
Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control
NeurIPS 2025 · arXiv
3
citations
Elucidated Rolling Diffusion Models for Probabilistic Forecasting of Complex Dynamics
NeurIPS 2025 · arXiv
2
citations
OAT: Object-Level Attention Transformer for Gaze Scanpath Prediction
ECCV 2024 · arXiv
2
citations
Rethinking Residual Distribution in Locate-then-Edit Model Editing
NeurIPS 2025 · arXiv
2
citations
PolyhedronNet: Representation Learning for Polyhedra with Surface-attributed Graph
ICLR 2025 · arXiv
1
citation
UniGist: Towards General and Hardware-aligned Sequence-level Long Context Compression
NeurIPS 2025 · arXiv
1
citation
ViewCraft3D: High-fidelity and View-Consistent 3D Vector Graphics Synthesis
NeurIPS 2025 · arXiv
1
citation
Unifying Proportional Fairness in Centroid and Non-Centroid Clustering
NeurIPS 2025 · arXiv
1
citation
Empowering Resampling Operation for Ultra-High-Definition Image Enhancement with Model-Aware Guidance
CVPR 2024
0
citations
Revealing Multimodal Causality with Large Language Models
NeurIPS 2025 · arXiv
0
citations
Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery
NeurIPS 2025 · arXiv
0
citations
Simulating Society Requires Simulating Thought
NeurIPS 2025 · arXiv
0
citations
Resolution Attack: Exploiting Image Compression to Deceive Deep Neural Networks
ICLR 2025
0
citations