Yu Shen

20 Papers
270 Total Citations

Papers (20)

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

NeurIPS 2025, arXiv
130 citations

What Makes a Good Diffusion Planner for Decision Making?

ICLR 2025, arXiv
24 citations

Framer: Interactive Frame Interpolation

ICLR 2025, arXiv
20 citations

How Do Large Language Models Understand Graph Patterns? A Benchmark for Graph Pattern Comprehension

ICLR 2025, arXiv
20 citations

SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Model

ICLR 2025, arXiv
19 citations

VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model

NeurIPS 2025
17 citations

Refine Knowledge of Large Language Models via Adaptive Contrastive Learning

ICLR 2025, arXiv
14 citations

KVFlow: Efficient Prefix Caching for Accelerating LLM-Based Multi-Agent Workflows

NeurIPS 2025, arXiv
8 citations

SysBench: Can LLMs Follow System Message?

ICLR 2025
5 citations

Shape My Moves: Text-Driven Shape-Aware Synthesis of Human Motions

CVPR 2025, arXiv
4 citations

API Pack: A Massive Multi-Programming Language Dataset for API Call Generation

ICLR 2025, arXiv
4 citations

VideoVLA: Video Generators Can Be Generalizable Robot Manipulators

NeurIPS 2025, arXiv
3 citations

Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs

NeurIPS 2025, arXiv
1 citation

FairViT: Fair Vision Transformer via Adaptive Masking

ECCV 2024, arXiv
1 citation

CausalVerse: Benchmarking Causal Representation Learning with Configurable High-Fidelity Simulations

NeurIPS 2025, arXiv
0 citations

GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning

NeurIPS 2025, arXiv
0 citations

UniRestore3D: A Scalable Framework For General Shape Restoration

ICLR 2025
0 citations

GAN-based Garment Generation Using Sewing Pattern Images

ECCV 2020
0 citations

Gradient-Free Adversarial Training Against Image Corruption for Learning-based Steering

NeurIPS 2021
0 citations

DivBO: Diversity-aware CASH for Ensemble Learning

NeurIPS 2022, arXiv
0 citations