Yuan

33
Papers
629
Total Citations

Papers (33)

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

NeurIPS 2025 · arXiv
118
citations

MoBA: Mixture of Block Attention for Long-Context LLMs

NeurIPS 2025 · arXiv
94
citations

ImgEdit: A Unified Image Editing Dataset and Benchmark

NeurIPS 2025 · arXiv
84
citations

MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences

ICLR 2025 · arXiv
64
citations

GVGEN: Text-to-3D Generation with Volumetric Representation

ECCV 2024 · arXiv
51
citations

HiFi-123: Towards High-fidelity One Image to 3D Content Generation

ECCV 2024 · arXiv
34
citations

GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation

ECCV 2024 · arXiv
33
citations

Boosting Neural Combinatorial Optimization for Large-Scale Vehicle Routing Problems

ICLR 2025
18
citations

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation

ECCV 2024 · arXiv
17
citations

CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection

ECCV 2024
14
citations

InstantSplamp: Fast and Generalizable Stenography Framework for Generative Gaussian Splatting

ICLR 2025
12
citations

DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment

ECCV 2024
12
citations

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

ECCV 2024 · arXiv
11
citations

Generating Physically Realistic and Directable Human Motions from Multi-Modal Inputs

ECCV 2024 · arXiv
10
citations

VideoMAR: Autoregressive Video Generation with Continuous Tokens

NeurIPS 2025
8
citations

LLM Strategic Reasoning: Agentic Study through Behavioral Game Theory

NeurIPS 2025 · arXiv
7
citations

IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning

ICLR 2025 · arXiv
6
citations

IGL-Bench: Establishing the Comprehensive Benchmark for Imbalanced Graph Learning

ICLR 2025 · arXiv
5
citations

The Fluorescent Veil: A Stealthy and Effective Physical Adversarial Patch Against Traffic Sign Recognition

NeurIPS 2025 · arXiv
5
citations

Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model

ICLR 2025 · arXiv
4
citations

The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training

NeurIPS 2025 · arXiv
4
citations

LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models

NeurIPS 2025 · arXiv
3
citations

Certifying Language Model Robustness with Fuzzed Randomized Smoothing: An Efficient Defense Against Backdoor Attacks

ICLR 2025 · arXiv
3
citations

Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning

NeurIPS 2025 · arXiv
2
citations

RobotSmith: Generative Robotic Tool Design for Acquisition of Complex Manipulation Skills

NeurIPS 2025 · arXiv
2
citations

SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought

NeurIPS 2025 · arXiv
2
citations

Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images

ECCV 2024 · arXiv
2
citations

MoORE: SVD-based Model MoE-ization for Conflict- and Oblivion-Resistant Multi-Task Adaptation

NeurIPS 2025 · arXiv
1
citation

Adaptive Stochastic Coefficients for Accelerating Diffusion Sampling

NeurIPS 2025 · arXiv
1
citation

Multi-Granularity Sparse Relationship Matrix Prediction Network for End-to-End Scene Graph Generation

ECCV 2024
1
citation

Forecasting Future Videos from Novel Views via Disentangled 3D Scene Representation

ECCV 2024 · arXiv
1
citation

MeCeFO: Enhancing LLM Training Robustness via Fault-Tolerant Optimization

NeurIPS 2025 · arXiv
0
citations

FedGPS: Statistical Rectification Against Data Heterogeneity in Federated Learning

NeurIPS 2025 · arXiv
0
citations