YI WU

19
Papers
25
Total Citations

Papers (19)

On Conformal Isometry of Grid Cells: Learning Distance-Preserving Position Embedding

ICLR 2025arXiv
11
citations

Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation

ECCV 2024arXiv
10
citations

UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning

NeurIPS 2025arXiv
3
citations

EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization

NeurIPS 2025arXiv
1
citations

PixArt-Sigma: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

ECCV 2024
0
citations

Compositional Substitutivity of Visual Reasoning for Visual Question Answering

ECCV 2024
0
citations

SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models

ECCV 2024arXiv
0
citations

Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation

ECCV 2024arXiv
0
citations

Learning Pseudo 3D Guidance for View-consistent Texturing with 2D Diffusion

ECCV 2024
0
citations

AI Progress Should Be Measured by Capability-Per-Resource, Not Scale Alone: A Framework for Gradient-Guided Resource Allocation in LLMs

NeurIPS 2025arXiv
0
citations

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

NeurIPS 2025arXiv
0
citations

MLZero: A Multi-Agent System for End-to-end Machine Learning Automation

NeurIPS 2025arXiv
0
citations

WritingBench: A Comprehensive Benchmark for Generative Writing

NeurIPS 2025arXiv
0
citations

WebThinker: Empowering Large Reasoning Models with Deep Research Capability

NeurIPS 2025arXiv
0
citations

AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs?

NeurIPS 2025arXiv
0
citations

Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Bootstrapping

NeurIPS 2025arXiv
0
citations

Counterfactual Generative Modeling with Variational Causal Inference

ICLR 2025arXiv
0
citations

Learning Fine-Grained Representations through Textual Token Disentanglement in Composed Video Retrieval

ICLR 2025
0
citations

FlowDec: A flow-based full-band general audio codec with high perceptual quality

ICLR 2025arXiv
0
citations