Yao Fu
9
Papers
165
Total Citations
Papers (9)
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
ICLR 2025arXiv
165
citations
Data Engineering for Scaling Language Models to 128K Context
ICML 2024
0
citations
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
ICML 2024
0
citations
RAM-W600: A Multi-Task Wrist Dataset and Benchmark for Rheumatoid Arthritis
NeurIPS 2025
0
citations
To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis
NeurIPS 2023
0
citations
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models
NeurIPS 2023
0
citations
Paraphrase Generation with Latent Bag of Words
NeurIPS 2019
0
citations
Latent Template Induction with Gumbel-CRFs
NeurIPS 2020
0
citations
Analyzing the Confidentiality of Undistillable Teachers in Knowledge Distillation
NeurIPS 2021
0
citations