Poster "long-context reasoning" Papers
2 papers found
Learning to Focus: Causal Attention Distillation via Gradient‐Guided Token Pruning
Yiju Guo, Wenkai Yang, Zexu Sun et al.
NeurIPS 2025 poster, arXiv:2506.07851
3 citations
Scaling Instruction-tuned LLMs to Million-token Contexts via Hierarchical Synthetic Data Generation
Linda He, Jue Wang, Maurice Weber et al.
ICLR 2025 poster, arXiv:2504.12637
2 citations