Open-Source vs Close-Source: The Context Utilization Challenge

0citations

Citations

#1486

in ICLR 2025

of 3827 papers

Authors

Data Points

Authors

Litu Ou

Topics

large language models context utilization needle in a haystack test chapter summarization long context models data contamination minimization

Abstract

This blog post aims to evaluate how well the most capable open-source long context large language models (LLMs) utilize context, using the Needle In A Haystack test. We adopt the task of chapter summarization for recently published books to minimize data contamination while ensuring a challenging test. Our results show that open-source models still have room to improve in context utilization compared to close-source models..

Citation History

Jan 26, 2026

Jan 27, 2026

Feb 1, 2026