Papers by John Terilla
2 papers found
From Language Models over Tokens to Language Models over Characters
Tim Vieira, Benjamin LeBrun, Mario Giulianelli et al.
ICML 2025, spotlight, arXiv:2412.03719
The Foundations of Tokenization: Statistical and Computational Concerns
Juan Luis Gastaldi, John Terilla, Luca Malagutti et al.
ICLR 2025, poster, arXiv:2407.11606