Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models

22citations
PDFProject
22
Citations
6
Authors
1
Data Points

Citation History

Jan 28, 2026
22