Sho Yokoi
3
papers
42
total citations
papers (3)
Analyzing Feed-Forward Blocks in Transformers through the Lens of Attention Maps
ICLR 2024arXiv
28
citations
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models
ICLR 2025arXiv
12
citations
SoftMatcha: A Soft and Fast Pattern Matcher for Billion-Scale Corpus Searches
ICLR 2025arXiv
2
citations