"automatic speech recognition" Papers
7 papers found
BlockDecoder: Boosting ASR Decoders with Context and Merger Modules
Darshan Prabhu, Preethi Jyothi
NeurIPS 2025poster
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale
Joya Chen, Yiqi Lin, Ziyun Zeng et al.
CVPR 2025posterarXiv:2504.16030
4
citations
Speech Robust Bench: A Robustness Benchmark For Speech Recognition
Muhammad Shah, David Solans Noguero, Mikko Heikkilä et al.
ICLR 2025posterarXiv:2403.07937
12
citations
VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model
Zuwei Long, Yunhang Shen, Chaoyou Fu et al.
NeurIPS 2025poster
17
citations
An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems
Hitesh Tulsiani, David Chan, Shalini Ghosh et al.
ICML 2024posterarXiv:2409.10515
HowToCaption: Prompting LLMs to Transform Video Annotations at Scale
Nina Shvetsova, Anna Kukleva, Xudong Hong et al.
ECCV 2024posterarXiv:2310.04900
31
citations
Speech Self-Supervised Learning Using Diffusion Model Synthetic Data
Heting Gao, Kaizhi Qian, Junrui Ni et al.
ICML 2024poster