ASR
Mega-ASR: When Speech Recognition Stops Hearing the Audio
A clean ASR mistake is usually easy to describe. The model hears "Victor" as "vector." It drops a short function word. It guesses the wrong homophone. The transcript is wrong, but it is still anchored to the waveform. Severe real-world audio fails differently. A distant speaker