Zhang, S., Loweimi, E., Bell, P., & Renals, S. (2021, August). Stochastic Attention Head Removal: A Simple and Effective Method for Improving Transformer Based ASR Models. Presented at Interspeech 2021, Brno, Czechia.