Research Repository

All Outputs (2)

Raw Sign and Magnitude Spectra for Multi-Head Acoustic Modelling (2020)
Presentation / Conference Contribution
Loweimi, E., Bell, P., & Renals, S. (2020). Raw Sign and Magnitude Spectra for Multi-Head Acoustic Modelling. In Proc. Interspeech 2020 (1644-1648). https://doi.org/10.21437/interspeech.2020-18

In this paper, we investigate the usefulness of the sign spectrum and its combination with the raw magnitude spectrum in acoustic modelling for automatic speech recognition (ASR). The sign spectrum is a sequence of ±1s, capturing one bit of the phase...

On the Robustness and Training Dynamics of Raw Waveform Models (2020)
Presentation / Conference Contribution
Loweimi, E., Bell, P., & Renals, S. (2020). On the Robustness and Training Dynamics of Raw Waveform Models. In Proc. Interspeech 2020 (1001-1005). https://doi.org/10.21437/interspeech.2020-17

We investigate the robustness and training dynamics of raw waveform acoustic models for automatic speech recognition (ASR). It is known that the first layer of such models learns a set of filters, performing a form of time-frequency analysis. This lay...