School of Computing Engineering and the Built Environment

Iterative Speech Enhancement with Transformers (2024)
Presentation / Conference Contribution
Nazemi, A., Sami, A., Sami, M., & Hussain, A. (2024, September). Iterative Speech Enhancement with Transformers. Presented at 3rd COG-MHEAR Workshop on Audio-Visual Speech Enhancement (AVSEC), Kos, Greece

Enhancing audio quality in audio-video speech enhancement (AVSE) is a crucial step in improving the performance of speech recognition systems, particularly by integrating visual and auditory data to create more robust and accurate models. This study... Read More about Iterative Speech Enhancement with Transformers.

A Framework for Speech Enhancement based on Audio Signal and Speaker Embeddings (2024)
Presentation / Conference Contribution
Nazemi, A., Sami, A., Sami, M., & Hussain, A. (2024, September). A Framework for Speech Enhancement based on Audio Signal and Speaker Embeddings. Presented at 3rd COG-MHEAR Workshop on Audio-Visual Speech Enhancement (AVSEC), Kos Island, Greece

This study addresses the challenge of speech enhancement within an audio-only context. Our proposed framework extracts speaker embeddings and voice signals, subsequently integrating these components to synthesise a voice based on the extracted data.... Read More about A Framework for Speech Enhancement based on Audio Signal and Speaker Embeddings.

Outputs (2)