Robust Real-time Audio-Visual Speech Enhancement based on DNN and GAN
(2024)
Journal Article
Gogate, M., Dashtipour, K., & Hussain, A. (in press). Robust Real-time Audio-Visual Speech Enhancement based on DNN and GAN. IEEE Transactions on Artificial Intelligence, https://doi.org/10.1109/tai.2024.3366141
The human auditory cortex contextually integrates audio-visual (AV) cues to better understand speech in a cocktail party situation. Recent studies have shown that AV speech enhancement (SE) models can significantly improve speech quality and intellig... Read More about Robust Real-time Audio-Visual Speech Enhancement based on DNN and GAN.