Skip to main content

Research Repository

Advanced Search

The benefits of contextual information for speech recognition systems

Kinch, Martin W.; Melis, Wim J.C.; Keates, Simeon

Authors

Martin W. Kinch

Wim J.C. Melis

Simeon Keates



Abstract

This paper demonstrates the significance of using contextual information in machine learning and speech recognition. While the benefits of contextual information in human communication are widely known, their significance is rarely explored or discussed with a view to their potential for improving speech recognition accuracy. The presented research primarily focuses on an undertaken empirical study that looks at how context affects human communication and understanding. During the study, comparisons between human communication with and without context, have shown overall recognition improvements of over 30% when contextual information is provided. The study has also investigated the importance of the former/middle/latter part of a word towards recognition. These results show that the first two-thirds of a spoken word are key for humans to correctly infer a word. The conclusions from the performed study are then drawn upon to identify useful types of context that can help a machine's understanding, and how such contextual information can be gathered in speech recognition and machine learning systems. This paper shows that context is not only highly important for human communication, but can easily provide a wealth of information to enhance computational systems.

Presentation Conference Type Conference Paper (Published)
Conference Name The 10th Computer Science and Electronic Engineering (CEEC'18)
Start Date Sep 19, 2018
End Date Sep 21, 2018
Acceptance Date Jul 20, 2018
Online Publication Date Mar 28, 2019
Publication Date 2018
Deposit Date Feb 7, 2019
Publisher Institute of Electrical and Electronics Engineers
Book Title The 10th Computer Science and Electronic Engineering Conference (CEEC)
DOI https://doi.org/10.1109/CEEC.2018.8674204
Keywords machine learning, contextual information, speech recognition, natural language processing, context-aware computing, artificial intelligence
Public URL http://researchrepository.napier.ac.uk/Output/1497077
Publisher URL http://gala.gre.ac.uk/21076/