Automated Human-Readable Label Generation in Open Intent Discovery

Anderson, Grant; Hart, Emma; Gkatzia, Dimitra; Beaver, Ian

doi:10.21437/Interspeech.2024-1351

Automated Human-Readable Label Generation in Open Intent Discovery

Anderson, Grant; Hart, Emma; Gkatzia, Dimitra; Beaver, Ian

Authors

Grant Anderson G.Anderson@napier.ac.uk
Research Student

Prof Emma Hart E.Hart@napier.ac.uk
Professor

Dr Dimitra Gkatzia D.Gkatzia@napier.ac.uk
Associate Professor

Ian Beaver

Abstract

The correct determination of user intent is key in dialog systems. However, an intent classifier often requires a large, labelled training dataset to identify a set of known intents. The creation of such a dataset is a complex and time-consuming task which usually involves humans applying clustering tools to unlabelled data, analysing the results, and creating human-readable labels for each cluster. While many Open Intent Discovery works tackle the problem of discovering clusters of common intent, few generate a human-readable label that can be used to make decisions in downstream systems. To address this, we introduce a novel candidate label extraction method then evaluate six combinations of candidate extraction and label selection methods on three datasets. We find that our extraction method produces more detailed labels than the alternatives and that high quality intent labels can be generated from unlabelled data without resorting to applying costly pre-trained language models.

Citation

Anderson, G., Hart, E., Gkatzia, D., & Beaver, I. (2024, September). Automated Human-Readable Label Generation in Open Intent Discovery. Presented at Interspeech 2024, Kos, Greece

Presentation Conference Type	Conference Paper (published)
Conference Name	Interspeech 2024
Start Date	Sep 1, 2024
End Date	Sep 5, 2024
Acceptance Date	Jun 4, 2024
Online Publication Date	Oct 1, 2024
Publication Date	2024
Deposit Date	Jun 17, 2024
Publicly Available Date	Oct 3, 2024
Peer Reviewed	Peer Reviewed
Pages	3540-3544
Series Title	Interspeech
Series ISSN	2958-1796
Book Title	Interspeech 2024
DOI	https://doi.org/10.21437/Interspeech.2024-1351
Keywords	Index Terms: open intent discovery; label generation; plm prompting
External URL	https://interspeech2024.org/