Nikolaos Panagiaris
Generating Unambiguous and Diverse Referring Expressions
Panagiaris, Nikolaos; Hart, Emma; Gkatzia, Dimitra
Authors
Prof Emma Hart E.Hart@napier.ac.uk
Professor
Dr Dimitra Gkatzia D.Gkatzia@napier.ac.uk
Associate Professor
Abstract
Neural Referring Expression Generation (REG) models have shown promising results in generating expressions which uniquely describe visual objects. However, current REG models still lack the ability to produce diverse and unambiguous referring expressions (REs). To address the lack of diversity, we propose generating a set of diverse REs, rather than one-shot REs. To reduce the ambiguity of referring expressions, we directly optimise non-differentiable test metrics using reinforcement learning (RL), and we show that our approaches achieve better results under multiple different settings. Specifically, we initially present a novel RL approach to REG training, which instead of drawing one sample per input, it averages over multiple samples to normalize the reward during RL training. Secondly, we present an innovative REG model that utilizes an object attention mechanism that explicitly incorporates information about the target object and is optimised using our proposed RL approach. Thirdly, we propose a novel transformer model optimised with RL that exploits different levels of visual information. Our human evaluation demonstrates the effectiveness of this model, where we improve the state-of-the-art results in RefCOCO testA and testB in terms of task success from to and from to respectively. While in RefCOCO+ testA we show improvements from to . Finally, we present a thorough comparison of diverse decoding strategies (sampling and maximisation-based) and how they control the trade-off between the quality and diversity.
Citation
Panagiaris, N., Hart, E., & Gkatzia, D. (2021). Generating Unambiguous and Diverse Referring Expressions . Computer Speech and Language, 68, Article 101184. https://doi.org/10.1016/j.csl.2020.101184
Journal Article Type | Article |
---|---|
Acceptance Date | Dec 10, 2020 |
Online Publication Date | Dec 31, 2020 |
Publication Date | 2021-07 |
Deposit Date | Dec 11, 2020 |
Publicly Available Date | Jan 1, 2022 |
Print ISSN | 0885-2308 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 68 |
Article Number | 101184 |
DOI | https://doi.org/10.1016/j.csl.2020.101184 |
Keywords | Referring Expression Generation, Natural Language Generation, Neural Models |
Public URL | http://researchrepository.napier.ac.uk/Output/2710362 |
Files
Generating Unambiguous And Diverse Referring Expressions (accepted version)
(4.7 Mb)
PDF
Licence
http://creativecommons.org/licenses/by-nc-nd/4.0/
Copyright Statement
Accepted version licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license.
You might also like
Advances in artificial immune systems
(2011)
Journal Article
On Clonal Selection.
(2011)
Journal Article
Structure versus function: a topological perspective on immune networks
(2009)
Journal Article
How affinity influences tolerance in an idiotypic network.
(2007)
Journal Article
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search