Nikolaos Panagiaris
Generating Unambiguous and Diverse Referring Expressions
Panagiaris, Nikolaos; Hart, Emma; Gkatzia, Dimitra
Authors
Prof Emma Hart E.Hart@napier.ac.uk
Professor
Dr Dimitra Gkatzia D.Gkatzia@napier.ac.uk
Associate Professor
Abstract
Neural Referring Expression Generation (REG) models have shown promising results in generating expressions which uniquely describe visual objects. However, current REG models still lack the ability to produce diverse and unambiguous referring expressions (REs). To address the lack of diversity, we propose generating a set of diverse REs, rather than one-shot REs. To reduce the ambiguity of referring expressions, we directly optimise non-differentiable test metrics using reinforcement learning (RL), and we show that our approaches achieve better results under multiple different settings. Specifically, we initially present a novel RL approach to REG training, which instead of drawing one sample per input, it averages over multiple samples to normalize the reward during RL training. Secondly, we present an innovative REG model that utilizes an object attention mechanism that explicitly incorporates information about the target object and is optimised using our proposed RL approach. Thirdly, we propose a novel transformer model optimised with RL that exploits different levels of visual information. Our human evaluation demonstrates the effectiveness of this model, where we improve the state-of-the-art results in RefCOCO testA and testB in terms of task success from to and from to respectively. While in RefCOCO+ testA we show improvements from to . Finally, we present a thorough comparison of diverse decoding strategies (sampling and maximisation-based) and how they control the trade-off between the quality and diversity.
Citation
Panagiaris, N., Hart, E., & Gkatzia, D. (2021). Generating Unambiguous and Diverse Referring Expressions . Computer Speech and Language, 68, Article 101184. https://doi.org/10.1016/j.csl.2020.101184
Journal Article Type | Article |
---|---|
Acceptance Date | Dec 10, 2020 |
Online Publication Date | Dec 31, 2020 |
Publication Date | 2021-07 |
Deposit Date | Dec 11, 2020 |
Publicly Available Date | Jan 1, 2022 |
Print ISSN | 0885-2308 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 68 |
Article Number | 101184 |
DOI | https://doi.org/10.1016/j.csl.2020.101184 |
Keywords | Referring Expression Generation, Natural Language Generation, Neural Models |
Public URL | http://researchrepository.napier.ac.uk/Output/2710362 |
Files
Generating Unambiguous And Diverse Referring Expressions (accepted version)
(4.7 Mb)
PDF
Licence
http://creativecommons.org/licenses/by-nc-nd/4.0/
Copyright Statement
Accepted version licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license.
You might also like
Evolutionary Computation Combinatorial Optimization.
(2004)
Journal Article
A hyper-heuristic ensemble method for static job-shop scheduling.
(2016)
Journal Article
A research agenda for metaheuristic standardization.
(2015)
Presentation / Conference Contribution
A Lifelong Learning Hyper-heuristic Method for Bin Packing
(2015)
Journal Article