P Bartie
The REAL Corpus: a crowd-sourced corpus of human generated and evaluated spatial references to real-world urban scenes
Bartie, P; Mackaness, W; Gkatzia, Dimitra; Rieser, V
Abstract
We present a newly crowd-sourced data set of natural language references to objects anchored in complex urban scenes (In short: The REAL Corpus – Referring Expressions Anchored Language). The REAL corpus contains a collection of images of real-world urban
scenes together with verbal descriptions of target objects generated by humans, paired with data on how successful other people were
able to identify the same object based on these descriptions. In total, the corpus contains 32 images with on average 27 descriptions per image and 3 verifications for each description. In addition, the corpus is annotated with a variety of linguistically motivated features.
The paper highlights issues posed by collecting data using crowd-sourcing with an unrestricted input format, as well as using real-world urban scenes. The corpus will be released via the ELRA repository as part of this submission.
Citation
Bartie, P., Mackaness, W., Gkatzia, D., & Rieser, V. (2016). The REAL Corpus: a crowd-sourced corpus of human generated and evaluated spatial references to real-world urban scenes. In 10th International Conference on Language Resources and Evaluation (LREC)
Conference Name | 10th International Conference on Language Resources and Evaluation (LREC) |
---|---|
Start Date | May 23, 2016 |
End Date | May 28, 2016 |
Acceptance Date | Jan 26, 2016 |
Publication Date | 2016 |
Deposit Date | Mar 1, 2016 |
Publicly Available Date | Mar 29, 2024 |
Peer Reviewed | Peer Reviewed |
Book Title | 10th International Conference on Language Resources and Evaluation (LREC) |
ISBN | 978-2-9517408-9-1 |
Keywords | Real-world urban spaces; spatial references; crowd sourcing; |
Public URL | http://researchrepository.napier.ac.uk/id/eprint/9564 |
Files
The REAL Corpus: A Crowd-Sourced Corpus of Human Generated and Evaluated Spatial References to Real-World Urban Scenes
(2.2 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by-nc/4.0/
You might also like
TaskMaster: A Novel Cross-platform Task-based Spoken Dialogue System for Human-Robot Interaction
(2023)
Conference Proceeding
Building a dual dataset of text-and image-grounded conversations and summarisation in Gàidhlig (Scottish Gaelic)
(2023)
Conference Proceeding
enunlg: a Python library for reproducible neural data-to-text experimentation
(2023)
Conference Proceeding
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search