Prof Jessie Kennedy J.Kennedy@napier.ac.uk
Enhanced Associate
Visual cleaning of genotype data.
Kennedy, Jessie; Graham, Martin; Paterson, Trevor; Law, Andy
Authors
Martin Graham
Trevor Paterson
Andy Law
Abstract
While some data cleaning tasks can be performed automatically, many more require expert human guidance to steer the cleaning process, especially if erroneous or unclean data is a product of relationships between entities. An example is pedigree genotype data: inheritance hierarchies in which the correctness of genotype data for any individual is judged on comparison to their relations’ genotypes, as individuals should inherit DNA from their assumed ancestors. Thus, cleaning this data must consider the relationships between individuals; sometimes this means more data must be cleaned than first assumed, while in other situations it means errors across many individuals can be remedied by cleaning the data of a shared relation. Such judgements require a domain expert to hypothesise the effect changing particular data has on the wider data set. Using a visualization tool with the ability to undertake what-if interactions can assist a user in correctly cleaning such data. We achieve this by closely coupling an existing pedigree visualisation technique, VIPER, with a genotype cleaning algorithm, and then develop necessary extensions to the visualization to allow interactive data cleaning. A comparative user evaluation with biologists shows the advantages of this visualisation design over an existing cleaning tool and we discuss the challenges in the design of visual cleaning tools in which errors may be transitive.
Conference Name | BioVis 2013 |
---|---|
Start Date | Oct 13, 2013 |
End Date | Oct 14, 2013 |
Publication Date | 2013 |
Deposit Date | Nov 12, 2013 |
Publicly Available Date | Dec 31, 2013 |
Peer Reviewed | Peer Reviewed |
Pages | 105-112 |
Book Title | Proceedings of BioVis 2013 |
ISBN | 978-1-4799-1658-0 |
DOI | https://doi.org/10.1109/BioVis.2013.6664353 |
Keywords | Pedigree; data cleaning; genotypes; user evaluation; |
Public URL | http://researchrepository.napier.ac.uk/id/eprint/6454 |
Publisher URL | http://dx.doi.org/10.1109/BioVis.2013.6664353 |
Contract Date | Nov 12, 2013 |
Files
ViperBioVisFinal.pdf
(773 Kb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by-nc/4.0/
You might also like
Vesper: Visualising species archives
(2014)
Journal Article
MaTSE: the gene expression time-series explorer.
(2013)
Journal Article
VIPER: a visualisation tool for exploring inheritance inconsistencies in genotyped pedigrees
(2012)
Journal Article
A comparison of techniques for name matching
(2012)
Journal Article
Visualising errors in animal pedigree genotype data
(2011)
Journal Article
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search