Skip to main content

Research Repository

Advanced Search

MLCut: exploring multi-level cuts in dendrograms for biological data

Vogogias, Athanasios; Kennedy, Jessie; Archambault, Daniel; Anne Smith, V; Currant, Hannah

Authors

Athanasios Vogogias

Daniel Archambault

V Anne Smith

Hannah Currant



Contributors

Cagatay Turkay
Editor

Tao Ruan Wan
Editor

Abstract

Choosing a single similarity threshold for cutting dendrograms is not sufficient for performing hierarchical clustering analysis of heterogeneous data sets. In addition, alternative automated or semi-automated methods that cut dendrograms in multiple levels make assumptions about the data in hand. In an attempt to help the user to find patterns in the data and resolve ambiguities
in cluster assignments, we developed MLCut: a tool that provides visual support for exploring dendrograms of heterogeneous data sets in different levels of detail. The interactive exploration of the dendrogram is coordinated with a representation of the original data, shown as parallel coordinates. The tool supports three analysis steps. Firstly, a single-height similarity threshold can be applied using a dynamic slider to identify the main clusters. Secondly, a distinctiveness threshold can be applied using
a second dynamic slider to identify “weak-edges” that indicate heterogeneity within clusters. Thirdly, the user can drill-down to further explore the dendrogram structure - always in relation to the original data - and cut the branches of the tree at multiple levels. Interactive drill-down is supported using mouse events such as hovering, pointing and clicking on elements of the dendrogram. Two prototypes of this tool have been developed in collaboration with a group of biologists for analysing their
own data sets. We found that enabling the users to cut the tree at multiple levels, while viewing the effect in the original data, is a promising method for clustering which could lead to scientific discoveries

Presentation Conference Type Conference Paper (Published)
Conference Name Computer Graphics & Visual Computing (CGVC) 2016
Start Date Sep 15, 2016
End Date Sep 16, 2016
Acceptance Date Jul 26, 2016
Online Publication Date Sep 15, 2016
Publication Date Sep 15, 2016
Deposit Date Sep 20, 2016
Publicly Available Date Sep 20, 2016
Book Title Computer Graphics and Visual Computing (CGVC)
ISBN 978-3-03868-022-2
DOI https://doi.org/10.2312/cgvc.20161288
Keywords hierarchical; clustering; computer graphics; viewing algorithms; information search and retrieval; information
Public URL http://researchrepository.napier.ac.uk/Output/382443
Related Public URLs https://diglib.eg.org/handle/10.2312/cgvc20161288

https://diglib.eg.org/handle/10.2312/2630862
Contract Date Sep 20, 2016

Files

MLcut: exploring multi-level cutes in dendrograms for biological data. (3.7 Mb)
PDF







You might also like



Downloadable Citations