Dominic Davies-Tagg
Data Temperature Informed Streaming for Optimising Large-Scale Multi-Tiered Storage
Davies-Tagg, Dominic; Anjum, Ashiq; Zahir, Ali; Liu, Lu; Yaseen, Muhammad Usman; Antonopoulos, Nick
Authors
Ashiq Anjum
Ali Zahir
Lu Liu
Muhammad Usman Yaseen
Prof Nick Antonopoulos N.Antonopoulos@napier.ac.uk
Deputy Vice Chancellor and Vice Principal of Research & Innovation
Abstract
Data temperature is a response to the ever-growing amount of data. These data have to be stored, but they have been observed that only a small portion of the data are accessed more frequently at any one time. This leads to the concept of hot and cold data. Cold data can be migrated away from high-performance nodes to free up performance for higher priority data. Existing studies classify hot and cold data primarily on the basis of data age and usage frequency. We present this as a limitation in the current implementation of data temperature. This is due to the fact that age automatically assumes that all new data have priority and that usage is purely reactive. We propose new variables and conditions that influence smarter decision-making on what are hot or cold data and allow greater user control over data location and their movement. We identify new metadata variables and user-defined variables to extend the current data temperature value. We further establish rules and conditions for limiting unnecessary movement of the data, which helps to prevent wasted input output (I/O) costs. We also propose a hybrid algorithm that combines existing variables and new variables and conditions into a single data temperature. The proposed system provides higher accuracy, increases performance, and gives greater user control for optimal positioning of data within multi-tiered storage solutions.
Citation
Davies-Tagg, D., Anjum, A., Zahir, A., Liu, L., Yaseen, M. U., & Antonopoulos, N. (2024). Data Temperature Informed Streaming for Optimising Large-Scale Multi-Tiered Storage. Big Data Mining and Analytics, 7(2), 371-398. https://doi.org/10.26599/bdma.2023.9020039
Journal Article Type | Article |
---|---|
Online Publication Date | Apr 22, 2024 |
Publication Date | 2024-06 |
Deposit Date | Aug 8, 2024 |
Publicly Available Date | Aug 8, 2024 |
Journal | Big Data Mining and Analytics |
Print ISSN | 2096-0654 |
Electronic ISSN | 2097-406X |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Peer Reviewed | Peer Reviewed |
Volume | 7 |
Issue | 2 |
Pages | 371-398 |
DOI | https://doi.org/10.26599/bdma.2023.9020039 |
Files
Data Temperature Informed Streaming For Optimising Large-Scale Multi-Tiered Storage
(11.4 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
You might also like
Context-aware service utilisation in the clouds and energy conservation
(2012)
Journal Article
Achieving green IT using VDI in cyber physical society.
(2013)
Journal Article
Virtual vignettes: the acquisition, analysis, and presentation of social network data
(2014)
Journal Article
A critical comparative evaluation on DHT-based peer-to-peer search algorithms
(2014)
Journal Article
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search