Mingxu Sun
Near-Data Prediction Based Speculative Optimization in a Distribution Environment
Sun, Mingxu; Wu, Xueyan; Jin, Dandan; Xu, Xiaolong; Liu, Qi; Liu, Xiaodong
Abstract
Apache Hadoop is an open source software framework that supports
data-intensive distributed applications and is distributed under the Apache 2.0 licensing agreement, where consumers will no longer deal with complex configuration of software and hardware but only pay for cloud services on demand. So how to make the performance of the cloud platform become more important in a consumer-centric environment. There exists imbalance between in some distribution of slow tasks, which results in straggling tasks will have a great influence on the Hadoop framework. By monitoring those tasks in real-time progress and copying the potential Stragglers to a different node, the speculative execution (SE) realizes to improve the probability of finishing those backup tasks before the original ones. The Speculative execution (SE) applies this principle and thus proposed a solution to handle the Straggling tasks. At present, the performance of the Hadoop system is unsatisfying because of the erroneous judgement and inappropriate selection for the backup nodes in the current SE policy. This paper proposes an SE optimized strategy which can be used in prediction of near data. In this strategy, the first step is gathering the real-time task execution information and the remaining runtime required for the task is predicted by a local prediction method. Then it chooses a proper backup node according to the near data and actual demand in the second step. On the other side, this model also includes a cost-effective model in order to make the performance of SE to the peak. The results show that using this strategy in Hadoopeffectively improves the accuracy of alternative tasks and effects better in heterogeneous Hadoop environments in various situations, which is beneficial to consumers and cloud platform.
Citation
Sun, M., Wu, X., Jin, D., Xu, X., Liu, Q., & Liu, X. (2019, December). Near-Data Prediction Based Speculative Optimization in a Distribution Environment. Presented at 9th EAI International Conference on Cloud Computing (CloudComp 2019), Sydney, Australia
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | 9th EAI International Conference on Cloud Computing (CloudComp 2019) |
Start Date | Dec 4, 2019 |
End Date | Dec 5, 2019 |
Acceptance Date | Oct 20, 2019 |
Publication Date | Dec 4, 2019 |
Deposit Date | Dec 16, 2019 |
Publicly Available Date | Dec 5, 2020 |
Publisher | Springer |
Public URL | http://researchrepository.napier.ac.uk/Output/2392326 |
Files
Near-Data Prediction Based Speculative Optimization In A Distribution Environment
(494 Kb)
PDF
You might also like
Towards Building a Smart Water Management System (SWAMS) in Nigeria
(2024)
Presentation / Conference Contribution
Utilizing the Ensemble Learning and XAI for Performance Improvements in IoT Network Attack Detection
(2024)
Presentation / Conference Contribution
Emotion Recognition on Social Media Using Natural Language Processing (NLP) Techniques
(2023)
Presentation / Conference Contribution