Qi Liu
Near-Data Prediction Based Speculative Optimization in a Distribution Environment
Liu, Qi; Wu, Xueyan; Liu, Xiaodong; Zhang, Yonghong; Hu, Yuemei
Abstract
Hadoop is an open source from Apache with a distributed file system and MapReduce distributed computing framework. The current Apache 2.0 license agreement supports on-demand payment by consumers for cloud platform services, helping users leverage their respective different hardware to provides cloud services. In cloud-based environment, there is a need to balance the resource requirements of workloads, optimize load performance, and the cloud compute costs to manage. When the processing power of clustered machines varies widely, such as when hardware is aging or overloaded, Hadoop offers a speculative execution (SE) optimization strategy, by monitoring task progress in real time, in the starting identical backup tasks on different nodes when multiple tasks under a job are not running at the same speed, providing the first to go. The completed calculations maintain the overall progress of the job. At present, the SE strategy’s incorrect selection of backup nodes and resource constraints may result in poor Hadoop performance, and subsequent tasks cannot be completed execution and other problems. This paper proposes an SE optimization strategy based on near data prediction, which analyzes the prediction of real-time task execution information to predict the required running time, select backup nodes based on actual requirements and approximate data to make the SE strategy achieve the best performance. Experiments prove that in a heterogeneous Hadoop environment, the optimization strategy can effectively improve the effectiveness and accuracy of various tasks and enhance the performance of cloud computing. Platform performance can benefits consumers better than before.
Citation
Liu, Q., Wu, X., Liu, X., Zhang, Y., & Hu, Y. (2022). Near-Data Prediction Based Speculative Optimization in a Distribution Environment. Mobile Networks and Applications, 27(6), 2339-2347. https://doi.org/10.1007/s11036-021-01793-7
Journal Article Type | Article |
---|---|
Acceptance Date | Aug 10, 2020 |
Online Publication Date | Jun 18, 2022 |
Publication Date | 2022-12 |
Deposit Date | Oct 20, 2020 |
Publicly Available Date | Jun 19, 2023 |
Print ISSN | 1383-469X |
Electronic ISSN | 1572-8153 |
Publisher | Springer |
Peer Reviewed | Peer Reviewed |
Volume | 27 |
Issue | 6 |
Pages | 2339-2347 |
DOI | https://doi.org/10.1007/s11036-021-01793-7 |
Keywords | Distributed systems, Hadoop, Speculative execution, Locally weighted regression, Near data prediction |
Public URL | http://researchrepository.napier.ac.uk/Output/2694806 |
Files
Near-Data Prediction Based Speculative Optimization In A Distribution Environment (accepted version)
(1 Mb)
PDF
You might also like
An adaptive approach to better load balancing in a consumer-centric cloud environment
(2016)
Journal Article
Grid Routing: An Energy-Efficient Routing Protocol for WSNs with Single Mobile Sink
(2017)
Journal Article
SMK-means: An Improved Mini Batch K-means Algorithm Based on Mapreduce with Big Data
(2018)
Journal Article