Skip to main content

Research Repository

Advanced Search

Near-Data Prediction Based Speculative Optimization in a Distribution Environment

Liu, Qi; Wu, Xueyan; Liu, Xiaodong; Zhang, Yonghong; Hu, Yuemei

Authors

Qi Liu

Xueyan Wu

Yonghong Zhang

Yuemei Hu



Abstract

Hadoop is an open source from Apache with a distributed file system and MapReduce distributed computing framework. The current Apache 2.0 license agreement supports on-demand payment by consumers for cloud platform services, helping users leverage their respective different hardware to provides cloud services. In cloud-based environment, there is a need to balance the resource requirements of workloads, optimize load performance, and the cloud compute costs to manage. When the processing power of clustered machines varies widely, such as when hardware is aging or overloaded, Hadoop offers a speculative execution (SE) optimization strategy, by monitoring task progress in real time, in the starting identical backup tasks on different nodes when multiple tasks under a job are not running at the same speed, providing the first to go. The completed calculations maintain the overall progress of the job. At present, the SE strategy’s incorrect selection of backup nodes and resource constraints may result in poor Hadoop performance, and subsequent tasks cannot be completed execution and other problems. This paper proposes an SE optimization strategy based on near data prediction, which analyzes the prediction of real-time task execution information to predict the required running time, select backup nodes based on actual requirements and approximate data to make the SE strategy achieve the best performance. Experiments prove that in a heterogeneous Hadoop environment, the optimization strategy can effectively improve the effectiveness and accuracy of various tasks and enhance the performance of cloud computing. Platform performance can benefits consumers better than before.

Citation

Liu, Q., Wu, X., Liu, X., Zhang, Y., & Hu, Y. (2022). Near-Data Prediction Based Speculative Optimization in a Distribution Environment. Mobile Networks and Applications, 27(6), 2339-2347. https://doi.org/10.1007/s11036-021-01793-7

Journal Article Type Article
Acceptance Date Aug 10, 2020
Online Publication Date Jun 18, 2022
Publication Date 2022-12
Deposit Date Oct 20, 2020
Publicly Available Date Jun 19, 2023
Print ISSN 1383-469X
Electronic ISSN 1572-8153
Publisher Springer
Peer Reviewed Peer Reviewed
Volume 27
Issue 6
Pages 2339-2347
DOI https://doi.org/10.1007/s11036-021-01793-7
Keywords Distributed systems, Hadoop, Speculative execution, Locally weighted regression, Near data prediction
Public URL http://researchrepository.napier.ac.uk/Output/2694806

Files


Near-Data Prediction Based Speculative Optimization In A Distribution Environment (accepted version) (1 Mb)
PDF







You might also like



Downloadable Citations