Volume 20 No 12 (2022)
Download PDF
DESIGN AND DEVELOPMENT OF OPTIMIZATION ON MAPREDUCE METHOD FOR DATA MINING
Dr. Jitendra Sheetlani , Dr. J. P. Patra Bhramara Bar Biswal
Abstract
MapReduce is a developer-friendly framework that encapsulates the underlying complexities of
distributed computing. It is increasingly being used across enterprises for advanced data analytics,
business intelligence, and data mining tasks. But there are two questions bothering Hadoop users:
how to improve the performance of MapReduce workloads, and how to estimate the time needed
to run a MapReduce job. In this paper, we provide some performance optimization techniques on
the premise of workload characterization. After the cluster achieving the best performance, we
further propose a modeling method to help Hadoop users estimate the execution time of
MapReduce jobs. For evaluation, typical benchmarks are utilized to evaluate the accuracy of our
techniques
Keywords
MapReduce, Data Mining , Hadoop
Copyright
Copyright © Neuroquantology
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Articles published in the Neuroquantology are available under Creative Commons Attribution Non-Commercial No Derivatives Licence (CC BY-NC-ND 4.0). Authors retain copyright in their work and grant IJECSE right of first publication under CC BY-NC-ND 4.0. Users have the right to read, download, copy, distribute, print, search, or link to the full texts of articles in this journal, and to use them for any other lawful purpose.