International Journal of Soft Computing

Year: 2016

Volume: 11

Issue: 5

Page No. 312 - 318

DOI: 10.36478/ijscomp.2016.312.318

Download PDF References Abstract

Efficient Big Data Analytics With Optimized Parallel Processing

Authors : S. Sravanthi and K. Thirupathi Rao

Abstract: Now a days the word MapReduce is synonymous with big data processing. Different flavors of it is available over Apache Hadoop being at the core of big data processing. For well versed Java developers there is the direct interaction with the core there is Hive for the SQL proficient one’s and there is Pig for procedural language aware developers. What ever the wrapper being used the core implementation of hadoop is simply is to divide the processing into two disjoint phases. One being the Map function and the other being the reduce function. So far many big data processing implementations are driven with the idea of equal distribution of workloads across processing nodes. We propose a dynamic distributed algorithm that is a processing aware job scheduler that assigns data processing nodes work load based on their prior performance throughputs. Extensive simulations using a 2.4 GB weather temperature conversion datasets demonstrates that our proposals can significantly reduce processing costs while prioritizing working nodes better compared to previous approaches.

How to cite this article:

S. Sravanthi and K. Thirupathi Rao, 2016. Efficient Big Data Analytics With Optimized Parallel Processing. International Journal of Soft Computing, 11: 312-318.

DOI: 10.36478/ijscomp.2016.312.318

URL: https://medwelljournals.com/abstract/?doi=ijscomp.2016.312.318

Related Links

Journals By Subject

International Journal of Soft Computing

Efficient Big Data Analytics With Optimized Parallel Processing

How to cite this article: