Journal of Engineering and Applied Sciences

Year: 2019
Volume: 14
Issue: 13
Page No. 4367 - 4374

Data Summary Techniques based on MapReduce in Big Data

Authors : Jeong-Joon Kim

Abstract: Wavelet, known as one of summary construction techniques was applied to feature extraction for multimedia data. Wavelet histogram is a summary technique which grafts wavelet on to histogram considered as a typical summary technique used in query optimization of database system and processing approximate query, etc. Wavelet histogram which combines merits of wavelet and histogram can generate a lossless optimal data summary of original data. In the existing studies, it needed more than one MapReduce job to construct local wavelet histogram of partial data stored in each node. In addition, it took a lot of time to construct the global wavelet histogram which is the combination of all local distributed wavelet histograms. Because the error bound for data reconstructed from wavelet histogram was not considered, there is a shortcoming that we cannot control the error of reconstructed data beforehand. In this thesis, we developed a wavelet histogram construction system which can construct wavelet histogram fast by one MapReduce job. Since, the error bound can beset before the construction of wavelet histogram, we can control the error of data reconstructed from wavelet histogram under the error bound. Finally, the efficiency of our wavelet histogram construction system was proved by comparing our system with others.

How to cite this article:

Jeong-Joon Kim , 2019. Data Summary Techniques based on MapReduce in Big Data. Journal of Engineering and Applied Sciences, 14: 4367-4374.

Design and power by Medwell Web Development Team. © Medwell Publishing 2024 All Rights Reserved