Resilient distributed processing technique (RDPT), in which mapper and reducer are simplified with the Spark contexts and support distributed parallel query processing.
The proposed work is implemented with Pig Latin with Spark contexts to develop query processing in a distributed environment.
Query processing in Hadoop influences the distributed processing with the MapReduce model. MapReduce caters to the works on different nodes with the implementation of complex mappers and reducers. Its results are valid for some extent size of the data.
Pig supports the required parallel processing framework with the following constructs during the processing of queries: FOREACH; FLATTEN; COGROUP.
Lakshmi, C. and Usha Rani, K. (2021), "Improving the performance of query processing using proposed resilient distributed processing technique", International Journal of Intelligent Computing and Cybernetics, Vol. 14 No. 2, pp. 158-169. https://doi.org/10.1108/IJICC-10-2020-0157
Emerald Publishing Limited
Copyright © 2021, Emerald Publishing Limited