1170709
Assignment 2-Usage of Map Reduce
Governors State University
Introduction:
MapReduce is one of the main topic in the Hadoop technology as it plays key role to arrange the large amount of data in to clusters.MapReduce came in to picture as it is effective in handling and processing the large amount of data.The main aim of the MapReduce is to read the data and filter the data according to the business logic.we are going to discuss more about the MapReduce in the following business problem.
Business Problem:
The main problem in todays industries are facing the excess amount of data in there data storage to retrieve or place the data in an order .The amount …show more content…
MapReduce Implements various algorithms to break the data or filter the various data .as Mapreduce framework uses different programming languages like pyton,java etc It uses searching,Sorting,Indexing etc to make the data break drown in to simple format . The data will be sorted based up on the keys and shuffles in the shuffle and sort block which helps the reducer block to make easily reduce the data based on the business logic.The Reducer phase data can be filtered and it can be combined it in to number of ways to make it easily accessible.In the output phase the output translates the grouped key-valued pairs in to the one file system for easy access of the data.The output files we are generating are savd in the HDFS only in the file