Friday, August 8, 2014

Introduction to Map-R

Map-R program


What is Map-R program?
Map reduced program often called Map-R program. It is a programming approach to create / process large datasets in a parallel, distributed way in a given cluster.
Map-R program is a combination of Map function and Reduce function.

In general, Map-R program is classified in two major steps.
1)      Map function
2)      Reduce function

Map Step
Master node reads the input, and will divide the task into number of smaller tasks and assigns them into the worker nodes in the cluster.

Reduce Step
            Master node collects the output from all the worked nodes, creates as a single output based on the keys or some other way.

No comments:

Post a Comment