Shuffle & Sorting of MapReduce Task

Name: Shuffle & Sorting of MapReduce Task | Humix Video
Uploaded: 2024-11-18T07:14:39+00:00
Duration: 2 min 55 s
Description: Shuffle & Sorting of MapReduce Task

0:00
Shuffling and Sorting of MapReduce Task
0:03
In between the mapper and the reducer, the shuffling and sorting tasks will be performed
0:10
But if there is no reducer, that is, if the mapper output is the final output, then there is no need to have any
0:16
shuffling or sorting tasks. So let us go for some more discussion on it
0:23
So what is shuffling? We are starting with shuffling first. The mapper creates the intermediate key-value pairs
0:29
and transfers them to the reducer task, and this procedure is known as shuffling
0:35
So, in the case of shuffling, some reordering of these key-value pairs will take place
0:40
The mapper will take the key-value pairs and do some processing on them. The developer will provide the business logic in the mapper to do the intended processing
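
The mapper step described above can be sketched in a few lines of Python. This is only an illustration, not Hadoop's actual API: the function name `word_count_mapper` and the word-count business logic are assumptions chosen for the example. It takes one input key-value pair (a line offset and the line text) and emits intermediate key-value pairs.

```python
from typing import Iterator, Tuple


def word_count_mapper(offset: int, line: str) -> Iterator[Tuple[str, int]]:
    """Hypothetical mapper: the 'business logic' here is word counting.

    Input pair:  (offset of the line, text of the line)
    Output pairs: (word, 1) for every word in the line
    """
    for word in line.lower().split():
        yield (word, 1)


# One input record goes in, several intermediate key-value pairs come out.
pairs = list(word_count_mapper(0, "Deer Bear River Deer"))
print(pairs)
```

Note that the mapper emits the pairs in the order it sees the words; no ordering by key has happened yet.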
0:51
And then the mapper will also produce its output in the form of key-value pairs
0:55
and this output is known as the intermediate result, and it will be stored on the local disk
1:01
not on HDFS. And shuffling is the process with the help of which these key-value pairs will go
1:10
to the reducer in some different order. So, using the shuffling procedure, the system can sort the data using the key values
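
As a rough sketch of the shuffling idea just described, the Python below moves intermediate pairs from several mappers to reducers, so that every value for a given key lands at the same reducer and arrives ordered by key. The names `partition` and `shuffle` are hypothetical, and CRC32 is an assumption used here as a deterministic stand-in for a hash partitioner; it is not what Hadoop itself computes.

```python
import zlib
from collections import defaultdict


def partition(key: str, num_reducers: int) -> int:
    # Assumption: CRC32 as a deterministic stand-in for a hash partitioner.
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def shuffle(mapper_outputs, num_reducers):
    """Route (key, value) pairs from all mappers to reducers.

    mapper_outputs: one list of (key, value) pairs per mapper.
    Returns one list per reducer of (key, [values]) entries, sorted by key.
    """
    buckets = [defaultdict(list) for _ in range(num_reducers)]
    for output in mapper_outputs:
        for key, value in output:
            buckets[partition(key, num_reducers)][key].append(value)
    # Each reducer receives its keys in sorted order.
    return [sorted(bucket.items()) for bucket in buckets]


mapper_outputs = [
    [("deer", 1), ("bear", 1)],   # output of mapper 1
    [("river", 1), ("deer", 1)],  # output of mapper 2
]
reducer_inputs = shuffle(mapper_outputs, num_reducers=2)
for reducer_id, entries in enumerate(reducer_inputs):
    print(reducer_id, entries)
```

Which reducer a particular key ends up on depends on the hash function, so the sketch only guarantees that a key is never split across reducers.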
1:19
So depending upon the values of the keys, this shuffling will take place; that means reordering of these key-value pairs will be done for the sorting operation. The shuffling task begins when some of the mapping tasks are done. So the shuffling task will begin
1:34
when some of the mappers have produced some output. So it is not waiting for
1:39
the completion of all mappers. So it is a faster task. As a result of that, when some of the
1:45
mappers have completed their operations and outputs have been obtained, then
1:49
the shuffling operation will start working on them. So this is a faster process, and it will
1:54
not wait for the completion of all the mapper tasks. Next, we are going for what is
2:03
sorting. So, the MapReduce framework automatically sorts the data by key
2:09
on the output of the mapper. So before sending it to the reducer, all the key-value pairs will be
2:15
sorted. And as a result of that, the reducer will take
2:19
less time to do the reduce operation. The reducer can easily tell when a new reduce
2:26
task should start from the sorted key-value pairs. And if the user sets no reducer task,
2:32
that is, if there is no reducer task, then the shuffling and sorting phase will not take place. If there
2:39
is no reducer, there is no need to have any shuffling or sorting. The task will end after the
2:45
mapper task. So in this way, in this discussion, we have covered
2:49
what shuffling and sorting are in MapReduce. Thanks for watching this video
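
To make the two points about sorting concrete, here is a minimal Python sketch, assuming a word-count style job. The function `run_job` and its parameters are hypothetical and only simulate the behaviour in memory. With a reducer, the pairs are sorted by key, so a change of key in the sorted stream marks where a new reduce call starts. With zero reducers (in Hadoop's Java API this corresponds to calling `job.setNumReduceTasks(0)`), the mapper output is the final output and no sorting is done.

```python
from itertools import groupby
from operator import itemgetter


def run_job(map_output, num_reducers):
    """Simulate the tail end of a MapReduce job on in-memory pairs."""
    if num_reducers == 0:
        # Map-only job: no shuffle, no sort, mapper output is final.
        return list(map_output)

    # Sort by key only; equal keys become adjacent.
    sorted_pairs = sorted(map_output, key=itemgetter(0))

    results = []
    # groupby starts a new group whenever the key changes, which is how
    # sorted input tells the reducer that a new reduce call begins.
    for key, group in groupby(sorted_pairs, key=itemgetter(0)):
        results.append((key, sum(value for _, value in group)))
    return results


map_output = [("deer", 1), ("bear", 1), ("river", 1), ("deer", 1)]
with_reducer = run_job(map_output, num_reducers=1)
map_only = run_job(map_output, num_reducers=0)
print(with_reducer)
print(map_only)
```

Because equal keys sit next to each other after sorting, the reducer never has to search the whole input for the remaining values of a key, which is why the sorted input saves it time.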

Tutorialspoint
