MapReduce and Design Patterns - Structure of Numerical Summarization Pattern
2K views
Oct 18, 2024
MapReduce and Design Patterns - Structure of Numerical Summarization Pattern https://www.tutorialspoint.com/market/index.asp Get Extra 10% OFF on all courses, Ebooks, and prime packs, USE CODE: YOUTUBE10
View Video Transcript
0:00
Structure of numerical summarization pattern
0:04
We know that whenever we're dealing with a big set of data, then those data are to be
0:09
categorized or to be clapped depending upon some criteria, and then we are supposed to
0:14
calculate the summary on the numerical values. So summary means some statistical functions
0:20
So let us discuss structure of numerical summarization pattern. The numerical summarization pattern has mapper
0:28
the combiner and the reducer and this combiner is optional so the mapper finds the keys that
0:36
consists of each field to group by and the values consisting of numerical items so here the
0:43
mapper will be dealing with some key value pairs as usual and mapper will find the keys
0:49
that consists of each field of group by and values will be consisting of the respective
0:55
numerical items because until and unless the values are numerical we cannot
1:00
apply any statistical functions over them these outputs are intermediate and these will not be stored onto the HDFS the mapper output does not get
1:11
stored onto the HDFS because it is the intermediate output so its output will be
1:16
saved on the local disk the compiler is optional and it is good practice to
1:22
use combiners to decrease the overhead of the reducer. So, combiner will do some reducing task before the reducer. So, that's why
1:32
combiner can also be called as mini The combinator usage is optional but if we use the combiner then obviously that will reduce the over rate on the reducer The combiner reduces
1:45
the number of key value pairs of each split and sends it to the respective reducer. So
1:51
combiner will do some clubbing. So we'll calculate some statistical values out of some set
1:57
of similar keys. So that's why it is doing a reducing operation
2:02
prior to the dataset is made available to the actual reducer. So, here you can get the respective idea
2:11
So, here we are having multiple mappers, and these mappers will produce the key and
2:16
the summary field values like in this way. And they will be available to the respective partitioner
2:22
And then this partitioner will do the suffling and sorting, and the values will be, the
2:27
key value peers will be available to the respective reducer, and reducer will produce the
2:32
final summary summary of the data depending upon the same key values. So, here this summary
2:38
will be obtained by applying some of the statistical functions. So the numerical summarization
2:45
pattern structure we have shown here in this diagram. So finally, the reducer phase
2:50
it receives the numerical values associated with the keys to perform the actual reducing
2:57
task. And that is the structure of numerical summarization pattern. So, in this way, using this particular diagram, we have got the idea that how this numerical
3:07
summarization pattern is actually playing its role in our MapReduce applications. Thanks for watching this video
#Computer Science
#Engineering & Technology
#Programming
#Software