Brief History on Hadoop
671 views
Nov 28, 2024
Brief History on Hadoop
View Video Transcript
0:00
In this video we are discussing brief history on Hadoop
0:05
Whenever we are having a small set of data on cloud, then our traditional system could handle that one, could did the data ysis, report generation, forecasting, whatever
0:15
But whenever we are having the huge amount of data, there is a big data is available on the cloud, then ordinary traditional system cannot handle that one
0:25
So that's why we required the solution and the solution is Hadoop
0:29
So let us go for further discussion on brief history on hadup
0:33
Let us discuss a brief history on hadoo. So why hadoob have got originated or why this hadup came in the market
0:43
So before developing the hardup solution, the traditional approach was used to store our data
0:49
And in the traditional approach, it can work fine if the data is not large
0:54
And when it tries to work with a huge amount of data
0:59
the traditional approach fails to do the task accordingly and that's the reason why this
1:05
Hadoop has come in the scene so Google's will Google has provided one solution
1:12
so what is the Google's solution so Google solved this problem by an algorithm
1:16
called the MapReduce and these algorithm divides the task into smaller parts and assign them to many low computers and after finishing the job it takes the output from different systems and merge them all to create the
1:32
final required output out of the data and haddub was originally developed for the nudge search engine project
1:41
by dog cutting and his group so we know that haddub is nothing but one
1:48
open source tool from the ASF that is the Apache Software Foundation so open source
1:55
project means it is freely available and you can even change its source code as
2:01
part the requirements we're having so if certain functionality does not fulfill our
2:08
need then we can change it accordingly and according to our need and most of the
2:15
had-up code is written by Yahoo IBM, Facebook, cloud data, etc. And it provides an efficient framework for running jobs on multiple nodes
2:26
of clusters. A cluster means a group of systems connected via LAN and Apache Hadoop provides the
2:35
parallel processing of data as it works on multiple machines simultaneously. And that is the
2:42
brief history of Hadoop. Thanks for watching this video
#Data Management
#Distributed & Cloud Computing
#Programming
#Software