Oozie Workflow | Hadoop
61K views
Oct 24, 2024
Oozie Workflow | Hadoop
View Video Transcript
0:00
In this video we are discussing OZ workflow. OZ workflow is nothing but sequence of actions to be carried out and represented in the form of directed a cyclic graph
0:12
Each and every action is dependent on the previous action. If the previous action works well and has its completion successfully, then its output will be going to the next action
0:24
So let us go for more detailing about this OZ workflow. So, the workflow techniques in OZ
0:32
So the OZ workflow is a sequence of actions arranged in a DAG, that is a directed acyclic graph
0:39
and these actions are dependent on the previous tasks or actions. And one action can run properly if the previous one returns a valid output to it
0:50
So, workflow actions can be different kind of tasks, like our hype task, we can have the big task
0:58
We can have the Java action, shell action, etc. So, different kinds of tasks, it can do the scheduling
1:05
In a flow of different sequential tasks, some tasks can be performed in parallel, some
1:11
tasks can be performed only in sequential. So to execute some tasks in parallel, we can use the fork option in OZ
1:20
So if I want to execute some tasks in parallel, then we'll be using the fork option so
1:26
that these two tasks can execute in parallel and later we can go for the join option so that again
1:33
the tasks will get the sequence will get merged so the join option is used to merge two parallel tasks into one So let us discuss this one with one proper diagram Just find this one
1:47
This is the different components of OZ workflow. Just see. Here we are having this start
1:53
Then we are having this MR job that is a map reduce job. We are having this peak job. So two jobs
1:59
are there, then OZ will do the fork. So, as a fork, so these two jobs will be executed in parallel, then join will be there
2:08
then it will have the proper completion and end. Nodes in OZ workflow
2:17
So there are mainly three control nodes in the OZ workflow and they are start, end, and
2:24
the kill nodes. You can find here we're having the start, we're having the end and the kill node
2:29
So, there are mainly three control flows in our OZ control flow nodes in our OZ workflow
2:36
So, a workflow can start from the start node and ends at the end node, but when an error
2:42
occurs, it moves to the kill node to kill the process because the process has got some interruption
2:49
So here you can find here we're having this start. So now this map-produce word count is the process to be executed
2:56
So if it faces any kind of error, then the kill node. will be reached, otherwise it will go to the end if the process has got okay with the operation
3:05
and processing. So, this is the basic overview of this OZ workflow and we have explained this one with proper
3:12
diagram and explanation. Thanks for watching this video
#Programming