Machine Learning - Data Description - Boxplot with Five Number Summary
11K views
Oct 17, 2024
Machine Learning - Data Description - Boxplot with Five Number Summary https://www.tutorialspoint.com/market/index.asp Get Extra 10% OFF on all courses, Ebooks, and prime packs, USE CODE: YOUTUBE10
View Video Transcript
0:00
In this video we are discussing box plot with five number summary
0:07
A box plot is a graph of data set obtained by drawing a horizontal line from the minimum
0:13
data to the value Q1 and drawing a horizontal line from Q3 to the maximum data value
0:21
Here Q1, Q1, they are nothing but quartile 1 and quartile 3
0:27
And drawing a box whose vertical sides pass the number. through Q1 and Q3 with a vertical line inside the box appearing through the median or Q2
0:39
So, this is the Q2 will be known as the second quartile can also be called as a median
0:45
So a box plot can be used to graphically represent the data set and these plots involve
0:51
five specific values. So the first one is the lowest value of the data set, that is the minimum
0:59
we are having the first quartile then the median also can be called as Q2 second quartile Then we are having the third quartile Q3 and the highest value of the data set that is the maximum
1:11
These values are called a five number summary of the dataset. I think to make the conception clear, we require one example to explain
1:22
Let us suppose we are having one set of data on which we are supposed to draw a box plot
1:27
So, the data set is 89 47 164 in this way we're having this set of data and this data
1:35
is not obviously shorted. Construct a box plot for the data. So, arrange the data, there is a step number one, arrange the data in the order
1:44
So, from the lowest value to the maximum value, we have arranged all this data in the ascending order
1:52
Now step two, find the median. We know that here we are having 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 data
2:00
So, 10 is even. So I cannot find the middle most values In that case I shall be taking the average of the fifth and sixth value so 1 2 3 4 5 so 78 and 89 that average will be the respective median
2:15
also will be represented by Q2 or the second quartile so now we are going to find
2:21
the Q1 so here it is Q1 is equal to 47 what is the median here that is 83.5 so
2:30
83.5 is falling here so in this particular particular set of data calculate its median value. So, now we are getting this 47 as the
2:39
middlemost one. So Q1 is equal to 47. Now we are trying to find out the Q3. So here Q3
2:46
is equal to 164. How did I get this one? So 83.5 is the median. So 83.5 will be
2:54
lying here. So here in this upper part we are having 1, 2, 3, 4, 5 data. So take the middle most
3:01
I am taking this one as 164. So we have got the value of Q1 Q2 and Q3 And draw a scale for the for the data on the X axis And step six is that locate the lowest value we know the lowest value what is the lowest value here that is a 30 q1 already we have calculated
3:21
that is 47 median that is our q2 that is 83.5 and then q3 already we have calculated that one
3:29
has 164 and the highest value of the scale and that is nothing but 296 draw a box
3:37
around Q1 and Q3 draw a vertical line through the median and connect the upper value and
3:45
the lower value to the box. So here we are connecting this upper, the lower value to the box and upper value to the box
3:52
And here we are drawing one box from Q1 to Q3 and we are drawing one vertical line
3:58
in the across this box for our Q2. So Q1, Q2, Q3, there is the highest value, there is the lowest value
4:07
So, in this way, we have shown you that how this box plot can be done using five numbers
4:14
Thanks for watching this video
#Machine Learning & Artificial Intelligence