Types of Data under Big Data
5K views
Oct 24, 2024
Types of Data under Big Data Watch more Videos at https://www.tutorialspoint.com/videotutorials/index.htm Lecture By: Mr. Arnab Chakraborty, Tutorials Point India Private Limited
View Video Transcript
0:00
In this video we are discussing types of data under big data
0:04
So, there are three main types of data which will be existing under big data
0:09
One is the structured data, unstructured data and semi-structured data. Structured data means that data which is getting generated from, say, web blocks, from different sensor data, machine generated data
0:22
or the data which you are collecting through the surveys, directly from the human beings and so on
0:28
so like their names, sex, address, date of birth and so on. Here this data will be represented or can be
0:36
represented in the form of rows and columns. So, database is a good example of structured data
0:42
So, next data is our unstructured data. Unstructured data means the sensor generated images
0:49
the videos, the PDF, the text file, so they cannot be expressed in the terms of or cannot be
0:55
represented in the terms of rows and columns will be known as the unstructured data
1:01
And the last category is our semi data In case of semi data to some extent it is structured to some extent it is unstructured So JSON files XML files can be considered as a semi data So let us go for some more detailing
1:18
The big data are categorized into three different types, that is a structured data
1:24
unstructured data, and semi-structured data. So let me discuss each one of them one by one
1:31
So at first we are going for this database. You can find that this is a good example of a structured data
1:38
We're having certain columns. So there is the employee number, name, age, department and salary
1:43
And here we're having the respective roles containing the respective information. So here we're having the records and here we're having the columns
1:51
So structured data are those type of data which are stored already in an order
1:57
And there are nearly 20% of the total existing data nowadays and these data are structured
2:04
and all the data generated from sensors, web blocks, and these all machine generated structured data
2:10
So, these are the machine generated structure data. There is a web blocks, there is our sensor data, and so on
2:17
The human generated structured data are those which are taken as information from a human like their names addresses gender and data part and so on
2:27
So, the example of structured data is database. So now let us concentrate on the unstructured data
2:34
So, these are unstructured data. So the unstructured data have no clear format in storage, and we can store structured data in rows and columns database
2:43
but unstructured data cannot be stored like. cannot be stored like that. So, unstructured data cannot be stored in the form of rows
2:49
and columns. And we are having at least 80% of the data nowadays existing which are
2:57
unstructured. All satellite generated images, scientific data, or images are categorized as machine generated unstructured data. So the images which will be sent by the respective
3:08
satellites can be treated falling in the category of unstructured data. There are various
3:15
types of human generated unstructured data and these are the images videos social media
3:20
data etc so the example of unstructured data are the text documents pdfs images and videos etc so this is the theory or this is the concept against this and structured data so now let us come to the last category that is a semi data so it is
3:39
very difficult to categorize this type of data sometimes they look structured
3:43
and sometimes they will be looking as unstructured so that's why these data are
3:48
known as semi-structured data we cannot store this type of data using traditional
3:53
database format and but it contains some organization properties. And the examples of semi-structured data are spreadsheet files which we have in our Excel
4:04
in our calc, we are having the spreadsheet files. So, spreadsheet file is a good example of semi-structured data
4:11
We're having the XML or JSON documents. There is a JavaScript object notation
4:17
So extended markup language. There is a full form of XML. And no SQL database are the data items which are falling under this unstructured data
4:26
So, no skill is one kind of database where we can keep this type of data in a very efficient way
4:34
So, in this video we have discussed that what are the different types of data that big data
4:39
is going to handle. Thanks for watching this video
#Cloud Storage
#Computer Science
#Data Formats & Protocols
#Data Management
#Enterprise Technology
#File Sharing & Hosting
#Networking
#Programming
#Web Stats & Analytics