Data Science - What is Data?

What is Data?

Data is a collection of information.

One purpose of Data Science is to structure data, making it interpretable and easy to work with.

Data can be categorized into two groups:

  • Structured data
  • Unstructured data

Unstructured Data

Unstructured data is not organized. We must organize it before we can analyze it.
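As a minimal sketch of what organizing can mean, the snippet below pulls the numbers out of a free-text note and stores them in a list. The note text, its values, and the names `raw_note` and `readings` are hypothetical and only serve as illustration; real unstructured data usually needs more careful handling.

```python
import re

# Hypothetical free-text note; the wording and values are made up for illustration.
raw_note = "First reading was 80, then 85 after a break, and finally 90."

# Find every run of digits in the text and convert each one to an integer.
readings = [int(match) for match in re.findall(r"\d+", raw_note)]

print(readings)  # [80, 85, 90]
```

Once the values sit in a list, they can be sorted, counted, or averaged without reading the sentence again.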



Structured Data

Structured data is organized and easier to work with.


How to Structure Data?

We can use an array or a database table to structure or present data.

Example of an array:

[80, 85, 90, 95, 100, 105, 110, 115, 120, 125]

The following example shows how to create the same array in Python, where it is written as a list:


Array = [80, 85, 90, 95, 100, 105, 110, 115, 120, 125]
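A database table is the other option mentioned above. The sketch below stores the same ten values in a table using Python's built-in `sqlite3` module. The table name `measurements` and the column names `id` and `value` are assumptions chosen for this illustration, not part of the tutorial's data set.

```python
import sqlite3

values = [80, 85, 90, 95, 100, 105, 110, 115, 120, 125]

# Use an in-memory database so nothing is written to disk.
conn = sqlite3.connect(":memory:")

# Hypothetical table layout: one row per value, with an automatic row id.
conn.execute("CREATE TABLE measurements (id INTEGER PRIMARY KEY, value INTEGER)")
conn.executemany(
    "INSERT INTO measurements (value) VALUES (?)",
    [(v,) for v in values],
)

# Read the rows back in insertion order.
rows = conn.execute("SELECT id, value FROM measurements ORDER BY id").fetchall()
for row in rows:
    print(row)

conn.close()
```

Each row pairs a value with an id, so a single value can be looked up or filtered with a query instead of by its position in a list.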

It is common to work with very large data sets in Data Science.

To keep the concepts of Data Science as easy as possible to understand, this tutorial works with a small data set that is easy to interpret.

