Please enable JavaScript.
Coggle requires JavaScript to display documents.
Data Wrangling, References:
Relational Databases for Data Analysis /…
Data Wrangling
1) Review the data
- View a few rows of data
- View the various data types
- View a quick summary / description of the data
-
Is the data:
- Accurate?
- Valid?
- Representative?
Data inspection techniques:
(Reference: Exploratory Data Analysis (EDA) | Codeacademy)
- .head()—first five rows
- .describe()—numerical summaries
- .info()
- .isnull()—locate missing (null) values
- .unique()—return unique values
Types of Acquired Data
Structured data
Relational databases are the primary means of storage for structured data. The relational model can be viewed as a database model that has multiple tables that each describe a particular entity of the database.
When we consider relational databases as a collection of tables, what we call a schema, we can visualize them with entity-relationship diagrams, which give us a chance to view the data within each table, and how each table relates to the others.
-
-
-