Please enable JavaScript.
Coggle requires JavaScript to display documents.
Chapter 9: Data Integration (Two Methods (Joins (Types of joins (Inner…
Chapter 9: Data Integration
Adding additional features/ obtaining more observations
Two Methods
Joins
access more features
combines two datasets with a shared identity value
using the language of tables
Types of joins
Inner Join
requires the same identity value
data integration processes
Outer Join
Left, Right, Full
Unions
to access more observations
based on the assumption that there are multiple columns between A and C
combine two datasets (contain unique sets of sharing the same or very similar columns
full outer join: connects lists if there are overlaps between rows
difference between the two: machine learning/ training is key
Real time example: Salesforce in bringing students' information in one space