Please enable JavaScript.
Coggle requires JavaScript to display documents.
Chapter 9: Data Integration (Unions- allow us to access more observations,…
Chapter 9: Data Integration
Why do we need more data
More data helps us make better predictions
Allows us to look for additional features of interest that we haven't currently found
Joins- allow us to access more features
Combines two data sets with a shared identitty value
After a join, new rows are created
Types of joins
Inner Join
Used most commonly for data integration process
Outer join
Left outer join
Right outer join
Full outer join
Godd for connecting different lists if there are over;a[s between rows and a shared identifier
Unions- allow us to access more observations
Based on the assumption that there are multiple columns in common.
Allows to combine two datasets
Used when there are datasets with unique sets of cases sharing the same or similar columns