Please enable JavaScript.
Coggle requires JavaScript to display documents.
Chapter 12: Data Reduction and Splitting (Unique Rows (12.1) (Removing…
Chapter 12: Data Reduction and Splitting
Splitting rows
Unique Rows (12.1)
Removing duplicate rows
1.) Partial Match Removal
Removal of full rows based on the identical content of a few colimns
2.) Complete match removal
Removal based on identical content in all columns
To conduct
1.) Listing rows to keep first
2.) Which columns are identical
Apply unique function
Keeps only unique rows
Summarize function
Into with how many times there was an occurence
Make assumptions
Filtering (12.2)
Split up data into 2 tables based on characteristics
Union example.
Must be uniform
Trains machine learning
Often very simple
Often with measurements
Example: KG/oz for dairy company