Please enable JavaScript.
Coggle requires JavaScript to display documents.
Ch 12 (Filtering (Only training set can be used to train machine models…
Ch 12
Filtering
Only training set can be used to train machine models
test file
filter
age missing
Only to a subset
Sampling
evaluate and build models
random sampling
Unique identifier, feature, and target
holdout sample
remaining 5 modes
cross validation deemed appropriate and valuable
accuracy
improve do step 2-6 again
Mirror characteristics of pop
Splitting data into different parts
Partial match removal
complete match removal
Home address example
Summarize function