Chapter 5: Overfitting and Its Avoidance
Generalization
Table Model
Memorizes the training data and performs no generalization
Generalization
Property of a model or modeling process, whereby the model applies to data that were not used to build the model
Overfit
A model that does not generalize at all beyond the data that were used to build it; it is tailored, or "fit," perfectly to the training data
Overfitting
Trade-off between model complexity and the possibility of overfitting
The best strategy is to recognize overfitting and to manage complexity in a principled way
Overfitting Examined
Holdout Data and Fitting Graphs
Fitting Graph
: Shows the accuracy of a model as a function of complexity
Holdout Data
: "Hold out some data for which we know the value of the target variable, but which will not be used to build the model"
Like creating a "lab test" of generalization performance
We will hide from the model the actual values for the target on the holdout data
Generalization Performance
- Estimated by comparing the model's predicted values with the hidden true values
More overfitting when the model is more complex
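The holdout idea can be illustrated with a tiny sketch (hypothetical toy data, plain Python): a table model that memorizes its training instances scores perfectly on the training data but cannot generalize to held-out instances it has never seen.

```python
# Hypothetical toy data: (feature1, feature2) -> class label
train = {(1, 0): "churn", (0, 1): "stay", (1, 1): "churn"}
holdout = {(0, 0): "stay", (2, 1): "churn"}  # hidden during model building

def table_model(x):
    # Pure memorization: answers only for instances seen during training
    return train.get(x)

# Estimate generalization performance by comparing predictions
# against the hidden true values on the holdout data
train_acc = sum(table_model(x) == y for x, y in train.items()) / len(train)
holdout_acc = sum(table_model(x) == y for x, y in holdout.items()) / len(holdout)
print(train_acc, holdout_acc)  # perfect on training data, zero on holdout
```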
Overfitting in Tree Induction
Any training instance given to the tree for classification will make its way down, eventually landing at the appropriate leaf
Will it generalize?
A tree should be slightly better than a lookup table because it will give every unseen instance some classification
Rather than simply failing on every unmatched instance
Overfitting in Mathematical Functions
Functions become more complex as more variables (dimensions) are added
As you increase the dimensionality, you can perfectly fit larger and larger sets of arbitrary points
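As a one-dimensional analogue of this (a sketch using NumPy, with illustrative numbers): a polynomial with as many parameters as data points can fit any set of points exactly, just as adding dimensions lets a function fit arbitrary points perfectly.

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.arange(6.0)
y = rng.normal(size=6)          # six arbitrary target values

# A degree-5 polynomial has 6 coefficients: one parameter per point
coefs = np.polyfit(x, y, deg=5)
fitted = np.polyval(coefs, x)
print(np.allclose(fitted, y))   # the "model" hits the arbitrary points exactly
```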
Companies that do data science-driven targeting of online display advertising can build thousands of models each week
Why is Overfitting Bad?
Every data set is a finite sample of a larger population
It will be necessary to have a holdout set to detect overfitting
Cross-Validation
: More sophisticated holdout training and testing procedure
Gives statistics on the estimated performance: mean and variance
Computes its estimates over all the data by performing multiple splits and swapping out samples for testing
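A minimal sketch of the fold mechanics, using only the standard library and a deliberately trivial majority-class "model" (all data and parameters here are illustrative):

```python
import random
from statistics import mean, pstdev

random.seed(0)
# Illustrative data: (feature, label) pairs with a roughly 60/40 class split
data = [(random.random(), random.random() > 0.4) for _ in range(100)]

def majority_class(train):
    # Trivial stand-in for model building: predict the most common label
    labels = [y for _, y in train]
    return max(set(labels), key=labels.count)

k = 5
fold_size = len(data) // k
accuracies = []
for i in range(k):
    # Each fold takes a turn as the test set; the rest is training data
    test = data[i * fold_size:(i + 1) * fold_size]
    train = data[:i * fold_size] + data[(i + 1) * fold_size:]
    prediction = majority_class(train)
    accuracies.append(sum(prediction == y for _, y in test) / len(test))

print(mean(accuracies), pstdev(accuracies))  # estimate: mean and variation
```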
The Churn Dataset Revisited
Average accuracy of the folds with classification trees is lower than the accuracy measured on the training data
This means there is overfitting
Variation in the performances in the different folds
Good idea to average them to get a notion of the performance as well as the variation we can expect
Compare the fold accuracies between logistic regression and classification trees
Logistic regression shows lower accuracy with higher variation
Learning Curves
Learning Curve
: A plot of generalization performance against the amount of training data
Usually has a characteristic shape
Starts steep, then flattens out as the marginal advantage of additional training data decreases
Learning Curves vs. Fitting Graphs
Learning curve shows generalization performance, plotted against the amount of training data used
A fitting graph shows both generalization performance and performance on the training data, plotted against model complexity
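A sketch of generating learning-curve data, assuming scikit-learn is available (the synthetic dataset and parameters are illustrative, not the book's churn data):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import learning_curve
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, random_state=0)
sizes, train_scores, test_scores = learning_curve(
    DecisionTreeClassifier(random_state=0), X, y,
    train_sizes=np.linspace(0.1, 1.0, 5), cv=5)

# Generalization performance as a function of the amount of training data
for n, score in zip(sizes, test_scores.mean(axis=1)):
    print(n, score)
```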
Overfitting Avoidance and Complexity Control
Avoiding Overfitting with Tree Induction
Main problem
: It will keep growing the tree to fit the training data until it creates pure leaf nodes
Solution
: Limit tree size by specifying a minimum number of instances that must be present in a leaf
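One way to sketch this limit, assuming scikit-learn is available (illustrative synthetic data; `min_samples_leaf=20` is an arbitrary choice, not a recommended value):

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# flip_y adds label noise, so an unrestricted tree must overfit to reach purity
X, y = make_classification(n_samples=500, flip_y=0.2, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Unrestricted tree: keeps growing until every leaf is pure
full = DecisionTreeClassifier(random_state=0).fit(X_tr, y_tr)
# Restricted tree: every leaf must contain at least 20 training instances
limited = DecisionTreeClassifier(min_samples_leaf=20, random_state=0).fit(X_tr, y_tr)

print(full.get_n_leaves(), limited.get_n_leaves())     # far fewer leaves when limited
print(full.score(X_tr, y_tr), full.score(X_te, y_te))  # perfect on training, worse on holdout
print(limited.score(X_tr, y_tr), limited.score(X_te, y_te))
```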
A General Method for Avoiding Overfitting
Test data should be strictly independent of model building