Chapter 7
Classifiers: the true class is unknown, so estimate it
Positives are bad outcomes
Worthy of attention or alert
Rarer than negatives
Negatives are good outcomes
Uninteresting or benign
Accuracy is general measure of classifier performance
Accuracy = (Number of correct decisions made) / (Total number of decisions made)
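A minimal sketch of this calculation in Python; the labels below are hypothetical, not from the chapter:

```python
# Accuracy = correct decisions / total decisions (hypothetical labels)
y_true = ["p", "n", "n", "p", "n", "n", "n", "p", "n", "n"]
y_pred = ["p", "n", "n", "n", "n", "n", "p", "p", "n", "n"]

correct = sum(t == p for t, p in zip(y_true, y_pred))
accuracy = correct / len(y_true)
print(accuracy)  # 0.8 -> 8 of 10 decisions were correct
```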
Confusion Matrix
Separates decisions made by classifier
True classes are p and n
Predicted classes are Y and N
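A sketch of tallying the four cells, assuming true classes labeled p/n and predicted classes labeled Y/N as above (the example labels are made up):

```python
# Count true/false positives and negatives from paired labels
def confusion_matrix(y_true, y_pred):
    cells = {"TP": 0, "FP": 0, "FN": 0, "TN": 0}
    for true, pred in zip(y_true, y_pred):
        if pred == "Y":
            cells["TP" if true == "p" else "FP"] += 1
        else:
            cells["FN" if true == "p" else "TN"] += 1
    return cells

print(confusion_matrix(["p", "n", "p", "n"], ["Y", "Y", "N", "N"]))
# {'TP': 1, 'FP': 1, 'FN': 1, 'TN': 1}
```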
Unbalanced Classes
Look for rare positives
Class distribution is skewed
The more skewed the distribution, the more accuracy breaks down
Accuracy is the wrong thing to measure
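A made-up illustration of the breakdown: with 1 positive in 1,000 cases, a classifier that never says Y still looks excellent by accuracy:

```python
# Hypothetical skewed population: 1 rare positive among 1,000 cases
n_positives, n_negatives = 1, 999

# A classifier that always predicts N gets every negative right
# and misses the only positive, yet its accuracy looks excellent.
accuracy = n_negatives / (n_positives + n_negatives)
print(accuracy)  # 0.999
```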
Problems with accuracy for classification
Makes no distinction between false positives and false negatives
Assumes all errors are equally costly
Example: a false negative is a patient told they do not have cancer when they do; a false positive is a patient told they have cancer when they do not
The false negative is more serious
Generalizing beyond Classification
Data scientists estimate the number of stars for an unseen movie
The process uses root-mean-squared error (RMSE)
But root-mean-squared error of what?
Is it meaningful?
Is there a better metric?
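For concreteness, a sketch of root-mean-squared error on invented star ratings (the numbers are not from the book):

```python
import math

# Hypothetical true vs. predicted star ratings
true_stars = [4.0, 2.0, 5.0, 3.0]
pred_stars = [3.5, 2.5, 4.0, 3.0]

# Square each error, average, then take the square root
squared_errors = [(t - p) ** 2 for t, p in zip(true_stars, pred_stars)]
rmse = math.sqrt(sum(squared_errors) / len(squared_errors))
print(round(rmse, 3))  # 0.612
```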
Expected Value
Provides a framework useful for organizing thinking about data-analytic problems
Decomposes data-analytic thinking into:
The structure of the problem
The elements of the analysis that can be extracted from the data
The elements of the analysis that need to be acquired from other sources
Weights each possible outcome's value by its probability and sums them: EV = p(o1)*v(o1) + p(o2)*v(o2) + ... + p(on)*v(on)
p(o) is the probability of outcome o
v(o) is the value of outcome o
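A minimal sketch of the formula, using an invented two-outcome example (a responder worth $100 vs. a non-responder costing $1):

```python
# Expected value: weight each outcome's value by its probability and sum
# (outcome names, probabilities, and values are hypothetical)
outcomes = {
    "respond":     {"p": 0.05, "v": 100.0},
    "not_respond": {"p": 0.95, "v": -1.0},
}

expected_value = sum(o["p"] * o["v"] for o in outcomes.values())
print(round(expected_value, 2))  # 0.05*100 + 0.95*(-1) = 4.05
```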
Expected Value to Frame classifier evaluation
Confusion matrix can give probabilities (p(o))
Divide each cell count by the total number of instances
Next you need cost-benefit values
A matrix with the same structure as the confusion matrix, but holding costs and benefits instead of counts
Costs and benefits come from external sources
Multiply each cell's cost or benefit by its probability and sum the results to get the expected profit
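A sketch combining the two matrices into an expected profit; the cell counts and the cost-benefit values are invented for illustration:

```python
# Confusion-matrix counts (hypothetical)
counts = {"TP": 56, "FP": 7, "FN": 5, "TN": 42}
total = sum(counts.values())

# Cost-benefit matrix, same shape, values supplied from outside the data (hypothetical)
values = {"TP": 99.0, "FP": -1.0, "FN": 0.0, "TN": 0.0}

# Expected profit = sum over cells of p(cell) * value(cell)
expected_profit = sum((counts[cell] / total) * values[cell] for cell in counts)
print(round(expected_profit, 2))  # about 50.34 per instance
```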
Class priors specify the likelihood of seeing positive and negative instances
Be careful about double counting
Counting a benefit and a negative cost for the same thing
It is important to pick a baseline
This gives data scientists something to compare against
Can use an alternative model that is simple but not simplistic
One baseline could be a majority classifier
A classifier that always predicts the majority class in the training data set
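A minimal sketch of such a baseline (the training labels and the helper name fit_majority are made up here):

```python
from collections import Counter

# Majority classifier: always predict the most common class in the training labels
def fit_majority(train_labels):
    majority_class, _ = Counter(train_labels).most_common(1)[0]
    return lambda example: majority_class  # ignores the input entirely

train_labels = ["n", "n", "n", "p", "n", "n"]   # hypothetical, mostly negative
predict = fit_majority(train_labels)
print(predict({"any": "features"}))             # 'n' every time
```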
Maximizing prediction accuracy is not the goal