ML Model development
- Training and test dataset
Test Set
- used only for the final evaluation and to estimate the generalization error
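A minimal sketch of such a split, assuming scikit-learn and dummy data:

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Dummy data standing in for real features/labels.
X = np.random.rand(1000, 5)
y = np.random.randint(0, 2, size=1000)

# Hold out 20% as the test set, touched only for the final generalization estimate.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Carve a validation set out of the remaining data for model selection/tuning.
X_train, X_val, y_train, y_val = train_test_split(X_train, y_train, test_size=0.25, random_state=42)
```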
Other techniques to achieve a well-fitted model:
- Early-stopping
- Regularization techniques
- Ensemble methods
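A hedged sketch of what these three look like in scikit-learn; all parameter values are illustrative:

```python
from sklearn.linear_model import SGDClassifier
from sklearn.ensemble import RandomForestClassifier

# Early stopping: halt training once the validation score stops improving.
sgd_early = SGDClassifier(early_stopping=True, validation_fraction=0.1, n_iter_no_change=5)

# Regularization: penalize large weights (alpha sets the L2 strength).
sgd_l2 = SGDClassifier(penalty="l2", alpha=1e-4)

# Ensemble method: average many decision trees to reduce variance.
forest = RandomForestClassifier(n_estimators=100, random_state=42)
```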
- Model Evaluation
Report the performance of the classifier
Regression Metrics
As with most things in ML, there is no single correct metric. Best practice is to monitor a few and choose the most meaningful one based on the dataset and the business problem.
- Mean Squared Error :check:
- Root mean squared error
- popular, since it reports the error in the target's actual units, unlike MSE, where they are squared
- mean absolute error
- more robust to outliers than MSE, since it does not penalise large errors as heavily
- R squared
- compares model performance against a baseline model that always predicts the mean :check:
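A small scikit-learn sketch computing all four metrics on hypothetical predictions:

```python
import numpy as np
from sklearn.metrics import mean_squared_error, mean_absolute_error, r2_score

# Hypothetical true vs. predicted values, purely for illustration.
y_true = np.array([3.0, 5.0, 7.5, 10.0])
y_pred = np.array([2.5, 5.5, 7.0, 11.0])

mse = mean_squared_error(y_true, y_pred)   # in squared units
rmse = np.sqrt(mse)                        # back in the target's units
mae = mean_absolute_error(y_true, y_pred)  # less sensitive to outliers
r2 = r2_score(y_true, y_pred)              # vs. always predicting the mean
print(mse, rmse, mae, r2)
```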
Based on confusion matrix:
- Accuracy
:warning: Accuracy paradox: accuracy can look high even when the minority positive class is poorly predicted
- Precision -> exactness
- Recall -> completeness
- F1-Score
- balances both precision and recall
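A sketch illustrating these metrics and the accuracy paradox on a hypothetical imbalanced sample (scikit-learn assumed):

```python
from sklearn.metrics import confusion_matrix, accuracy_score, precision_score, recall_score, f1_score

# Toy labels with a minority positive class; the model misses one of two positives.
y_true = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
y_pred = [0, 0, 0, 0, 0, 0, 0, 0, 0, 1]

print(confusion_matrix(y_true, y_pred))
print(accuracy_score(y_true, y_pred))   # 0.9: looks high despite a missed positive
print(precision_score(y_true, y_pred))  # exactness: predicted positives that are correct
print(recall_score(y_true, y_pred))     # completeness: actual positives that were found
print(f1_score(y_true, y_pred))         # harmonic mean of precision and recall
```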
Underfitting Indicators:
- Low performance scores on train, val, test data.
Overfitting Indicators:
- Model performs well on training data but not on validation and/or test data
Remediations
- Train and validate on different datasets (e.g. via cross-validation; see the sketch below)
- Select features to train the model
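A sketch of both remediations, assuming scikit-learn and a synthetic dataset:

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier
from sklearn.feature_selection import SelectKBest, f_classif

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

# An unconstrained tree can memorize the training data.
tree = DecisionTreeClassifier(random_state=0).fit(X, y)
print("train accuracy:", tree.score(X, y))                       # close to 1.0
print("cv accuracy:", cross_val_score(tree, X, y, cv=5).mean())  # noticeably lower -> overfitting

# Feature selection: keep only the k most informative features.
X_sel = SelectKBest(f_classif, k=10).fit_transform(X, y)
```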
2.1 EDA
Geolocation data libraries
- Folium
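A minimal Folium sketch (the coordinates are illustrative, not from these notes):

```python
import folium

# Build an interactive map centered on a hypothetical point (New York City).
m = folium.Map(location=[40.7128, -74.0060], zoom_start=12)
folium.Marker([40.7128, -74.0060], popup="Sample location").add_to(m)
m.save("map.html")  # open the saved file in a browser to explore
```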
Resources
- Experimental Design and Analysis - Howard J. Seltman
Objective: MLU
- Discover Patterns
- Spot anomalies
- Look for insights for modelling choices
Patterns
- Identify feature pairs that are highly correlated, remove one :red_cross:
- Identify features that are highly correlated with the target, keep them :check:
:warning: High correlated feature
- Linear/logistic regression models may degrade in performance when trained on highly correlated features (e.g. rooms and sqft)
- Decision trees are largely immune to this problem: each split considers one feature at a time, so a redundant correlated feature is simply chosen less often instead of destabilizing the model
- :star: Linear/logistic regression models may improve performance when features are highly correlated with the target
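A pandas sketch of both correlation checks on a hypothetical housing-style frame:

```python
import pandas as pd

# Toy frame: `sqft` deliberately tracks `rooms`; `price` is the target.
df = pd.DataFrame({
    "rooms": [2, 3, 3, 4, 5, 5],
    "sqft":  [700, 1000, 1100, 1400, 1800, 1900],
    "age":   [30, 12, 8, 20, 5, 3],
    "price": [150, 220, 240, 300, 400, 420],
})

corr = df.corr()
# Feature-target correlations: keep features with high absolute values.
print(corr["price"].sort_values(ascending=False))
# Feature-feature correlation: if |corr| is very high (e.g. > 0.9), consider dropping one.
print(corr.loc["rooms", "sqft"])
```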
Anomalies:
- Missing Data
- Class Imbalances
- Outlier Detection
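A pandas sketch of all three checks on a toy frame; the IQR rule used here is one common outlier heuristic, not the only option:

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "feature": [1.0, 2.0, 2.5, np.nan, 3.0, 120.0],  # one missing value, one extreme value
    "label":   [0, 0, 0, 0, 0, 1],
})

print(df.isna().sum())             # missing data per column
print(df["label"].value_counts())  # class balance check

# Outlier detection via the IQR rule.
q1, q3 = df["feature"].quantile([0.25, 0.75])
iqr = q3 - q1
outliers = df[(df["feature"] < q1 - 1.5 * iqr) | (df["feature"] > q3 + 1.5 * iqr)]
print(outliers)
```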
2.2 Feature Engineering
Look out for other important features that can be extracted from the given data, which
- provide more insights
- help in better modeling
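A hypothetical pandas example of this: deriving new features from a raw timestamp column (the column name is made up for illustration):

```python
import pandas as pd

df = pd.DataFrame({"pickup_time": pd.to_datetime(["2021-01-04 08:30", "2021-01-09 23:10"])})

df["hour"] = df["pickup_time"].dt.hour            # time of day may drive demand
df["dayofweek"] = df["pickup_time"].dt.dayofweek  # weekday vs. weekend behavior
df["is_weekend"] = df["dayofweek"] >= 5
print(df)
```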
3.1 Prepare Data
Handle class imbalances
- use classification metrics like precision, recall, f1 score instead of accuracy
- downsampling: remove random dominant class records
- upsampling: duplicate random minority class records
- data augmentation/generation: create similar, new records as of minority class
- sample weights in the cost function: higher weight for rare classes
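A sketch of down- and upsampling with scikit-learn's `resample` on a toy imbalanced frame:

```python
import pandas as pd
from sklearn.utils import resample

df = pd.DataFrame({"x": range(10), "y": [0] * 8 + [1] * 2})  # 8:2 imbalance

majority = df[df["y"] == 0]
minority = df[df["y"] == 1]

# Downsampling: randomly drop majority-class records.
down = resample(majority, replace=False, n_samples=len(minority), random_state=42)
balanced_down = pd.concat([down, minority])

# Upsampling: randomly duplicate minority-class records.
up = resample(minority, replace=True, n_samples=len(majority), random_state=42)
balanced_up = pd.concat([majority, up])

print(balanced_down["y"].value_counts(), balanced_up["y"].value_counts())
# For the cost-function route, many sklearn estimators accept class_weight="balanced".
```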
Handling Missing Values
- Drop rows/columns
- Impute values
- average value
- common point/ mode
- placeholder text
- advanced imputation
- numeric data: mean/median
- numeric/categorical data: mode/placeholder
- Use a trained model to predict the missing values (advanced imputation)
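A scikit-learn sketch of the basic imputation strategies on toy data:

```python
import numpy as np
from sklearn.impute import SimpleImputer

X = np.array([[1.0, 2.0], [np.nan, 3.0], [7.0, np.nan]])

# Numeric data: impute with the mean (median is also common and more robust to outliers).
X_mean = SimpleImputer(strategy="mean").fit_transform(X)

# Mode, or a fixed placeholder value, also works for categorical data.
X_mode = SimpleImputer(strategy="most_frequent").fit_transform(X)
X_const = SimpleImputer(strategy="constant", fill_value=-1).fit_transform(X)
print(X_mean, X_mode, X_const, sep="\n")

# Advanced, model-based imputation exists too, e.g. sklearn's IterativeImputer
# (requires `from sklearn.experimental import enable_iterative_imputer`).
```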