Chapter 15: Build Candidate Models (Starting the Process (Option of which…

- - - - 20% of the data is put in a "lock-box" (holdout) to be tested right before the system goes to production
    - - Works harder to maintain the same distribution of target values
    - - More folds = drawbacks
        
        Less reliable
        
        DataRobot makes consequential decisions based on first validation fold
      - 5 is the most common number of folds
  - - - Making sure all validation cases occur at a time after the time of the training cases
- - - - transfers user's target feature decision
    - - Uses decisions we made in advanced settings
    - - DataRobot will save the distribution of the target to the analysis system
      - Used later in decisions about which models to run
    - - Relevant:
        
        dataset is large (over 500)
        
        All the initial evaluations before step will be conducted
    - - Actual partitions are stored in cross-validation folds and holdout sets are stored in a separate file on disk
    - - Info from steps 3-6 is used to determine which blueprints to run in the autopilot process
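
The partitioning ideas in these notes (a 20% lock-box holdout, 5 cross-validation folds that preserve the distribution of target values, and validation cases that occur after the training cases in time) can be illustrated with a small sketch. This is not DataRobot's implementation; the function names `partition` and `time_ordered_split`, their parameters, and the toy data are hypothetical and chosen only for illustration.

```python
import random
from collections import defaultdict


def partition(targets, holdout_frac=0.2, n_folds=5, seed=0):
    """Split row indices into a holdout set and n_folds CV folds,
    stratified so each part keeps roughly the same target distribution.
    (Illustrative helper, not a DataRobot API.)"""
    rng = random.Random(seed)

    # Group row indices by target value so each class is split separately.
    by_class = defaultdict(list)
    for idx, y in enumerate(targets):
        by_class[y].append(idx)

    holdout = []
    folds = [[] for _ in range(n_folds)]
    for y in sorted(by_class):
        rows = by_class[y]
        rng.shuffle(rows)
        n_hold = round(len(rows) * holdout_frac)
        # The first n_hold rows of this class go into the lock-box.
        holdout.extend(rows[:n_hold])
        # The remaining rows are dealt round-robin across the folds.
        for i, idx in enumerate(rows[n_hold:]):
            folds[i % n_folds].append(idx)
    return sorted(holdout), [sorted(f) for f in folds]


def time_ordered_split(timestamps, validation_frac=0.2):
    """Return (train, validation) indices where every validation case
    is at least as late as every training case."""
    order = sorted(range(len(timestamps)), key=lambda i: timestamps[i])
    cut = len(order) - round(len(order) * validation_frac)
    return order[:cut], order[cut:]


# Toy data: 100 rows, 20% positive class.
targets = [1] * 20 + [0] * 80
holdout, folds = partition(targets)

# Toy timestamps for the time-aware case.
timestamps = [5, 1, 4, 2, 3]
train_idx, valid_idx = time_ordered_split(timestamps)
```

Because the split is done per class, the holdout keeps the same share of positive cases as the full dataset, which is the point of stratification. The time-ordered split does no shuffling at all, since shuffling would let later cases leak into training.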