Please enable JavaScript.

Coggle requires JavaScript to display documents.

Chapters 21-23 (21 (Choose Deployment Strategy (Batch (uses the DataRobot…

- - - - uses the DataRobot API to upload and score multiple large files in parallel. The code required to use batch scoring is available at https://github.com/datarobot/batch-scoring
    - - allows for exporting the selected model as an executable file to be used in an Apache Spark environment
        
        Spark is a fast and widely distributed data processing environment
    - - creates an approximation of the selected model, available as code in the Python and Java programming languages Prime Scoring is availability based on DataRobot account type
    - - created on the DataRobot server, allowing a developer to write a program that uploads new data to the API, which then returns a probability of the prediction target
    - - accessed through the Predict screen of the selected model
- - - - source of data as target
        
        The methodology here is to create a new target that specifies whether a case was used to create the original model or whether that case was retrieved from the production system after the model was used for prediction.
        
        recommended to use either the same measure used for model selection, or the Matthews Correlation Coefficient. It is also recommended to automate threshold testing and set up message and alert transmissions to signal when the determined threshold is reached.