Chapters 13-15 (Model selection process: (SAMPLE, CROSS VALIDATION,…
Chapters 13-15
If two bar charts from different data look the same, that can mean a few things. This code has already been tagged as a categorical data type by DataRobot.
DataRobot is capable of detecting and tagging currencies based on the existence of currency symbols such as $.
Evaluating feature content: You need to be sure of what types of features are being tagged in different situations.
14.4 Missing values: Both the question mark and DataRobot will tell you how many values are missing. A value may simply be recorded as "missing". Nulls are also treated as missing, and missing values can also be stored as N/A.
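The ways of recording a missing value listed above can be counted with a short sketch. This is only an illustration in plain Python, not how DataRobot itself does it: the token list, the helper names `is_missing` and `count_missing`, and the sample rows and column names are all assumptions made up for the example.

```python
# Tokens treated as "missing" here are an assumption for illustration.
MISSING_TOKENS = {"", "?", "missing", "n/a", "na", "null", "nan"}


def is_missing(value):
    """Return True when a raw cell should be counted as missing."""
    if value is None:
        return True
    return str(value).strip().lower() in MISSING_TOKENS


def count_missing(rows, column):
    """Count the missing cells in one column of a list of dict rows."""
    return sum(1 for row in rows if is_missing(row.get(column)))


# Hypothetical rows, not taken from the data set the notes describe.
rows = [
    {"weight": "?", "payer_code": "MC"},
    {"weight": "[75-100)", "payer_code": None},
    {"weight": "N/A", "payer_code": "missing"},
    {"weight": "[50-75)", "payer_code": "HM"},
]

for column in ("weight", "payer_code"):
    print(column, count_missing(rows, column))
```

Normalizing the text before comparing means "N/A", "n/a" and " N/A " are all counted the same way.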
15.2 Advanced options
Random sampling is similar to what we have already looked at. You can hold out 10% or 20% of the data, and then you can make the decision of whether or not to put it into "production".
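A random holdout can be sketched in a few lines. This is a minimal illustration using only the standard library, not DataRobot's own procedure; the function name `split_holdout`, the fixed seed, and the 100 numbered cases are assumptions for the example.

```python
import random


def split_holdout(cases, holdout_fraction=0.2, seed=0):
    """Randomly set aside a fraction of the cases as a holdout sample."""
    rng = random.Random(seed)      # fixed seed so the split is repeatable
    shuffled = list(cases)
    rng.shuffle(shuffled)
    n_holdout = round(len(shuffled) * holdout_fraction)
    # The first n_holdout shuffled cases are held out; the rest are for training.
    return shuffled[n_holdout:], shuffled[:n_holdout]


# Hypothetical cases numbered 0..99.
training, holdout = split_holdout(range(100), holdout_fraction=0.2)
print(len(training), len(holdout))
```

Changing `holdout_fraction` to 0.1 gives the 10% holdout mentioned above.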
Partition Feature: this is a method for determining exactly which cases are used in different folds. It is different from the other approaches in that the user must do their own random or semi-random assignment of cases.
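Doing your own assignment amounts to adding a column that names the fold for each case. The sketch below shows one way to build such a column. The column name `fold`, the helper names, and the ten sample cases are hypothetical, and nothing here is DataRobot's API.

```python
import random
from collections import defaultdict


def add_partition_column(cases, n_folds=5, seed=0):
    """Return copies of the cases with a user-assigned 'fold' value."""
    rng = random.Random(seed)
    order = list(range(len(cases)))
    rng.shuffle(order)                      # random order of case positions
    labelled = [dict(case) for case in cases]
    for position, index in enumerate(order):
        # Dealing folds out in turn keeps the fold sizes balanced.
        labelled[index]["fold"] = position % n_folds
    return labelled


def folds_from_column(cases, column="fold"):
    """Group the cases by the value stored in the partition column."""
    folds = defaultdict(list)
    for case in cases:
        folds[case[column]].append(case)
    return dict(folds)


# Hypothetical cases identified only by a made-up case_id.
cases = [{"case_id": i} for i in range(10)]
labelled = add_partition_column(cases, n_folds=5)
folds = folds_from_column(labelled)
print({fold: len(members) for fold, members in sorted(folds.items())})
```

Because the fold is stored with each case, exactly which cases land in which fold is fixed by the user rather than decided at modeling time.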
The Group approach accomplishes much the same as the Partition Feature, but with some key differences.
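The notes do not spell out the differences. The sketch below assumes the main one is that, in a group approach, every case sharing the same value of a chosen feature is kept in the same fold. The feature name `patient_id`, the helper name, and the sample visits are hypothetical.

```python
import random


def assign_groups_to_folds(cases, group_key, n_folds=5, seed=0):
    """Give every case with the same group value the same fold number."""
    rng = random.Random(seed)
    groups = sorted({case[group_key] for case in cases})
    rng.shuffle(groups)
    # Whole groups, not single cases, are dealt out to the folds.
    fold_of_group = {group: i % n_folds for i, group in enumerate(groups)}
    return [dict(case, fold=fold_of_group[case[group_key]]) for case in cases]


# Hypothetical visits: several rows can belong to one patient.
visits = [
    {"patient_id": "A", "visit": 1},
    {"patient_id": "A", "visit": 2},
    {"patient_id": "B", "visit": 1},
    {"patient_id": "C", "visit": 1},
    {"patient_id": "C", "visit": 2},
    {"patient_id": "C", "visit": 3},
]

grouped = assign_groups_to_folds(visits, "patient_id", n_folds=2)
for row in grouped:
    print(row["patient_id"], row["visit"], row["fold"])
```

Under this assumption, a patient's visits never end up split across folds.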
Summarization of our findings: for these patients in this data set, the most frequent value for length of hospital stay was two days, the mean length was 4.43 days, and the median was 4 days, suggesting that this data is an example of a right-skewed distribution.
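The reasoning behind that conclusion is that a long right tail pulls the mean above the median, which in turn sits above the mode. The sketch below checks that ordering on a small made-up sample. The values are illustrative only and are not the data set summarized above.

```python
import statistics

# Hypothetical lengths of stay in days (illustrative, not the real data).
stays = [1, 2, 2, 2, 3, 3, 4, 4, 5, 6, 8, 12]

mode = statistics.mode(stays)
median = statistics.median(stays)
mean = statistics.mean(stays)

print("mode:", mode, "median:", median, "mean:", round(mean, 2))

# mean > median > mode is a common rule-of-thumb sign of right skew.
looks_right_skewed = mean > median > mode
print("looks right-skewed:", looks_right_skewed)
```

The same ordering appears in the summary above (mean 4.43, median 4, mode 2), which is why the distribution is described as right-skewed.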