Please enable JavaScript.
Coggle requires JavaScript to display documents.
Data Transformations (Numerical (Add, Subtract, Multiply, Divide),…
Data Transformations
Numerical
Add
Subtract
Multiply
Divide
Categorical
Combine Categories
Turn multi category columns into several binary ones
One hot encoding (dummy encoding)
Improves predictability
Text
Extract New Columns
Ch 10.1.2 Regular expressions
Escape characters
(\d) for any digit
(\D) for any character, no digits
(\w) for any alphanumeric character
(\W) returns any non alphanumeric characters ($#%
)
( . ) has the greatest scope returning any and all characters
(?) means that the character directly proceeding it is optional
(\s) is for white space or spaces (differs depending on syntax)
(\S) is anything but whitespaces
([ ]) specify a list of allowable characters in a specific position (can use a dash inside brackets to specify range)
( | ) pipe symbol is the or command (ex: ab|cd means find ab or cd)
( ^ ) inside brackets means anything but alphanumeric values. when not in square brackets the carrot indicates the beginning of a string
( $ ) outside of brackets indicate the end of a string
({ }) can repeat how many times you want to use something (ex: [a-z]{2})