Please enable JavaScript.
Coggle requires JavaScript to display documents.
Data Science Overview - Coggle Diagram
Data Science Overview
Natural Language Processing (NLP)
-
-
Text Classification
Context Models
-
Context-Aware
Position in the sentence and neighbouring words affect the word, enabling it to derive different meaning
(e.g. language models)
-
Time Series
ARIMA
(AR)(I)(MA)
-
Model
ARIMA(p, d, q)
- p = number of auto regressive terms
- d = number of differences
- q = number of moving average terms
Intuition
- Linear models need stationarity
- To achieve stationarity we take a difference of the time series itself
- Taking differences removes the trend (I term)
- Sometimes we over difference and need to add back (MA term)
- Sometimes differencing doesn't go far enough (AR term)
Stationarity
Constant mean (no trend), and constant variance
Top = Stationarity
Bottom = Non-stationarity
Augmented Dickey-Fuller Test
Tests for stationarity through the presence of a unit root
- A unit root is a stochastic trend (random walk with drift)
- If a series has a unit root, it shows a systematic, unpredictable pattern
-
-
Tools
Pandas
DBT
Deep Learning