Please enable JavaScript.

Coggle requires JavaScript to display documents.

Text Analytics - Coggle Diagram

- - - - Removing suffixes from words to create a so-called root word
    - - Includes the meaning of the word when converting words to their base forms (Lemma)
  - - - Binary
      - Frequency-based
        
        Generate word cloud
      - Normalized frequency
      - tf-idf
        (term frequency–inverse document frequency)
- - - - Hand-coded Classifiers
        (S5 p16)
      - Generative Classifiers
        
        Naïve Bayes Model
        (S5 p17-20)
      - Discriminative Classifiers
        (S5 p21-26)
        
        Decision Tree Classifier
        
        Rocchio Classifier
        
        Support Vector Machines (SVMs)
  - - - Kmeans
    - - Latent Dirichlet Allocation (LDA)
- - - - Brill’s tagger
    - - Hidden Markov Models (HMM)
        (S6 p26-27)
      - Using NLTK
  - - - Using CoreNLP
        
        TokensRegex
        (S6 p49-53)
      - using NLTK
    - - Using NLTK or spaCy