Please enable JavaScript.

Coggle requires JavaScript to display documents.

CH. 10: Representing and Mining Text (Ways to Represent Text (Inverse…

- - - - Make every word lowercase
      - remove suffixes (called stemming)
      - Stopwords removed (stopword = very common word like the, and, of, on, etc.)
  - - - Set an arbitrary lower limit
    - - Set an arbitrary upper limit
  - - - words map to one or more topics
        
        final classifier is determined in terms of topics rather than words