Please enable JavaScript.

Coggle requires JavaScript to display documents.

Chapter 10: Representing and Mining Text (Representation (A document…

- - - - Normalize words
      - Words have been "Stemmed"
        
        Suffixes removed
      - Remove stopwords
        
        the,and, of, etc.
      - Term should not be too rare
      - Term should not be too common
- - - - Old media
      - New media
      - Have to understand customer feedback
  - - - Meant for humans, not computers
- - - - assume same day
      - Satisfied with the direction
        
        Change
        
        No change
      - Predict relatively large changes
      - Narrow the "causal radius"
        
        Surges
        
        Plunges
        
        Stable
- - - - and sometimes special knowledge