Provost Chapter 10 (Representation (a document is one piece of text (A…

- A collection of documents is called a corpus
  - not too rare, not too common
- text is relatively dirty
  - context is important, much more so than with other forms of data
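
A minimal sketch of the "not too rare, not too common" note, assuming it refers to which terms to keep when representing a corpus. The function names, the whitespace tokenization, and the thresholds `min_docs` and `max_fraction` are all hypothetical choices made for illustration; they do not come from the chapter.

```python
from collections import Counter


def document_frequency(corpus):
    """Count, for each term, how many documents contain it at least once."""
    df = Counter()
    for document in corpus:
        # Crude tokenization (an assumption): lowercase, split on whitespace.
        df.update(set(document.lower().split()))
    return df


def filter_terms(corpus, min_docs=2, max_fraction=0.9):
    """Keep terms that are neither too rare nor too common.

    min_docs and max_fraction are illustrative thresholds, not recommended values.
    """
    df = document_frequency(corpus)
    limit = max_fraction * len(corpus)
    return sorted(term for term, n in df.items() if min_docs <= n <= limit)


# A tiny corpus: each string is one document.
corpus = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "the cat chased the dog",
]

kept = filter_terms(corpus)
print(kept)  # ['cat', 'dog', 'on', 'sat']
```

Here "the" is dropped because it appears in every document, and "mat", "log", and "chased" are dropped because each appears in only one. Real text is dirtier than this toy corpus, so a practical pipeline would likely also need punctuation handling and normalization.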