Provost, Chapter 6: Clustering
the idea of finding natural groups in the data, without supervision
objects within a cluster are similar to each other; objects across clusters are not
example: whiskey analytics
exploration of different tastes of whiskey to cater to different customers
hierarchical clustering
group points by similarity
only overlap allowed is when one cluster wholly contains another
aka hierarchy
highest level: one cluster containing all the data
lowest level: each data point is its own cluster
dendrogram
x-axis: the individual data points
y-axis: the distance at which clusters merge
cutting it at any height gives a different way to cluster
shows the landscape of data similarity
each point starts as its own node, then the most similar nodes merge step by step
example: tree of life
hierarchical phylogenetic chart of all life
start w/ the most similar species before joining the others (see sketch below)
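A minimal sketch of the merging idea, assuming SciPy/Matplotlib are available; the toy 2-D points are invented for illustration:

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, dendrogram
import matplotlib.pyplot as plt

# toy 2-D points, invented for illustration
points = np.array([[1.0, 2.0], [1.5, 1.8], [5.0, 8.0],
                   [8.0, 8.0], [1.0, 0.6], [9.0, 11.0]])

# each point starts as its own cluster; the two most similar
# clusters are merged repeatedly until only one remains
merges = linkage(points, method="ward")

# dendrogram: x-axis = data points, y-axis = merge distance
dendrogram(merges)
plt.ylabel("cluster distance")
plt.show()
```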
nearest neighbors revisited: clustering around centroids
represent each cluster by its center
the centroid, i.e. the average of the cluster's points
k-means: first choose how many clusters (k) you want
assign each point to whichever centroid is closest
keep recalculating the centroids & reassigning points until nothing shifts
also can look @ distortion
aka the sum of squared differences between points and their centroid
k-means is efficient in runtime!
choosing k: which value creates the best result
the minimum k where the distortion stabilizes (see sketch below)
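A sketch of picking k by distortion, assuming scikit-learn (its `inertia_` attribute is the sum of squared distances to the nearest centroid); the data is invented:

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 2))  # toy data, invented for illustration

# distortion always shrinks as k grows, so pick the smallest k
# after which it stops improving much (where it stabilizes)
for k in range(1, 8):
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    print(k, round(km.inertia_, 1))  # inertia_ = distortion
```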
example: business stories
TRC2 corpus: the news stories mentioning Apple (AAPL)
data preparation
words too rare or too frequent were eliminated
TFIDF (term frequency x inverse document frequency) score
gives a score for each vocabulary word in each document (see sketch below)
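A sketch of the same preparation, assuming scikit-learn's TfidfVectorizer; the stand-in documents and the min_df/max_df thresholds are invented:

```python
from sklearn.feature_extraction.text import TfidfVectorizer

# toy stand-ins for the TRC2 news stories
docs = ["Apple shares rise after strong earnings report",
        "Analysts debate Apple iPhone sales figures",
        "Earnings season brings strong reports across tech"]

# min_df / max_df drop words that are too rare or too frequent
vectorizer = TfidfVectorizer(min_df=1, max_df=0.9, stop_words="english")
tfidf = vectorizer.fit_transform(docs)  # one TFIDF score per word per doc
print(vectorizer.get_feature_names_out())
```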
news story clusters
9 clusters based on various things
just as correlation is not causation, syntactic similarity is not semantic similarity
understanding the results
cluster names might not be meaningful!
look at the other members of the cluster too
supervised learning
use it to find what differentiates the clusters
generate a classifier w/ cluster membership as the target
a decision tree shows what distinguishes each cluster (see sketch below)
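A sketch of this trick, assuming scikit-learn and invented toy features: cluster first, then fit a decision tree with membership in one cluster as the label, so the tree's splits describe what sets that cluster apart:

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.tree import DecisionTreeClassifier, export_text

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))  # toy features, invented

labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)

# one-vs-rest: what differentiates cluster 0 from everything else?
tree = DecisionTreeClassifier(max_depth=3, random_state=0)
tree.fit(X, labels == 0)
print(export_text(tree, feature_names=["f0", "f1", "f2"]))
```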
stepping back
the problem itself may be vague
the answer might not be cut and dried
spend more time on problem formulation if the problem is vague