Chapter 6: Similarity, Neighbors, & Clusters
Similarity & Distance
Visualizing Segmentations
Geometric interpretation of some classification models
Classification via Mathematical Functions
How two different model types divide up an instance space into regions based on closeness of instances with similar class labels
Nearest-Neighbor Reasoning
Use this measure to find the companies most similar to the best corporate customers, or the online customers most similar to retail consumers
Ex. Whiskey Analytics - Foster likes to try single malts - direct application
Describe whiskeys as feature vectors built only from tasting notes
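A minimal sketch of how a similarity search over such feature vectors might look, assuming Euclidean distance as the measure. The whiskey names and the three tasting-note features (smoky, sweet, fruity) are hypothetical placeholders, not data from the chapter.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length numeric vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Hypothetical tasting-note vectors: (smoky, sweet, fruity), 1 = note present.
whiskeys = {
    "A": (1, 0, 0),
    "B": (1, 1, 0),
    "C": (0, 1, 1),
}

def most_similar(name, items):
    """Return the key of the other item whose vector is closest to `name`."""
    target = items[name]
    others = [k for k in items if k != name]
    return min(others, key=lambda k: euclidean(target, items[k]))

closest = most_similar("A", whiskeys)  # "B": it differs from A in one note only
```

A smaller distance means greater similarity, so the recommendation is simply the item at minimum distance from the one the taster already likes.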
Predictive Modeling
Choose the training examples most similar to the new example, then predict its target value from the nearest neighbors' target values
Classification
Classify new instance using nearest neighbor
Credit Card Example
The three nearest neighbors are yes, yes, and no, so the majority vote predicts yes for David
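The majority vote can be sketched as follows. This is an illustration under stated assumptions: the customer records (age, income in thousands) and the query point for David are made up for the example, and `knn_classify` is a hypothetical helper name.

```python
import math
from collections import Counter

def knn_classify(query, examples, k=3):
    """Label `query` by majority vote among its k nearest training examples."""
    nearest = sorted(examples, key=lambda ex: math.dist(query, ex[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical training data: ((age, income in $1000s), responded to offer?)
examples = [
    ((35, 35), "yes"),
    ((40, 50), "yes"),
    ((45, 40), "no"),
    ((22, 70), "no"),
    ((63, 200), "no"),
]

david = (37, 50)  # hypothetical new instance
prediction = knn_classify(david, examples, k=3)  # neighbors: yes, no, yes
```

With an odd `k` and two classes the vote cannot tie, which is one reason odd values are a common choice.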
Probability Estimation
Credit Card Example
Goes beyond a yes/no answer: the share of neighbors voting yes gives a probability estimate (2/3)
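The 2/3 figure is simply the fraction of nearest neighbors that carry the target label. A small sketch, where `class_probability` is a hypothetical helper name:

```python
from fractions import Fraction

def class_probability(neighbor_labels, target="yes"):
    """Estimate P(target) as the fraction of neighbors labeled `target`."""
    return Fraction(neighbor_labels.count(target), len(neighbor_labels))

# Neighbor labels from the credit card example: yes, yes, no.
p_yes = class_probability(["yes", "yes", "no"])  # Fraction(2, 3)
```

With only three neighbors the estimate can take just four values (0, 1/3, 2/3, 1), so a larger `k` gives a finer-grained estimate.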
Issues
Intelligibility
Dimensionality and domain knowledge
Numeric attributes may have vastly different ranges
Technical Details
Heterogeneous Attributes
Numeric attributes - different scales and ranges
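One common way to keep a wide-ranging attribute such as income from dominating the distance is to rescale every attribute to the same range first. The sketch below assumes min-max scaling to [0, 1]; the notes do not name a specific method, and the sample values are hypothetical.

```python
def min_max_scale(values):
    """Rescale a list of numbers linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant attribute carries no distance information.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

# Hypothetical attributes on very different scales.
ages = [20, 35, 50]
incomes = [20000, 50000, 120000]

scaled_ages = min_max_scale(ages)        # [0.0, 0.5, 1.0]
scaled_incomes = min_max_scale(incomes)  # now on the same 0-1 range as age
```

Without rescaling, a $1,000 difference in income would outweigh a 50-year difference in age in a raw Euclidean distance.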
Clustering
Hierarchical Clustering
Groups points by their similarity
Group objects that are similar
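A rough sketch of bottom-up (agglomerative) hierarchical clustering: start with every point in its own cluster and repeatedly merge the two closest clusters. Single linkage (distance between the closest pair of members) is an assumption made for this example, and the points are hypothetical.

```python
import math

def single_link(c1, c2):
    """Distance between two clusters: the closest pair of points across them."""
    return min(math.dist(p, q) for p in c1 for q in c2)

def agglomerate(points, n_clusters):
    """Merge the two closest clusters until only n_clusters remain."""
    clusters = [[p] for p in points]  # every point starts as its own cluster
    while len(clusters) > n_clusters:
        pairs = (
            (a, b)
            for a in range(len(clusters))
            for b in range(a + 1, len(clusters))
        )
        i, j = min(pairs, key=lambda ab: single_link(clusters[ab[0]], clusters[ab[1]]))
        clusters[i] = clusters[i] + clusters[j]  # merge cluster j into cluster i
        del clusters[j]
    return clusters

points = [(0, 0), (0, 1), (5, 5), (5, 6)]  # hypothetical 2-D instances
result = agglomerate(points, 2)
```

Recording the order of merges, rather than stopping at a fixed number of clusters, would give the full hierarchy that a dendrogram displays.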