3 CLASSIFIER, Cluster Analysis - is an unsupervised learning method that…

- - - - 1) Partitioning Approach
        Construct several partitions of the data and then evaluate them using some criterion, such as the sum of squared errors. Typical methods: k-means, k-medoids, CLARANS.
      - 2) Hierarchical Approach
        Create a hierarchical decomposition of the set of data (or objects) using some criterion. Main types: agglomerative (bottom-up merging) and divisive (top-down splitting).
      - 3) Density-Based Approach
        Based on connectivity and density functions
      - 4) Grid-Based Approach
        Based on a multi-level granularity structure: the data space is quantized into a finite number of cells.
      - 5) Model-Based Approach
        A model is hypothesized for each cluster, and the method looks for the best fit of the data to that model.
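To make the partitioning approach concrete, here is a minimal k-means sketch in plain Python. It is only an illustration under stated assumptions: the function names `assign`, `kmeans` and `sse`, the fixed iteration count, and the caller-supplied starting centroids are all choices made for this example, not part of the original notes. The `sse` helper computes the squared-error criterion mentioned for the partitioning approach.

```python
from math import dist  # Euclidean distance, Python 3.8+


def assign(points, centroids):
    """Assignment step: put each point in the cluster of its nearest centroid."""
    clusters = [[] for _ in centroids]
    for p in points:
        nearest = min(range(len(centroids)), key=lambda i: dist(p, centroids[i]))
        clusters[nearest].append(p)
    return clusters


def kmeans(points, centroids, iterations=10):
    """Alternate assignment and centroid update for a fixed number of rounds."""
    for _ in range(iterations):
        clusters = assign(points, centroids)
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid (an assumption of this sketch).
        centroids = [
            tuple(sum(coord) / len(cluster) for coord in zip(*cluster)) if cluster else old
            for cluster, old in zip(clusters, centroids)
        ]
    return centroids, assign(points, centroids)


def sse(centroids, clusters):
    """Sum of squared distances from each point to its cluster centroid."""
    return sum(dist(p, c) ** 2 for c, cluster in zip(centroids, clusters) for p in cluster)


points = [(1.0, 1.0), (1.0, 2.0), (8.0, 8.0), (9.0, 8.0)]
centroids, clusters = kmeans(points, [(0.0, 0.0), (10.0, 10.0)])
print(centroids, sse(centroids, clusters))
```

A lower `sse` value means tighter clusters, which is one way to compare several candidate partitions of the same data.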
- - - - Decision nodes - where the data is split
      - Leaves - decisions or the final outcomes
  - - - to aid in determining the most effective method for achieving a goal
  - - - Look at the training dataset.
        
        For example:
        
        Choose the best attribute to split the data, i.e. the one that best separates the two different labels into two sets
        
        Calculate the entropy of the dataset, and again after every candidate split, to compute the gain
        
        Choose the condition that gives the highest gain
        
        In this step, the data is split using each condition in turn and the gain is checked
        
        The condition that gives the highest gain is used to split first
        
        Split the dataset.
        
        For example:
        
        S is a sample of training examples
        
        $p_+$ is the proportion of positive examples in S
        
        $p_-$ is the proportion of negative examples in S
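
The symbols above are the ones used in the standard two-class entropy, $Entropy(S) = -p_+ \log_2 p_+ - p_- \log_2 p_-$, and the gain of a split is the entropy of $S$ minus the size-weighted entropy of the resulting subsets. The notes do not show these formulas, so treat them as the usual textbook definitions rather than the author's own. The sketch below computes both and picks the condition with the highest gain; the function names, the boolean label encoding, and the dictionary of candidate conditions are hypothetical choices made for this example.

```python
from math import log2


def entropy(labels):
    """Entropy of a list of boolean labels (True = positive example)."""
    if not labels:
        return 0.0
    p_pos = sum(labels) / len(labels)  # proportion of positive examples in S
    p_neg = 1.0 - p_pos                # proportion of negative examples in S
    # 0 * log2(0) is treated as 0 by convention, so zero proportions are skipped.
    return -sum(p * log2(p) for p in (p_pos, p_neg) if p > 0)


def information_gain(rows, labels, condition):
    """Entropy before the split minus the size-weighted entropy after it."""
    left = [y for x, y in zip(rows, labels) if condition(x)]
    right = [y for x, y in zip(rows, labels) if not condition(x)]
    n = len(labels)
    after = (len(left) / n) * entropy(left) + (len(right) / n) * entropy(right)
    return entropy(labels) - after


def best_condition(rows, labels, conditions):
    """Name of the candidate condition with the highest information gain."""
    return max(conditions, key=lambda name: information_gain(rows, labels, conditions[name]))


rows = [1, 2, 3, 4]
labels = [True, True, False, False]
conditions = {"x <= 1": lambda x: x <= 1, "x <= 2": lambda x: x <= 2}
print(best_condition(rows, labels, conditions))
```

A split that separates the two labels perfectly leaves both subsets with zero entropy, so its gain equals the entropy of the whole dataset.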