Data Mining Final Exam (Spring 2017)
Unsupervised Learning
Clustering
Density-based
Hierarchical
Grid-based
Partitioning
:star: K-Means
1. Partition objects into k initial subsets
2. Compute mean points
3. Assign objects to closest mean point
4. Go to step 2 until there are no changes
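The four steps above can be sketched in Python (Lloyd's algorithm; function and parameter names here are my own, not from the exam material):

```python
import math
import random

def k_means(points, k, max_iter=100, seed=0):
    """K-Means: partition, compute means, reassign, repeat until stable."""
    rng = random.Random(seed)
    # Step 1: pick k initial mean points (here: k random objects).
    centers = rng.sample(points, k)
    for _ in range(max_iter):
        # Step 3: assign each object to the closest mean point.
        clusters = [[] for _ in range(k)]
        for p in points:
            i = min(range(k), key=lambda c: math.dist(p, centers[c]))
            clusters[i].append(p)
        # Step 2: recompute the mean point of each cluster.
        new_centers = [
            tuple(sum(x) / len(c) for x in zip(*c)) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
        # Step 4: stop when there are no changes.
        if new_centers == centers:
            break
        centers = new_centers
    return centers, clusters
```

For example, `k_means([(1.0,1.0),(1.0,2.0),(8.0,8.0),(9.0,8.0)], 2)` converges to mean points near (1, 1.5) and (8.5, 8).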
:star: K-Medoid
:red_cross: CLARANS
Cluster Distances
:star: Centroid
:star: Complete-link
:star: Medoid
:star: Single-link
:star: Average
:star: Clustering Feature
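The inter-cluster distance measures above (single-link, complete-link, average, centroid) can be sketched for clusters given as lists of numeric tuples; function names are illustrative:

```python
import math

def single_link(c1, c2):
    # Minimum distance between any pair of points across the two clusters.
    return min(math.dist(p, q) for p in c1 for q in c2)

def complete_link(c1, c2):
    # Maximum distance between any pair of points across the two clusters.
    return max(math.dist(p, q) for p in c1 for q in c2)

def average_link(c1, c2):
    # Mean of all pairwise distances across the two clusters.
    return sum(math.dist(p, q) for p in c1 for q in c2) / (len(c1) * len(c2))

def centroid_dist(c1, c2):
    # Distance between the cluster mean points (centroids).
    m1 = tuple(sum(x) / len(c1) for x in zip(*c1))
    m2 = tuple(sum(x) / len(c2) for x in zip(*c2))
    return math.dist(m1, m2)
```

For `c1 = [(0,0),(0,2)]` and `c2 = [(3,0),(3,2)]`: single-link and centroid distance are both 3, complete-link is √13 ≈ 3.61 (the medoid distance would use actual cluster objects instead of means).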
Supervised Learning
Classification
Model Construction
Decision Tree
:star: ID3
Attribute Selection
Entropy
Conditional Entropy
Information Gain
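Entropy, conditional entropy, and information gain as ID3 uses them can be computed as follows (a sketch; records are assumed to be dicts of attribute values):

```python
from collections import Counter
from math import log2

def entropy(labels):
    """H(Y) = -sum p * log2(p) over the class proportions."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(rows, labels, attr):
    """Gain(A) = H(Y) - H(Y | A): entropy reduction from splitting on attr.
    (C4.5 would further divide this by the split info of attr.)"""
    n = len(labels)
    cond = 0.0  # conditional entropy H(Y | A)
    for value in set(row[attr] for row in rows):
        subset = [y for row, y in zip(rows, labels) if row[attr] == value]
        cond += (len(subset) / n) * entropy(subset)
    return entropy(labels) - cond
```

A split that separates the classes perfectly yields a gain equal to the full class entropy (1.0 for a balanced two-class set).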
Split
:red_cross: CART: Gini Index
:star: C4.5
Gain/Split Info
Mathematical Formula
:red_cross: Bayes Theorem
:star: Naïve Bayes
Classification Rule
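The classification rule picks the class maximizing P(class) · ∏ P(attribute | class) under the naive independence assumption. A toy count-based sketch (no Laplace smoothing; names are my own):

```python
from collections import Counter, defaultdict

def train_nb(rows, labels):
    """Estimate P(class) and P(attr=value | class) from raw counts."""
    priors = Counter(labels)
    cond = defaultdict(Counter)  # (class, attr) -> Counter of values
    for row, y in zip(rows, labels):
        for attr, value in row.items():
            cond[(y, attr)][value] += 1
    return priors, cond

def classify_nb(priors, cond, row):
    """argmax over classes of P(c) * prod P(a=v | c)."""
    n = sum(priors.values())
    best, best_score = None, -1.0
    for y, count in priors.items():
        score = count / n  # prior P(c)
        for attr, value in row.items():
            score *= cond[(y, attr)][value] / count  # P(a=v | c)
        if score > best_score:
            best, best_score = y, score
    return best
```

Note the zero-probability problem: an unseen (value, class) pair zeroes the whole product, which smoothing would avoid.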
Model Usage
Classify new data
Estimate Accuracy
Cross-validation
:!: Bootstrap
Random Sampling
:star: Confusion Matrix
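A confusion matrix tallies (actual, predicted) pairs, from which accuracy and the TP/FP/FN/TN counts fall out; a quick sketch:

```python
from collections import Counter

def confusion_matrix(actual, predicted):
    """Count each (actual, predicted) pair, e.g. ('pos','neg') = false negative."""
    return Counter(zip(actual, predicted))

def accuracy(actual, predicted):
    # Fraction of predictions that match the true label.
    return sum(a == p for a, p in zip(actual, predicted)) / len(actual)
```

For `actual = ['pos','pos','neg','neg']` and `predicted = ['pos','neg','neg','neg']`, the matrix holds 1 TP, 1 FN, 2 TN, and accuracy is 0.75.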
Preprocessing
Cleaning
Incomplete
Ignore
Fill Automatically
Mean of attr
Mean of cluster
Global constant
Fill Manually
Noisy
Binning
By median
By boundaries
By mean
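The three binning smoothers above (equal-depth bins, then replacing each value by the bin's mean, median, or nearer boundary) might look like this sketch:

```python
def smooth_by_bins(values, bin_size, method="mean"):
    """Sort values into equal-depth bins, then smooth each bin."""
    data = sorted(values)
    out = []
    for i in range(0, len(data), bin_size):
        bin_ = data[i:i + bin_size]
        if method == "mean":
            out += [sum(bin_) / len(bin_)] * len(bin_)
        elif method == "median":
            out += [bin_[len(bin_) // 2]] * len(bin_)
        else:  # "boundaries": snap each value to the nearer bin edge
            lo, hi = bin_[0], bin_[-1]
            out += [lo if v - lo <= hi - v else hi for v in bin_]
    return out
```

For the bin [4, 8, 9, 15]: smoothing by mean gives 9.0 for all four values, while smoothing by boundaries gives [4, 4, 4, 15].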
Regression
Clustering to remove outliers
Human inspection
Inconsistent
Intentional
Integration
:star: Covariance Analysis
Correlation Analysis
:star: Nominal: Chi-square
:star: Numerical: Correlation Coefficient
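The two correlation tests above, sketched for a contingency table of two nominal attributes (chi-square) and for two numeric attributes (Pearson's coefficient):

```python
from math import sqrt

def chi_square(table):
    """Chi-square statistic over observed vs. expected counts;
    large values suggest the two nominal attributes are correlated."""
    row_tot = [sum(r) for r in table]
    col_tot = [sum(c) for c in zip(*table)]
    n = sum(row_tot)
    return sum(
        (table[i][j] - row_tot[i] * col_tot[j] / n) ** 2
        / (row_tot[i] * col_tot[j] / n)
        for i in range(len(table)) for j in range(len(table[0]))
    )

def pearson(xs, ys):
    """Correlation coefficient r in [-1, 1] for numeric attributes."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / n
    sx = sqrt(sum((x - mx) ** 2 for x in xs) / n)
    sy = sqrt(sum((y - my) ** 2 for y in ys) / n)
    return cov / (sx * sy)
```

For the table [[250, 200], [50, 1000]] the statistic is about 507.9, far above typical critical values, i.e. strongly correlated.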
Reduction
Numerosity
Parametric
Regression
Non-parametric
Histograms
Clustering
Sampling
Without Replacement
With Replacement
Random
Stratified
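The sampling variants above can be sketched on Python's random module (helper names are my own):

```python
import random

def sample_without_replacement(data, n, seed=0):
    # Each object can be drawn at most once.
    return random.Random(seed).sample(data, n)

def sample_with_replacement(data, n, seed=0):
    # The same object may be drawn more than once (as in the bootstrap).
    rng = random.Random(seed)
    return [rng.choice(data) for _ in range(n)]

def stratified_sample(data, key, frac, seed=0):
    """Draw the same fraction from each stratum so rare groups stay represented."""
    rng = random.Random(seed)
    strata = {}
    for item in data:
        strata.setdefault(key(item), []).append(item)
    out = []
    for group in strata.values():
        k = max(1, round(len(group) * frac))
        out += rng.sample(group, k)
    return out
```

On a 90/10 split of two groups, a 20% stratified sample keeps the 9:1 ratio (18 vs. 2 objects) instead of risking the minority group vanishing.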
Data Compression
Lossless
Lossy
Dimensionality
Principal Component Analysis
Feature Subset Selection
Information Gain
Decision Tree
Wavelet Transform
Data Transformation
Feature Construction
Aggregation (Data Cube Const.)
Smoothing
Normalization
:star: Min-Max
:star: Z-Score
:star: Decimal Scaling
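The three normalization methods above, as short functions over a list of values:

```python
def min_max(values, new_min=0.0, new_max=1.0):
    """v' = (v - min) / (max - min) * (new_max - new_min) + new_min."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) * (new_max - new_min) + new_min for v in values]

def z_score(values):
    """v' = (v - mean) / std_dev; result has mean 0 and std dev 1."""
    n = len(values)
    mean = sum(values) / n
    std = (sum((v - mean) ** 2 for v in values) / n) ** 0.5
    return [(v - mean) / std for v in values]

def decimal_scaling(values):
    """v' = v / 10^j for the smallest j that makes every |v'| < 1."""
    j = 0
    while max(abs(v) for v in values) / 10 ** j >= 1:
        j += 1
    return [v / 10 ** j for v in values]
```

For example, with min 12000 and max 98000, min-max maps 73600 to about 0.716, and decimal scaling divides values ranging over [-986, 917] by 10³.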
Intervals (e.g., age)
Discretization
Binning
Histogram Analysis
Clustering Analysis
Decision Tree
Correlation