Please enable JavaScript.
Coggle requires JavaScript to display documents.
Data mining (Predictive analytics:obtain a model - an approximation of an…
Data mining
Predictive analytics:obtain a model - an approximation of an unknown function f - maps the values of a set of variables vs values of a target variable --> predict the target variable for new observations
-
process: (i) make assumptions on the shape of the unknown function f()
(ii) try to search for the optimal instance of this assumed form (~~ dataset provided, preference criteria --> compare instances
-
-
-
-
Support Vector Machines
use a non linear mapping of the original data into a very high dimension space where the classes can be separated linearly by an hyperplane
Modeling
-
-
clustering
-
methods
Partitioning methods
-
use the info on the distances between cases in the dataset to obtain k "best" groups ~ a certain criterion
iterative process : some cases may be moved btwn the clusters -> improve overall quality of the solution
-
Density based methods
-
find regions of the feature space where cases are packed together wth high density --> find outliers
Grid - based methods
-
high computational efficiency, often integrated with hierarchical / density based method
clustering
partitioning method
process
-
chose a certain criterion - assign a score for each cluster/ group of cases h(c), given a clustering solution formed by k cluster C =c1,c2,c3,...,ck
obtain the overall score of this clustering H(C,k)
-
-
-