Topic 1: Introduction to Data Mining

  1. Intro

Pattern Recognition

Process recognizing a pattern using machine, view through seeveral aspects

by Human

Perceptual (emotions, feelings)

Specialized - decision making

Computer

Benefit of automated pattern recognition

Advantage in complex calculations

Data

Learn/observe from large amounts of data

Study the dependencies and extract knowledge from data

Data

Basic facts such as names, numbers or characters that come in different forms

Knowledge

Processed or organized data that is given some values to uncover the relationship for deeper understanding.

Sample of knowledge in the form of IF then ELSE rules

Data Mining

Extraction of interesting (previously unknown and potentially useful) patterns or knowledge from huge amout of data

Exploration and analysis, by automatic or semi-automatic means, large of quantities of data in order to discover meaningful patterns

Big Data

Term refers to a large amount of data where the concept is related to the characteristics of the data itself

5V's of Big Data

Volume

Variety

Veracity

Value

Velocity

Big Data Mining

refer to collective data mining or extraction techniques that is performed on large volume of data or big data

Goal - discover insights from social media platforms with thousand of postings