Please enable JavaScript.
Coggle requires JavaScript to display documents.
DATA SCIENCE (Statistics (Pick a Dataset, Descriptive Stats, Normal…
DATA SCIENCE
Statistics
Pick a Dataset
Descriptive Stats
Normal Distributions
Exploratory Data Analysis
Mistograms
Percentiles & Outliers
Probability Theory
Bayesian Theory
Random Variables
Cumul Distribution Function
Continuos Distributions
Skewness
ANOVA
Prob Den Function
Central Limit Theorem
Monte Carlo Simulation
p-Value
Chi2 Test
Estimation
Confidence Interval
MLE
Kernel Density Estimate
Regression
Covariance
Correlation
Person coefficient
Causation
Least2 Fit
Euclidean Distance
Progamming
Python Basics
Excel
R Setup / R studio
R Basics
Expressions
Variables
Vectors
Matrices
Arrays
Factors
Lists
Data Frames
Functional Programming
CSV Data
Raw Data
Subsetting Data
Manipulate Data Frame
Functional
Factor Analysis
Install Rackages
Rapid Miner
Regression
Fundamentals
Matrices & Linear Algebra Fund's
Hash Functions
Binary Trees
Relational Algebra
Sun Modelling
Star Schemas
Database Basics
Joins
CAP Theorum
Tabular Data
Sharding
OLAP
Multidimersional Modelling
ETL
Reporting vs BI vs Analytics
JASON & XML
NoSQL Fundamentals
Redex
SQL
Vendor Landscape
Env.Setup
Machine Learning
What is Mochine Learning?
Numerical Var
Categorical Var
Supervised Learning
Unsupervised Learning
Concepts, Inputs & Attributes
Training & Test Data
Classitier
Prediction
Lift
Over fitting
Bias & Variance
Trees & Classi fication
Decision Trees
Boosting
Logistic Regression
Ranking
Linear Regression
Perceptron
Hierarchical Clustering
Neural Networks
Big Data
Map Reduce Fundamentals
Hadoor Components
HDFS
Data Replication Principles
Setup Hadoop
Name & Data Nodes
Job & Task Tracker
MapReduce Programming
Sqoop: Loading Data in HDFS
SQL with PIG
DWH with Hive
Using Mahout
Zookeeper Avro
RHadoop, RHIPE
rmr
Cassandra
Mongo DB
Neo4j
Text Mining/ NLP
Corpus
Named Entity Recognition
Text Aralysis
VIMA
Term Document Analysis
Term Frequency & Weight
Support Vector Machines
Association Relus
Market Based Analysis
Feature Extraction
Using Mahout
Using Weka
Using NLTK
Classify Text
Vocabulary Mapping
Visualisation
Data Exploration
ggplot2
Mistogram & Pie
Tree & Tree Map
Scatter plot
Line Charts
Spatial Charts
Surrey plot
Timeline
Decision Tree
D3, js
Info Vis
IBM Many Eyes
Tableau
QlikView
Tod box
Java
Java Script
Python
Hadoop
Spark, Storm
Flume, Scribe, Chukwa
Neo4j
Mongo Db
Cassandra
RHIPE
Webscraper, Flume, Sqoop
Nutch, Talend, Scraperwiki
Skills
Curious
Creative
Tenacious
Resourceful
Inventive
Innoviative
Deep Technical Skills
Communication Skills
Presentation Skills
Sees beyond the obvious
Data Ingestion
Summary of Data Formats
Data Discovery
Data Sources & Acquisition
Data Integration
Data Fusion
Transformation & Enrichment
Data Survey
Google Open Refine
How much Data
Using ETL
Data Munging
Normalisation
Data Scrubbing
Handling Missing Values
Unbiased Estimators
Binning Sparse Values
Feature Extraction
De - noising
Principal Component Analysis