Please enable JavaScript.
Coggle requires JavaScript to display documents.
Big data & Hadoop - Coggle Diagram
Big data & Hadoop
Hadoop Ecosystem
Spark
Pig
Hive
Hbase
YARN
Flume
Characteristics of big data
veracity
velocity
value
variety
Volume
Types of big data
Semi-structured
Unstructured
Structured
Traditional vs Big data business approach
Traditional approach:
Data Sources: Limited, internal data
Processing: Batch processing, manual
Storage: Relational databases
Analysis: Basic reporting, analysis
Big data approach:
Data Sources: Vast, diverse sources (e.g., social media, sensors)
Processing: Real-time, batch processing, distributed computing
Storage: NoSQL databases, distributed file systems (e.g., Hadoop HDFS)
Analysis: Advanced analytics, machine learning, predictive modeling
Core Hadoop components:
MapReduce
HDFS (Hadoop Distributed File System)