Hadoop Ecosystem
Big Data
Volume
Variety
Velocity
Veracity
Data Science
Maths
Statistics
Data Crunching
Software Engineering
Hadoop
Store
Process
Big Data
Open Source
Apache
Architecture
HDFS
Storage
Cluster
YARN
Resources
Applications
MapReduce
Hive
Spark
Distributed
Nodes
Worker
Processing
Horizontal Scaling
Master
Virtualized
Distributions
Apache
Open Source
Free
Community Support
Hortonworks
Open Source
Cloudera
MapR
Expensive
Hadoop
HDFS
NameNode
Metadata
Heartbeat + Block Report
From
DataNode
Big File
Split 128MB (default block size)
Copied
3 times
Acknowledgement
import
file
hadoop fs -put <full_path_file> <file_after_import>
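The split-and-replicate behaviour in the HDFS branch can be illustrated with a little arithmetic. The sketch below is not HDFS code. It assumes a block size of 128 MB (the stock default in Hadoop 2 and later; the value is configurable) and the replication factor of 3 named in the diagram. The function names `split_into_blocks` and `raw_storage` are hypothetical and exist only for this illustration.

```python
MB = 1024 * 1024


def split_into_blocks(file_size, block_size=128 * MB):
    """Return the sizes (in bytes) of the blocks a file would be cut into.

    Assumption: fixed-size blocks, with a smaller final block holding
    whatever is left over.
    """
    if file_size < 0 or block_size <= 0:
        raise ValueError("file_size must be >= 0 and block_size > 0")
    full_blocks, remainder = divmod(file_size, block_size)
    sizes = [block_size] * full_blocks
    if remainder:
        sizes.append(remainder)
    return sizes


def raw_storage(file_size, replication=3):
    """Bytes consumed across the cluster when every block is copied
    `replication` times (3 in the diagram)."""
    return file_size * replication


blocks = split_into_blocks(300 * MB)
print([size // MB for size in blocks])   # [128, 128, 44]
print(raw_storage(300 * MB) // MB)       # 900
```

A 300 MB file therefore becomes three blocks, and with three copies of each it occupies roughly 900 MB of raw cluster storage.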
MapReduce
Processing
In
DataNode
Parallelized
Map
Key : Value
Shuffle & Sort
Reduce
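The Map, Shuffle & Sort, and Reduce stages above can be sketched as a word count in plain Python. This is a single-process illustration of the data flow only, not the Hadoop MapReduce API: on a real cluster each stage runs in parallel on the worker nodes. The function names (`map_phase`, `shuffle_and_sort`, `reduce_phase`, `word_count`) are hypothetical.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit one (key, value) pair per word, here (word, 1)."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_and_sort(pairs):
    """Shuffle & Sort: group all values by key, then order the keys."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return sorted(groups.items())


def reduce_phase(key, values):
    """Reduce: collapse the values of one key into a single result."""
    return key, sum(values)


def word_count(lines):
    # Each input line stands in for a split handled by a separate mapper.
    mapped = [pair for line in lines for pair in map_phase(line)]
    return [reduce_phase(key, values) for key, values in shuffle_and_sort(mapped)]


print(word_count(["big data big cluster", "data node"]))
# [('big', 2), ('cluster', 1), ('data', 2), ('node', 1)]
```

The grouping step is what lets each reducer see every value for a given key, regardless of which mapper produced it.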