Please enable JavaScript.

Coggle requires JavaScript to display documents.

Big Data - Coggle Diagram

- - - - Vertical Scaling (Scale-Up)
    - - Horizontal Scaling (Scale-Out)
- - - - Responsibilitie
        
        Replication
        
        Fault tolerance
        
        Storage
      - Architecture
        
        NameNode
        
        DataNode
        
        Blocks
        
        Why Blocks
        
        Replication
        
        Why Replication Matters
    - - MapReduce Stages
        
        Input Split
        
        Mapping
        
        Shuffling
        
        Reducin
      - Limitations of MapReduce
        
        SPARK
      - What is MapReduce
    - - Responsibility
        
        Resource management.
        
        CPU allocation
        
        Memory allocation
        
        Job scheduling
        
        Cluster monitoring
      - Important Components
        
        ResourceManager
        
        NodeManager
        
        Container
        
        ApplicationMaster
  - - - HDFS
    - - MapReduce
    - - Hive
        
        Why Hive
        
        HiveQL
        
        SQL like language
        
        Internal Workflow
        
        Hive Query
        
        Hive Compiler
        
        MpaReduce/ Spark Job
        
        HDFS
    - - Pig
        
        Why Pig
        
        Pig Vs Hiv
    - - HBase
        
        Why HBas
    - - Kafka
    - - YARN
    - - Sqoop
    - - Spark
    - - Flume