Please enable JavaScript.

Coggle requires JavaScript to display documents.

Big Data Platform Service Availability (MapReduce Counters (Job counters,…

- - - - CapacityRemaining :explode:
        Available capacity (Resource: Utilization)
        Disk use not to exceed 80% capacity.
      - CorruptBlocks/MissingBlocks :explode:
        Number of corrupt/missing blocks (Resource: Error/Resource: Availability)
        Corrupt blocks are replicated from healthy copies.
        Missing blocks have no known copy.
      - VolumeFailuresTotal :explode:
        Number of failed volumes (Resource: Error)
        A failed volume will not bring your cluster to grinding halt, you most want to know when hardware failures occur, so that you can replace the failed hardware.
      - NumLiveDataNodes/NumDeadDataNodes :explode:
        Count of alive DataNodes/Count of dead DataNodes (Resource: Availability)
        When the NameNode does not hear from a DataNode for 30 seconds, that DataNode is marked as “stale.” Should the DataNode fail to communicate with the NameNode for 10 minutes following the transition to the “stale” state, the DataNode is marked “dead.”
      - FilesTotal
        Total count of files tracked by the NameNode (Resource: Utilization)
      - TotalLoad
        The current number of concurrent file accesses (read/write) across all DataNodes. (Resource: Utilization)
      - BlockCapacity/BlocksTotal
        Maximum number of blocks allocable/Count of blocks tracked by NameNode (Resource: Utilization)
      - UnderReplicatedBlocks
        Count of under-replicated blocks (Resource: Availability)
      - NumStaleDataNodes
        Count of stale DataNodes (Resource: Availability)
    - - ConcurrentMarkSweep count
        Number of old-generation collections. ConcurrentMarkSweep collections free up unused memory in the old generation of the heap. (Other)
      - ConcurrentMarkSweep time
        Elapsed time of old-generation collections, in milliseconds (Other)