Please enable JavaScript.

Coggle requires JavaScript to display documents.

Raft (algorithm (Safety (new leader is elected in case the current one is…

- - - - servers start up as Follower state and remains as long as it receives RPCs from a leader or candidate
      - Leader sends periodic heartbeats to all Followers to keep its leader state
      - once a follower receives no RPC for a period of time (election timeout), it begins an election by
        1) increment its current term
        2) transits to Candidate state
        3) votes itself and issues a RequestVote RPCs to all other servers in parallel
      - stays in Candidate state until
        a) it wins election (received votes from majority) and becomes Leader, then send heartbeat to all servers to notify and prevent new elections
        b) received AppendEntries RPC with term >= current term of itself (another server has become Leader), then remember the new Leader and go back to Follower state
        c) timed out with no winner (tied split vote), then the Candidate start a new election by incrementing its term and initiating another round of RequestVote RPCs
        ( :!: could repeat forever, use randomized election timeouts to avoid )
    - - voter denies if its own log is more up-to-date than the candidate
      - how to determine "up-to-date"?
        
        RequestVote RPC includes term and index of the last entry in the Candidate's log
        
        the one with greater term is more up-to-date
        
        if terms are the same, the one with greater index is more up-to-date
  - - - the Leader appends the command from client to its log as a new entry
      - the Leader issues AppendEntries RPCs in parallel to all other servers to replicate the log entry
      - when safely replicated, the Leader applies the log entry to its state machine and returns the result to the client
      - if any Followers failed to receive, the Leader retry indefinitely until all Followers eventually store all log entries
    - - state machine command
      - term number when the entry was received by the Leader
      - integer index identifying its position in the log
    - - NEVER commits log entries from previous terms by counting replicas
        ONLY log entries from the Leader's current term are committed by counting replicas, and once an entry from the current term has been committed in this way, then ALL prior entries are committed indirectly because of the Log Matching Property
      - Leader CANNOT immediately conclude that an entry from previous term is committed once it's stored on majority of servers
      - log entries retain the same term number all the time, across all logs
  - - - first switches to a transitional config, "joint consensus", which combines both old and new configs
      - once the joint consensus is committed, the cluster transitions to the new config
      - Leader receive request to change config
        
        the Leader stores the config for joint consensus as a log entry and replicates that entry via AppendEntries RPCs
        
        Once a server receives and adds the new config to log, use that newest config for future decisions
        
        Leader create log entry for new config and replicate across cluster
      - Only server with joint consensus log entry can be elected as Leader
    - - Log entries are replicated to all servers in both config
      - any server from either config may serve as Leader
      - Agreement for election and entry commitment requires separate majorities from both old and new configs
      - allows individual servers to transition between configurations at different times without compromising safety
      - allows the cluster to continue serving requests throughout the config change
- - - - initiated by Candidate nodes during election
    - - initiated by Leader to replicate log entries and to provide heartbeat
    - - transferring snapshots between servers