IO batching for NVMe FC

- - - - nvme/rdma moves network stack to HW
      - userspace network/storage stack
    - - network link: 40 ~ 100Gbps
      - disaggregated storage
        
        access storage over network
    - - can we not change sw arch to get similar perf?
        
        no userspace change
        
        no network stack change
      - i10
        
        design
        
        saturates a 100Gbps link using a commodity server
        
        CPU utilization similar to state-of-the-art user-space stacks and NVMe-over-RDMA products
        
        the inefficiency lies at the boundary of the two stacks!
        
        two ideas
        
        End-to-end dedicated resources and batching
        
        Delayed Doorbells
        
        dispatch batching
        
        result
        
        drawback
        
        high latency
        
        1.7× the latency of nvme-rdma
        
        achieves throughput-per-core comparable to NVMe-over-RDMA
        
        reduces the CPU utilization by 2.5× for nvme-tcp
  - - - can't work well
        
        io_uring may just poll within the 50 µs window
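
The delayed-doorbell idea and its latency drawback can be sketched with a toy model. This is only an illustration, not i10's kernel code: the class `DelayedDoorbell`, the helpers `simulate` and `mean_added_delay`, and the batch size of 16 are hypothetical names and values chosen here; the 50 us timeout mirrors the window mentioned in the notes. The doorbell is rung when enough requests are queued or when the timer started by the first queued request expires.

```python
from dataclasses import dataclass, field


@dataclass
class DelayedDoorbell:
    """Toy model: ring the doorbell when batch_size requests are queued,
    or when timeout_us has passed since the first queued request."""
    batch_size: int = 16       # assumed aggregation threshold (illustrative)
    timeout_us: float = 50.0   # assumed delay, matching the 50 us window in the notes
    pending: list = field(default_factory=list)
    flushed: list = field(default_factory=list)  # (flush_time_us, [arrival times])

    def _flush(self, now_us):
        if self.pending:
            self.flushed.append((now_us, self.pending))
            self.pending = []

    def submit(self, arrival_us):
        # The timer expired before this request arrived:
        # the old batch was flushed at its deadline.
        if self.pending and arrival_us - self.pending[0] >= self.timeout_us:
            self._flush(self.pending[0] + self.timeout_us)
        self.pending.append(arrival_us)
        # Batch is full: ring the doorbell immediately.
        if len(self.pending) >= self.batch_size:
            self._flush(arrival_us)

    def drain(self):
        # End of trace: the timer fires for whatever is left.
        if self.pending:
            self._flush(self.pending[0] + self.timeout_us)


def simulate(arrivals_us, batch_size=16, timeout_us=50.0):
    """Return the list of doorbell rings for a trace of arrival times (us)."""
    db = DelayedDoorbell(batch_size=batch_size, timeout_us=timeout_us)
    for t in sorted(arrivals_us):
        db.submit(t)
    db.drain()
    return db.flushed


def mean_added_delay(flushed):
    """Average time a request waits between arrival and doorbell (us)."""
    delays = [flush_t - a for flush_t, batch in flushed for a in batch]
    return sum(delays) / len(delays)


# High load: one request per microsecond, so batches fill before the timer fires.
busy = simulate(range(32))
# Low load: one request per 100 us, so every request waits the full timeout.
idle = simulate([0, 100, 200, 300])
print(len(busy), mean_added_delay(busy))
print(len(idle), mean_added_delay(idle))
```

Under these assumptions the busy trace needs one doorbell per 16 requests, while the idle trace rings once per request and each request pays the whole timeout, which is the latency cost listed under "drawback".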
- - - - how to model if we are working hard
      - how to model if we are inefficient
        
        throughput?
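
One possible way to separate "working hard" from "inefficient", sketched under the assumption that busy CPU time and completed IOs can be measured: utilization says how hard the cores work, while throughput-per-core (the metric used in the i10 results above) says how much useful work each busy core delivers. The function names and the sample numbers below are hypothetical.

```python
def utilization(busy_core_seconds, cores, wall_seconds):
    """Fraction of available CPU time spent busy ("working hard")."""
    return busy_core_seconds / (cores * wall_seconds)


def throughput_per_core(completed_ios, busy_core_seconds):
    """IOs completed per busy core-second (higher means more efficient)."""
    return completed_ios / busy_core_seconds


def cpu_us_per_io(completed_ios, busy_core_seconds):
    """CPU microseconds spent per IO (lower means more efficient)."""
    return busy_core_seconds * 1e6 / completed_ios


# Made-up sample: 1,000,000 IOs in 10 s of wall time on 4 cores,
# with 20 core-seconds of CPU time spent busy.
print(utilization(20, 4, 10))              # share of CPU capacity in use
print(throughput_per_core(1_000_000, 20))  # IOPS per busy core
print(cpu_us_per_io(1_000_000, 20))        # CPU cost per IO in microseconds
```

A stack can show high utilization with low throughput-per-core (busy but inefficient), so throughput alone does not answer both questions.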