Apache Kafka

Apache Kafka Series

Learn Apache Kafka for Beginners

Kafka Streams for Data Processing

  1. Kafka Introduction

Created by LinkedIn

  1. Kafka Theory

Distributed, resilient architecture, fault tolerant

Horizontal scalability

High performance - realtime

Decoupling of data streams and systems

100s of brokers

Millions of messages / second

Topics, partitions and offsets

Brokers

Topics

Partitions

Offsets

Topic replication factor

Producers

Leader for a Partition

Message Keys

Consumers

Consumer Groups

Consumer Offsets