Building data pipelines
Considerations when building data pipelines
Timeliness
delay between producers and consumers
Kafka acts as a buffer between producers and consumers
near real-time
batches
Reliability
delivery guarantees
at least once
exactly once
achieved by using an external datastore with unique keys (idempotent writes)
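As a rough sketch of the settings behind these delivery guarantees, the producer configuration below aims for at-least-once delivery where retries do not create duplicates; the broker address and serializers are illustrative assumptions, not values from the diagram.

import java.util.Properties;

// Minimal sketch: producer settings for at-least-once delivery with idempotent retries.
public class ReliableProducerConfig {
    public static Properties reliableProducerProps() {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092"); // assumed broker address
        props.put("key.serializer",
                "org.apache.kafka.common.serialization.StringSerializer");
        props.put("value.serializer",
                "org.apache.kafka.common.serialization.StringSerializer");
        props.put("acks", "all");                 // wait for all in-sync replicas
        props.put("enable.idempotence", "true");  // retries will not write duplicate records
        return props;
    }
}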
Throughput
scale dynamically when needed
Kafka Connect parallelism
several types of compression
Data Formats
supports different data formats through different serializers
useful when different frameworks expect different formats
Transformations
ETL
process as it passes through
save time and storage
can't get the raw data back or reprocess it later
ELT
process after storage
flexibility to users
ingest raw data
more CPU and storage space
Security
encrypting data
authentication
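A minimal sketch of what encryption and authentication look like on the client side, assuming TLS plus SASL/PLAIN; the broker address, truststore path, and credentials are placeholders.

import java.util.Properties;

// Minimal sketch: encrypt traffic with TLS and authenticate the client with SASL/PLAIN.
public class SecureClientConfig {
    public static Properties secureClientProps() {
        Properties props = new Properties();
        props.put("bootstrap.servers", "broker1:9093");      // assumed TLS listener
        props.put("security.protocol", "SASL_SSL");          // encryption + authentication
        props.put("ssl.truststore.location", "/etc/kafka/client.truststore.jks");
        props.put("ssl.truststore.password", "changeit");
        props.put("sasl.mechanism", "PLAIN");
        props.put("sasl.jaas.config",
                "org.apache.kafka.common.security.plain.PlainLoginModule required "
                + "username=\"pipeline\" password=\"pipeline-secret\";");
        return props;
    }
}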
Failure Handling
data retention
Coupling and Agility
ad-hoc pipelines
connecting each pair of frameworks directly results in many point-to-point couplings
a mess of integrations
effort to deploy, maintain, and monitor each one
loss of metadata
extreme processing
better to store raw data and let each consumer process it as needed
Kafka Connect
VS
Producer and Consumer
Kafka Clients
used when you can modify the application's code
the application pushes data to or pulls data from Kafka
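A minimal sketch of the client-based approach: an application you control pushes records to Kafka with the plain producer API. The topic name and broker address are assumptions.

import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerRecord;

// Minimal sketch: an application writes directly to Kafka with the producer client.
public class PipelineProducerExample {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092"); // assumed broker address
        props.put("key.serializer",
                "org.apache.kafka.common.serialization.StringSerializer");
        props.put("value.serializer",
                "org.apache.kafka.common.serialization.StringSerializer");

        try (KafkaProducer<String, String> producer = new KafkaProducer<>(props)) {
            // "pipeline-events" is an assumed topic name
            producer.send(new ProducerRecord<>("pipeline-events", "key-1", "value-1"));
            producer.flush();
        }
    }
}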
Kafka Connect
connects Kafka to an external datastore
used when you can't modify the datastore's code
directly dealing with external datastores
Kafka Connect
Architecture
Source
reads data from the source system, converts it, and hands it to the worker
Sink
receives data from the worker and writes it to the target system
Worker
configurations
group.id
bootstrap.servers
value.converter
key.converter
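A sketch of a distributed worker configuration (connect-distributed.properties) covering the settings listed above; the broker address and internal topic names are assumptions.

# Sketch of a distributed Kafka Connect worker config; values are illustrative.
bootstrap.servers=localhost:9092
group.id=connect-cluster
key.converter=org.apache.kafka.connect.json.JsonConverter
value.converter=org.apache.kafka.connect.json.JsonConverter
value.converter.schemas.enable=false
# internal topics where the workers store connector configs, offsets, and status
config.storage.topic=connect-configs
offset.storage.topic=connect-offsets
status.storage.topic=connect-status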
A Deeper Look
Connectors
decide how many tasks will run
split the data-copying work between the tasks (sketched below)
get configurations for the tasks from the workers and pass them along
Tasks
do the actual work of getting data in and out of Kafka
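A hypothetical sketch of the connector/task split: the connector decides how many tasks to run (capped by maxTasks) and builds one configuration per task, while each task does the actual data copying. The "tables" option, "task.slice" key, topic name, and TablesSourceConnector/TablesSourceTask classes are illustrative, not part of any real connector.

import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

import org.apache.kafka.common.config.ConfigDef;
import org.apache.kafka.connect.connector.Task;
import org.apache.kafka.connect.data.Schema;
import org.apache.kafka.connect.source.SourceConnector;
import org.apache.kafka.connect.source.SourceRecord;
import org.apache.kafka.connect.source.SourceTask;

// Hypothetical source connector: splits a list of "tables" across tasks.
public class TablesSourceConnector extends SourceConnector {
    private Map<String, String> connectorConfig;

    @Override
    public void start(Map<String, String> props) {
        this.connectorConfig = props; // configuration handed over by the worker
    }

    @Override
    public Class<? extends Task> taskClass() {
        return TablesSourceTask.class;
    }

    @Override
    public List<Map<String, String>> taskConfigs(int maxTasks) {
        // split the data-copying work: one slice of tables per task, capped at maxTasks
        String[] tables = connectorConfig.getOrDefault("tables", "t1,t2").split(",");
        int taskCount = Math.min(maxTasks, tables.length);
        List<Map<String, String>> configs = new ArrayList<>();
        for (int i = 0; i < taskCount; i++) {
            Map<String, String> taskConfig = new HashMap<>(connectorConfig);
            taskConfig.put("task.slice", Integer.toString(i));
            configs.add(taskConfig);
        }
        return configs;
    }

    @Override
    public void stop() { }

    @Override
    public ConfigDef config() {
        return new ConfigDef();
    }

    @Override
    public String version() {
        return "0.1";
    }

    // The task does the actual copying and hands records (with schema) to the worker.
    public static class TablesSourceTask extends SourceTask {
        @Override
        public void start(Map<String, String> props) { }

        @Override
        public List<SourceRecord> poll() throws InterruptedException {
            Thread.sleep(1000); // a real task would block until new data is available
            return Collections.singletonList(new SourceRecord(
                    Collections.singletonMap("table", "t1"),  // source partition
                    Collections.singletonMap("position", 0L), // source offset
                    "pipeline-events",                        // assumed target topic
                    Schema.STRING_SCHEMA,
                    "example row"));
        }

        @Override
        public void stop() { }

        @Override
        public String version() {
            return "0.1";
        }
    }
}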
Workers
execute the connectors and tasks
automatically commit offsets
expose the REST API for configuration (see the example below)
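A sketch of configuring a connector through the worker's REST API (default port 8083), here using Java's built-in HTTP client; the connector name, file path, and topic are assumptions, and the same call is commonly made with curl instead.

import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

// Sketch: create a connector by POSTing its configuration to the Connect REST API.
public class CreateConnectorExample {
    public static void main(String[] args) throws Exception {
        String body = """
            {
              "name": "file-source-demo",
              "config": {
                "connector.class": "org.apache.kafka.connect.file.FileStreamSourceConnector",
                "tasks.max": "1",
                "file": "/tmp/input.txt",
                "topic": "pipeline-events"
              }
            }
            """;

        HttpRequest request = HttpRequest.newBuilder()
                .uri(URI.create("http://localhost:8083/connectors")) // assumed worker address
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(body))
                .build();

        HttpResponse<String> response = HttpClient.newHttpClient()
                .send(request, HttpResponse.BodyHandlers.ofString());
        System.out.println(response.statusCode() + " " + response.body());
    }
}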
Converters and connect data model
the source connector reads data and generates schema-plus-value records
the sink connector gets schema-plus-value records and writes the data to the target system
the converter translates between these records and the format stored in Kafka (JSON, Avro, etc.)
Use Cases
Kafka as an endpoint
Intermediary between frameworks