Dataset API, Flink - Coggle Diagram
Dataset API
Stream
Spark
Hadoop
Flink
Dataset API
Source input
Read from File
Dataset
String
Tuple1 to Tuple25
Transformations
FlatMap (one input element, zero or more output elements)
Filter
Map (one input element, exactly one output element)
Execute
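
The three transformations above differ only in how many outputs each input element produces. A minimal sketch of those semantics in plain Python follows; it does not use Flink, and the helper `flat_map` and the sample lines are made up for illustration. It also runs eagerly, whereas a Flink program only runs once execution is triggered.

```python
from itertools import chain


def flat_map(func, items):
    """FlatMap: func returns an iterable of zero or more outputs per input."""
    return list(chain.from_iterable(func(item) for item in items))


# Hypothetical input, standing in for lines read from a file.
lines = ["to be or", "not to be"]

# FlatMap: one line in, several words out.
words = flat_map(str.split, lines)

# Filter: keep or drop each element, never change it.
kept = [word for word in words if word != "to"]

# Map: exactly one output per input, here a (word, 1) pair.
pairs = [(word, 1) for word in kept]

print(words)
print(pairs)
```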
Join
Hints
Broadcast - copy one of the two datasets into memory on every node
Repartition - partition the datasets by key and build a hash table for fast lookup
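
The difference between the two hints can be sketched in plain Python, with lists standing in for the partitions held by each node. This is an illustration of the idea only, not Flink's implementation; the function names and sample records are hypothetical.

```python
from collections import defaultdict


def build_hash_table(records):
    """Index (key, value) records by key for constant-time lookup."""
    table = defaultdict(list)
    for key, value in records:
        table[key].append(value)
    return table


def broadcast_hash_join(small, large_partitions):
    """Broadcast: every partition of the large side gets a full copy of the small side."""
    results = []
    for partition in large_partitions:
        table = build_hash_table(small)  # the per-node copy
        for key, value in partition:
            for small_value in table.get(key, []):
                results.append((key, small_value, value))
    return results


def repartition_hash_join(left, right, num_partitions):
    """Repartition: both sides are split by key hash, so matching keys meet in one partition."""
    left_parts = [[] for _ in range(num_partitions)]
    right_parts = [[] for _ in range(num_partitions)]
    for key, value in left:
        left_parts[hash(key) % num_partitions].append((key, value))
    for key, value in right:
        right_parts[hash(key) % num_partitions].append((key, value))

    results = []
    for left_part, right_part in zip(left_parts, right_parts):
        table = build_hash_table(left_part)  # only this partition's keys
        for key, value in right_part:
            for left_value in table.get(key, []):
                results.append((key, left_value, value))
    return results


# Made-up sample data: (id, name) joined with (id, city).
people = [(1, "Alice"), (2, "Bob")]
cities = [(1, "NY"), (2, "LA"), (1, "SF")]

broadcast_result = broadcast_hash_join(people, [cities[:2], cities[2:]])
repartition_result = repartition_hash_join(people, cities, num_partitions=2)
```

Both strategies produce the same joined records. Broadcasting avoids shuffling the large side but needs the small side to fit in each node's memory, while repartitioning shuffles both sides but keeps each hash table small.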
Optimization
Read from file into primitive types
Read from CSV
GroupBy
Sum
Execute Method
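
The CSV, GroupBy and Sum steps above can be sketched in plain Python as follows. This models the data flow only and is not Flink code; the CSV contents and helper names are made up. In Flink's DataSet API the grouping and aggregation are written on tuple field positions, in the style of `groupBy(0).sum(1)`.

```python
import csv
import io
from collections import defaultdict

# Hypothetical CSV content, standing in for a file on disk.
CSV_TEXT = "apple,3\nbanana,5\napple,4\n"


def read_csv_as_tuples(text):
    """Parse each CSV row into a (name, count) tuple with a typed count."""
    return [(name, int(count)) for name, count in csv.reader(io.StringIO(text))]


def group_by_and_sum(rows, key_index, sum_index):
    """Group rows on one tuple field and sum another field within each group."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[key_index]] += row[sum_index]
    return sorted(totals.items())


rows = read_csv_as_tuples(CSV_TEXT)
totals = group_by_and_sum(rows, key_index=0, sum_index=1)
print(totals)
```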