Please enable JavaScript.

Coggle requires JavaScript to display documents.

APACHE SPARK (Solve a ML problem (a DATAFRAME will be returned with SCHEMA…

- - - - paralleled computations use SHARED variables
      - broadcast && accumulators are examples of shared variables
    - - Transformations
        
        apply to each element of a RDD
        
        only apply, NOT change RDD
        
        like map
      - Action
        
        like Reduce
        
        agg/reduce based on keys