Coggle requires JavaScript to display documents.
offline batch processing , SQL query qnd data mining, graph computing and machine learningand can be seamlessly used in an applicaiton
Performance of Spark is ver x100 higher than that of MapReducewith multiple iterationsand not soo many advantages for fewer iterations
map / flatMap , filter and reduceByKey
take / count / saveAsTextFile
YARN- client is suitable for testing , whereas YARN-Cluster is suitable for productionif the task submission node in YARN-Client mode is down and the entire task fails << such a situation in YARN-Cluster mode will not affect the entire task
just a module that can parse SQL language into RDDs and use Spark Core to Execute
Data can be injected from many sources like Kafka , HDFS and can be processed using complex algorithms expressed with high level function like Map , Reduce , Join and WindowFinally processed data can be pushed out to file systems and databases