Please enable JavaScript.
Coggle requires JavaScript to display documents.
Big Data-(pySpark) - Coggle Diagram
Big Data-(pySpark)
Apache Spark
-
Mostly we use SparkSession:
Spark session is a unified entry point of a spark application from Spark 2.0. It provides a way to interact with various spark’s functionality with a lesser number of constructs. Instead of having a spark context, hive context, SQL context, now all of it is encapsulated in a Spark session.
-
-
-
-
Section-5
-
-
-
-
Set Log Level i.e.info,error etc..
Deal With DataSkew Issue
-
-
-
use repartition() and pass number of core,It'll nicely
Spread out.
-
-
-
-
-
-
-
-
-