Tutorial by Examples

In Apache Spark while doing shuffle operations like join and cogroup a lot of data gets transferred across network. Now, to control the number of partitions over which shuffle happens can be controlled by configurations given in Spark SQL. That configuration is as follows: spark.sql.shuffle.partiti...

Page 1 of 1