user2691592

Reputation: 11

Typo in spark.sql.autoBroadcastJoinThreshold ...?

I think I may have found a typo in Spark version 3.1.1. I am using Scala version 2.12.10 (OpenJDK 64-Bit Server VM, Java 11.0.11)

scala> spark.conf.get("spark.sql.autoBroadcastJoinThreshold")

res0: String = 10485760b

But it should possibly be: 104857600.

Therefore:

scala> spark.conf.set("spark.sql.autoBroadcastJoinThreshold", 104857600)

When you deploy with "10485760b", Spark cannot detect that one of the joined DataFrames is small (10 MB by default), and the automatic broadcast join detection may effectively be disabled; see the sketch below. I hope this helps someone.
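To check this yourself, here is a minimal spark-shell sketch (the DataFrames are made-up toy data, just for illustration):

// Build one small and one large DataFrame and inspect the join plan.
val small = spark.range(100).toDF("id")
val large = spark.range(1000000).toDF("id")

// With the default 10 MB threshold, the physical plan should contain
// a BroadcastHashJoin for the small side.
large.join(small, "id").explain()

// Setting the threshold to -1 disables automatic broadcast joins,
// so the same query typically falls back to a SortMergeJoin.
spark.conf.set("spark.sql.autoBroadcastJoinThreshold", -1)
large.join(small, "id").explain()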

Upvotes: 1

Views: 1909

Answers (1)

Michael Heil

Reputation: 18495

This is not a typo but the correct value.

According to the documentation on Spark configuration, spark.sql.autoBroadcastJoinThreshold has a default value of 10MB and is defined as:

"Configures the maximum size in bytes for a table that will be broadcast to all worker nodes when performing a join."

Your proposed value of 104857600 would result in 104857600 / 1024 / 1024 = 100MB, which could potentially harm your application's performance.
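You can verify how Spark parses these size strings with its bundled JavaUtils helper (an internal class, so not a stable API, but available on the spark-shell classpath); a minimal sketch:

import org.apache.spark.network.util.JavaUtils

// "10485760b" parses to exactly 10485760 bytes, i.e. the documented 10MB default.
JavaUtils.byteStringAsBytes("10485760b")                 // 10485760
JavaUtils.byteStringAsBytes("10485760b") / 1024 / 1024   // 10  (MB)

// Your proposed value, by contrast, is ten times larger:
JavaUtils.byteStringAsBytes("104857600") / 1024 / 1024   // 100 (MB)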

In addition, the beginning of the documentation explains what the "b" stands for:

[Screenshot of the Spark configuration documentation's size-unit section, showing that a trailing "b" denotes bytes]
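So the trailing "b" is simply the bytes unit. If I read the size-string syntax correctly, these forms should all set the same 10MB threshold:

spark.conf.set("spark.sql.autoBroadcastJoinThreshold", "10485760b")  // explicit bytes suffix
spark.conf.set("spark.sql.autoBroadcastJoinThreshold", "10mb")       // mebibytes suffix
spark.conf.set("spark.sql.autoBroadcastJoinThreshold", 10485760)     // plain number = bytes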

Upvotes: 2
