DatabricksDatabricks
DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK · Question #126
DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK Question #126: Real Exam Question with Answer & Explanation
Sign in or unlock DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK to reveal the answer and full explanation for question #126. The question stem and answer options stay visible for context.
Spark Performance Tuning
Question
The default value of spark.sql.shuffle.partitions is 200. Which of the following describes what that means?
Options
- ABy default, all DataFrames in Spark will be spit to perfectly fill the memory of 200 executors.
- BBy default, new DataFrames created by Spark will be split to perfectly fill the memory of 200
- CBy default, Spark will only read the first 200 partitions of DataFrames to improve speed.
- DBy default, all DataFrames in Spark, including existing DataFrames, will be split into 200 unique
- EBy default, DataFrames will be split into 200 unique partitions when data is being shuffled.
Unlock DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK to see the answer
You've previewed enough free DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK questions. Unlock DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.
Topics
#Spark Configuration#Spark SQL Shuffle#DataFrame Partitions