nerdexam
DatabricksDatabricks

DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK · Question #28

DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK Question #28: Real Exam Question with Answer & Explanation

Sign in or unlock DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK to reveal the answer and full explanation for question #28. The question stem and answer options stay visible for context.

Spark DataFrame Partitioning and Shuffling

Question

The code block shown below contains an error. The code block is intended to return a new 12- partition DataFrame from the 8-partition DataFrame storesDF by inducing a shuffle. Identify the error. Code block: storesDF.coalesce(12)

Options

  • AThe coalesce() operation cannot guarantee the number of target partitions - the repartition()
  • BThe coalesce() operation does not induce a shuffle and cannot increase the number of partitions -
  • CThe coalesce() operation will only work if the DataFrame has been cached to memory - the
  • DThe coalesce() operation requires a column by which to partition rather than a number of
  • EThe number of resulting partitions, 12, is not achievable for an 8-partition DataFrame.

Unlock DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK to see the answer

You've previewed enough free DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK questions. Unlock DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Topics

#Spark DataFrame#Partitioning#Shuffle#coalesce vs repartition
Full DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK PracticeBrowse All DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK Questions