DatabricksDatabricks
DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK · Question #174
DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK Question #174: Real Exam Question with Answer & Explanation
The correct answer is C: storesDF.repartition(). See the full explanation below for the reasoning.
Spark DataFrame Operations and Data Distribution
Question
Which of the following operations will always return a new DataFrame with updated partitions from DataFrame storesDF by inducing a shuffle?
Options
- AstoresDF.coalesce()
- BstoresDF.rdd.getNumPartitions()
- CstoresDF.repartition()
- DstoresDF.union()
- EstoresDF.intersect()
Topics
#DataFrame Transformations#Data Partitioning#Shuffle Operations#Performance Implications
Community Discussion
No community discussion yet for this question.