nerdexam
DatabricksDatabricks

DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK · Question #12

DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK Question #12: Real Exam Question with Answer & Explanation

Sign in or unlock DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK to reveal the answer and full explanation for question #12. The question stem and answer options stay visible for context.

Efficiently use Spark SQL and DataFrame API for data transformations and analysis.

Question

Which of the following code blocks will most quickly return an approximation for the number of distinct values in column division in DataFrame storesDF?

Options

  • AstoresDF.agg(approx_count_distinct(col("division")).alias("divisionDistinct"))
  • BstoresDF.agg(approx_count_distinct(col("division"), 0.01).alias("divisionDistinct"))
  • CstoresDF.agg(approx_count_distinct(col("division"), 0.15).alias("divisionDistinct"))
  • DstoresDF.agg(approx_count_distinct(col("division"), 0.0).alias("divisionDistinct"))
  • EstoresDF.agg(approx_count_distinct(col("division"), 0.05).alias("divisionDistinct"))

Unlock DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK to see the answer

You've previewed enough free DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK questions. Unlock DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Topics

#Spark SQL Functions#DataFrame Aggregations#Performance Optimization#Distinct Count Approximation
Full DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK PracticeBrowse All DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK Questions