nerdexam
DatabricksDatabricks

CERTIFIED-DATA-ENGINEER-PROFESSIONAL · Question #36

CERTIFIED-DATA-ENGINEER-PROFESSIONAL Question #36: Real Exam Question with Answer & Explanation

Sign in or unlock CERTIFIED-DATA-ENGINEER-PROFESSIONAL to reveal the answer and full explanation for question #36. The question stem and answer options stay visible for context.

Optimizing Data Lake Performance

Question

A Delta Lake table representing metadata about content posts from users has the following schema: user_id LONG, post_text STRING, post_id STRING, longitude FLOAT, latitude FLOAT, post_time TIMESTAMP, date DATE This table is partitioned by the date column. A query is run with the following filter: longitude < 20 & longitude > -20 Which statement describes how data will be filtered?

Options

  • AStatistics in the Delta Log will be used to identify partitions that might Include files in the filtered
  • BNo file skipping will occur because the optimizer does not know the relationship between the
  • CThe Delta Engine will use row-level statistics in the transaction log to identify the flies that meet
  • DStatistics in the Delta Log will be used to identify data files that might include records in the
  • EThe Delta Engine will scan the parquet file footers to identify each row that meets the filter criteria.

Unlock CERTIFIED-DATA-ENGINEER-PROFESSIONAL to see the answer

You've previewed enough free CERTIFIED-DATA-ENGINEER-PROFESSIONAL questions. Unlock CERTIFIED-DATA-ENGINEER-PROFESSIONAL for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Topics

#Delta Lake#Query Optimization#File Skipping#Column Statistics
Full CERTIFIED-DATA-ENGINEER-PROFESSIONAL PracticeBrowse All CERTIFIED-DATA-ENGINEER-PROFESSIONAL Questions