Databricks


CERTIFIED-DATA-ENGINEER-PROFESSIONAL Question #71: Real Exam Question with Answer & Explanation


Data Ingestion and Transformation

Question

A data engineer is configuring a pipeline that will potentially see late-arriving, duplicate records. In addition to de-duplicating records within the batch, which of the following approaches allows the data engineer to deduplicate data against previously processed records as it is inserted into a Delta table?

Options

  • A. Set the configuration delta.deduplicate = true.
  • B. VACUUM the Delta table after each batch completes.
  • C. Perform an insert-only merge with a matching condition on a unique key.
  • D. Perform a full outer join on a unique key and overwrite existing data.
  • E. Rely on Delta Lake schema enforcement to prevent duplicate records.
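
To make the insert-only merge named in option C concrete, here is a minimal sketch of its semantics. It is a pure-Python model, not the Delta Lake API: the target table is modeled as a dict keyed by the unique key, and the names `events`, `new_events`, `event_id`, `dedupe_within_batch`, and `insert_only_merge` are hypothetical. The SQL string shows the usual `MERGE INTO ... WHEN NOT MATCHED THEN INSERT *` form for reference only and is never executed.

```python
from typing import Any, Dict, Hashable, Iterable, List

Row = Dict[str, Any]

# Reference only: the Delta Lake SQL shape of an insert-only merge.
# Table and column names are hypothetical; this string is never executed here.
INSERT_ONLY_MERGE_SQL = """
MERGE INTO events AS t
USING new_events AS s
ON t.event_id = s.event_id
WHEN NOT MATCHED THEN INSERT *
"""


def dedupe_within_batch(batch: Iterable[Row], key: str) -> List[Row]:
    """Keep the first row seen for each key value inside one batch."""
    first_seen: Dict[Hashable, Row] = {}
    for row in batch:
        first_seen.setdefault(row[key], row)
    return list(first_seen.values())


def insert_only_merge(target: Dict[Hashable, Row], batch: Iterable[Row], key: str) -> int:
    """Insert rows whose key is absent from the target; never update or delete.

    Returns the number of rows inserted.
    """
    inserted = 0
    for row in dedupe_within_batch(batch, key):
        if row[key] not in target:  # corresponds to WHEN NOT MATCHED
            target[row[key]] = row
            inserted += 1
    return inserted


# Previously processed records (the "target table" in this model).
events: Dict[Hashable, Row] = {1: {"event_id": 1, "payload": "original"}}

# A new batch with a late-arriving duplicate of key 1 and an in-batch duplicate of key 2.
new_events: List[Row] = [
    {"event_id": 1, "payload": "late duplicate"},
    {"event_id": 2, "payload": "first"},
    {"event_id": 2, "payload": "in-batch duplicate"},
    {"event_id": 3, "payload": "new"},
]

inserted_count = insert_only_merge(events, new_events, key="event_id")
```

The in-batch step matters because the merge condition only compares incoming rows against rows already in the target, so two identical keys in the same batch would both count as unmatched and both be inserted.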


Topics

#Delta Lake  #Data Deduplication  #MERGE INTO  #Upsert