CERTIFIED-DATA-ENGINEER-PROFESSIONAL · Question #71
CERTIFIED-DATA-ENGINEER-PROFESSIONAL Question #71: Real Exam Question with Answer & Explanation
Sign in or unlock CERTIFIED-DATA-ENGINEER-PROFESSIONAL to reveal the answer and full explanation for question #71. The question stem and answer options stay visible for context.
Question
A data engineer is configuring a pipeline that will potentially see late-arriving, duplicate records. In addition to de-duplicating records within the batch, which of the following approaches allows the data engineer to deduplicate data against previously processed records as it is inserted into a Delta table?
Options
- ASet the configuration delta.deduplicate = true.
- BVACUUM the Delta table after each batch completes.
- CPerform an insert-only merge with a matching condition on a unique key.
- DPerform a full outer join on a unique key and overwrite existing data.
- ERely on Delta Lake schema enforcement to prevent duplicate records.
Unlock CERTIFIED-DATA-ENGINEER-PROFESSIONAL to see the answer
You've previewed enough free CERTIFIED-DATA-ENGINEER-PROFESSIONAL questions. Unlock CERTIFIED-DATA-ENGINEER-PROFESSIONAL for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.