nerdexam
DatabricksDatabricks

CERTIFIED-DATA-ENGINEER-PROFESSIONAL · Question #13

CERTIFIED-DATA-ENGINEER-PROFESSIONAL Question #13: Real Exam Question with Answer & Explanation

Sign in or unlock CERTIFIED-DATA-ENGINEER-PROFESSIONAL to reveal the answer and full explanation for question #13. The question stem and answer options stay visible for context.

Data Ingestion

Question

An upstream system is emitting change data capture (CDC) logs that are being written to a cloud object storage directory. Each record in the log indicates the change type (insert, update, or delete) and the values for each field after the change. The source table has a primary key identified by the field pk_id. For auditing purposes, the data governance team wishes to maintain a full record of all values that have ever been valid in the source system. For analytical purposes, only the most recent value for each record needs to be recorded. The Databricks job to ingest these records occurs once per hour, but each individual record may have changed multiple times over the course of an hour. Which solution meets these requirements?

Options

  • ACreate a separate history table for each pk_id resolve the current state of the table by running a
  • BUse merge into to insert, update, or delete the most recent entry for each pk_id into a bronze
  • CIterate through an ordered set of changes to the table, applying each in turn; rely on Delta Lake's
  • DUse Delta Lake's change data feed to automatically process CDC data from an external system,
  • EIngest all log information into a bronze table; use merge into to insert, update, or delete the most

Unlock CERTIFIED-DATA-ENGINEER-PROFESSIONAL to see the answer

You've previewed enough free CERTIFIED-DATA-ENGINEER-PROFESSIONAL questions. Unlock CERTIFIED-DATA-ENGINEER-PROFESSIONAL for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Topics

#Change Data Capture (CDC)#Delta Lake#Medallion Architecture#Data Ingestion
Full CERTIFIED-DATA-ENGINEER-PROFESSIONAL PracticeBrowse All CERTIFIED-DATA-ENGINEER-PROFESSIONAL Questions