nerdexam
GoogleGoogle

PROFESSIONAL-DATA-ENGINEER · Question #21

PROFESSIONAL-DATA-ENGINEER Question #21: Real Exam Question with Answer & Explanation

Sign in or unlock PROFESSIONAL-DATA-ENGINEER to reveal the answer and full explanation for question #21. The question stem and answer options stay visible for context.

Submitted by satoshi_tk· Mar 30, 2026

Question

Your company uses a proprietary system to send inventory data every 6 hours to a data ingestion service in the cloud. Transmitted data includes a payload of several fields and the timestamp of the transmission. If there are any concerns about a transmission, the system re-transmits the data. How should you deduplicate the data most efficiency?

Options

  • AAssign global unique identifiers (GUID) to each data entry.
  • BCompute the hash value of each data entry, and compare it with all historical data.
  • CStore each data entry as the primary key in a separate database and apply an index.
  • DMaintain a database table to store the hash value and other metadata for each data entry.

Unlock PROFESSIONAL-DATA-ENGINEER to see the answer

You've previewed enough free PROFESSIONAL-DATA-ENGINEER questions. Unlock PROFESSIONAL-DATA-ENGINEER for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Full PROFESSIONAL-DATA-ENGINEER PracticeBrowse All PROFESSIONAL-DATA-ENGINEER Questions