PROFESSIONAL-DATA-ENGINEER · Question #21
PROFESSIONAL-DATA-ENGINEER Question #21: Real Exam Question with Answer & Explanation
Sign in or unlock PROFESSIONAL-DATA-ENGINEER to reveal the answer and full explanation for question #21. The question stem and answer options stay visible for context.
Question
Your company uses a proprietary system to send inventory data every 6 hours to a data ingestion service in the cloud. Transmitted data includes a payload of several fields and the timestamp of the transmission. If there are any concerns about a transmission, the system re-transmits the data. How should you deduplicate the data most efficiency?
Options
- AAssign global unique identifiers (GUID) to each data entry.
- BCompute the hash value of each data entry, and compare it with all historical data.
- CStore each data entry as the primary key in a separate database and apply an index.
- DMaintain a database table to store the hash value and other metadata for each data entry.
Unlock PROFESSIONAL-DATA-ENGINEER to see the answer
You've previewed enough free PROFESSIONAL-DATA-ENGINEER questions. Unlock PROFESSIONAL-DATA-ENGINEER for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.