PROFESSIONAL-DATA-ENGINEER · Question #167
Question
After migrating ETL jobs to run on BigQuery, you need to verify that the output of the migrated jobs is the same as the output of the original. You've loaded a table containing the output of the original job and want to compare the contents with output from the migrated job to show that they are identical. The tables do not contain a primary key column that would enable you to join them together for comparison. What should you do?
Options
- A. Select random samples from the tables using the RAND() function and compare the samples.
- B. Select random samples from the tables using the HASH() function and compare the samples.
- C. Use a Dataproc cluster and the BigQuery Hadoop connector to read the data from each table and calculate a hash from non-timestamp columns of the table.
- D. Create stratified random samples using the OVER() function and compare equivalent samples from each table.
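The hash-based comparison that option C describes can be illustrated with a small, self-contained sketch. It does not use Dataproc or the BigQuery Hadoop connector; it only shows the underlying idea on in-memory rows, under these assumptions: each row is a Python dict, timestamp columns are known by name, and the helper names (`row_digest`, `table_fingerprint`) and the sample columns are hypothetical. Each row is hashed with the excluded columns dropped, the row digests are sorted so that row order does not matter, and the sorted digests are hashed again into one fingerprint per table.

```python
import hashlib
import json


def row_digest(row, exclude=()):
    """Hash one row, ignoring the columns named in `exclude` (hypothetical helper)."""
    kept = {k: v for k, v in row.items() if k not in exclude}
    # sort_keys makes the serialization independent of column order;
    # default=str is an assumption that covers values JSON cannot encode.
    payload = json.dumps(kept, sort_keys=True, default=str)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def table_fingerprint(rows, exclude=()):
    """Combine row digests into one order-independent fingerprint."""
    # Sorting the digests removes any dependence on row order while
    # still counting duplicate rows, since each duplicate contributes a digest.
    digests = sorted(row_digest(r, exclude) for r in rows)
    combined = hashlib.sha256()
    for d in digests:
        combined.update(d.encode("ascii"))
    return combined.hexdigest()


# Illustrative data only: same content, different row order and load times.
original = [
    {"customer": "a", "amount": 10, "loaded_at": "2024-01-01T00:00:00"},
    {"customer": "b", "amount": 20, "loaded_at": "2024-01-01T00:00:00"},
]
migrated = [
    {"customer": "b", "amount": 20, "loaded_at": "2024-02-01T09:30:00"},
    {"customer": "a", "amount": 10, "loaded_at": "2024-02-01T09:30:00"},
]

same = table_fingerprint(original, exclude=("loaded_at",)) == table_fingerprint(
    migrated, exclude=("loaded_at",)
)
print("tables match:", same)
```

Unlike the sampling approaches in the other options, a fingerprint of this kind covers every row, so a single differing value changes the result. Excluding timestamp columns reflects the assumption that load times legitimately differ between the original and migrated runs.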