PROFESSIONAL-DATA-ENGINEER · Question #193
Question
You are creating a new pipeline in Google Cloud to stream IoT data from Cloud Pub/Sub through Cloud Dataflow to BigQuery. While previewing the data, you notice that roughly 2% of the data appears to be corrupt. You need to modify the Cloud Dataflow pipeline to filter out this corrupt data. What should you do?
Options
- A. Add a SideInput that returns a Boolean if the element is corrupt.
- B. Add a ParDo transform in Cloud Dataflow to discard corrupt elements.
- C. Add a Partition transform in Cloud Dataflow to separate valid data from corrupt data.
- D. Add a GroupByKey transform in Cloud Dataflow to group all of the valid data together and discard the rest.
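To make option B concrete, here is a minimal sketch of a ParDo that drops elements, written against the Apache Beam Python SDK (the SDK that Cloud Dataflow runs). The question does not say what "corrupt" means, so the check below is an assumption: each Pub/Sub payload is taken to be a JSON object with a string `device_id` and a numeric `temperature`. Both field names and the class name `DiscardCorrupt` are hypothetical. A `DoFn.process` method may emit zero or more outputs per input, so emitting nothing for a bad element removes it from the output PCollection.

```python
import json

try:
    import apache_beam as beam
    _DoFnBase = beam.DoFn
except ImportError:
    # Fallback so the validation logic can be exercised without Beam installed.
    _DoFnBase = object


class DiscardCorrupt(_DoFnBase):
    """Emits parsed records and silently drops corrupt ones.

    Assumption: a valid payload is a JSON object with a string
    'device_id' and a numeric 'temperature' (hypothetical schema).
    """

    def process(self, element):
        try:
            record = json.loads(element)
        except (ValueError, TypeError):
            # Malformed JSON or undecodable bytes: emit nothing.
            return
        if not isinstance(record, dict):
            return
        if not isinstance(record.get("device_id"), str):
            return
        if not isinstance(record.get("temperature"), (int, float)):
            return
        yield record


# Hypothetical wiring inside a pipeline, where `messages` is the
# PCollection read from Pub/Sub:
#   valid = messages | "DiscardCorrupt" >> beam.ParDo(DiscardCorrupt())
```

Calling `process` directly shows the behavior: a well-formed payload yields one parsed record, while a malformed payload yields nothing.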