PROFESSIONAL-DATA-ENGINEER · Question #32
PROFESSIONAL-DATA-ENGINEER Question #32: Real Exam Question with Answer & Explanation
Sign in or unlock PROFESSIONAL-DATA-ENGINEER to reveal the answer and full explanation for question #32. The question stem and answer options stay visible for context.
Question
Your company is running their first dynamic campaign, serving different offers by analyzing real-time data during the holiday season. The data scientists are collecting terabytes of data that rapidly grows every hour during their 30-day campaign. They are using Google Cloud Dataflow to preprocess the data and collect the feature (signals) data that is needed for the machine learning model in Google Cloud Bigtable. The team is observing suboptimal performance with reads and writes of their initial load of 10 TB of data. They want to improve this performance while minimizing cost. What should they do?
Options
- ARedefine the schema by evenly distributing reads and writes across the row space of the table.
- BThe performance issue should be resolved over time as the site of the BigDate cluster is increased.
- CRedesign the schema to use a single row key to identify values that need to be updated frequently in the cluster.
- DRedesign the schema to use row keys based on numeric IDs that increase sequentially per user viewing the offers.
Unlock PROFESSIONAL-DATA-ENGINEER to see the answer
You've previewed enough free PROFESSIONAL-DATA-ENGINEER questions. Unlock PROFESSIONAL-DATA-ENGINEER for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.