CERTIFIED-DATA-ANALYST-ASSOCIATE · Question #36
The correct answer is A: The required compute resources could be costly.
Question
A data engineering team has created a Structured Streaming pipeline that processes data in micro-batches and populates gold-level tables. The micro-batches are triggered every minute. A data analyst has created a dashboard based on this gold-level data. The project stakeholders want to see the results in the dashboard updated within one minute or less of new data becoming available within the gold-level tables. Which of the following cautions should the data analyst share prior to setting up the dashboard to complete this task?
Options
- A. The required compute resources could be costly
- B. The gold-level tables are not appropriately clean for business reporting
- C. The streaming data is not an appropriate data source for a dashboard
- D. The streaming cluster is not fault tolerant
- E. The dashboard cannot be refreshed that quickly
Explanation
Refreshing the dashboard within one minute of new gold-level data means re-querying the gold-level tables at least once a minute, so the compute that serves those queries has to stay running continuously instead of shutting down when idle. That can become very expensive, and it is the legitimate caution a responsible data analyst should raise before committing to that SLA.
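To make the cost argument concrete, here is a rough back-of-the-envelope sketch in Python. It is only an illustration under stated assumptions: the hourly rate, query runtime, and idle timeout are hypothetical numbers, not actual Databricks pricing, and the model ignores startup time and scaling. The names `RefreshPlan`, `billed_hours_per_day`, and `daily_cost` are invented for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RefreshPlan:
    """Hypothetical description of a dashboard refresh schedule."""
    refresh_interval_min: float  # minutes between scheduled refreshes
    query_runtime_min: float     # minutes the refresh queries take to run
    auto_stop_idle_min: float    # idle minutes before the compute shuts down


def billed_hours_per_day(plan: RefreshPlan) -> float:
    """Estimate how many hours per day the compute is running (and billed).

    Simplified model: after each refresh the compute stays up until the
    idle timeout expires. If the next refresh arrives before that, the
    compute never gets a chance to stop.
    """
    idle_gap = plan.refresh_interval_min - plan.query_runtime_min
    if idle_gap <= plan.auto_stop_idle_min:
        return 24.0  # effectively always on
    refreshes_per_day = 24 * 60 / plan.refresh_interval_min
    minutes_up_per_refresh = plan.query_runtime_min + plan.auto_stop_idle_min
    return refreshes_per_day * minutes_up_per_refresh / 60


def daily_cost(plan: RefreshPlan, rate_per_hour: float) -> float:
    """Daily cost at an assumed flat hourly rate (hypothetical units)."""
    return billed_hours_per_day(plan) * rate_per_hour


if __name__ == "__main__":
    rate = 4.0  # assumed cost units per compute-hour, purely illustrative
    every_minute = RefreshPlan(refresh_interval_min=1, query_runtime_min=0.25, auto_stop_idle_min=10)
    hourly = RefreshPlan(refresh_interval_min=60, query_runtime_min=0.25, auto_stop_idle_min=10)
    print("every-minute refresh:", daily_cost(every_minute, rate))
    print("hourly refresh:", daily_cost(hourly, rate))
```

Under these assumed numbers, the every-minute schedule keeps the compute up all 24 hours, while the hourly schedule bills only a few hours a day. The exact ratio depends entirely on the inputs, but the shape of the result is what option A is pointing at.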
Why the distractors are wrong:
- B is wrong because gold-level tables in a medallion architecture are by definition cleansed and business-ready - that is their purpose.
- C is wrong because streaming data sources are entirely appropriate for dashboards; near-real-time dashboarding is a standard use case.
- D is wrong because Structured Streaming is fault-tolerant by design, using checkpointing and write-ahead logs to recover from failures.
- E is wrong because dashboards can be refreshed that quickly - the limitation is economic, not technical.
Memory tip: In medallion architecture questions, remember Bronze = raw, Silver = validated, Gold = business-ready. When you see a question about gold tables + dashboards, the data quality and appropriateness concerns are already solved. Shift your thinking to operational concerns - and continuous compute for frequent refresh cycles is a real cost driver.