nerdexam
Databricks

CERTIFIED-DATA-ANALYST-ASSOCIATE Question #36: Real Exam Question with Answer & Explanation

The correct answer is A: The required compute resources could be costly.

Question

A data engineering team has created a Structured Streaming pipeline that processes data in micro-batches and populates gold-level tables. The micro-batches are triggered every minute. A data analyst has created a dashboard based on this gold-level data. The project stakeholders want to see the results in the dashboard updated within one minute or less of new data becoming available within the gold-level tables. Which of the following cautions should the data analyst share prior to setting up the dashboard to complete this task?

Options

  • A. The required compute resources could be costly
  • B. The gold-level tables are not appropriately clean for business reporting
  • C. The streaming data is not an appropriate data source for a dashboard
  • D. The streaming cluster is not fault tolerant
  • E. The dashboard cannot be refreshed that quickly

Explanation

Meeting a refresh target of one minute or less means the dashboard must re-query the gold-level tables about once a minute, so the compute behind it (in Databricks SQL, the SQL warehouse) has to stay running continuously instead of shutting down when idle. That can become very expensive, and it is the legitimate caution a responsible data analyst should raise before committing to that SLA.
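To make the cost argument concrete, here is a rough back-of-the-envelope sketch. Every figure in it is a hypothetical assumption chosen for easy arithmetic (the DBU consumption rate, the price per DBU, and the hours of activity per day); none of them are actual Databricks prices, and the function name is invented for illustration.

```python
def monthly_warehouse_cost(dbu_per_hour, usd_per_dbu, active_hours_per_day, days=30):
    """Estimate monthly compute cost from a flat hourly consumption rate."""
    return dbu_per_hour * usd_per_dbu * active_hours_per_day * days


# Hypothetical figures, NOT real pricing.
DBU_PER_HOUR = 12.0   # assumed consumption of the warehouse while running
USD_PER_DBU = 0.5     # assumed price per DBU

# Refreshing every minute: the warehouse never gets a chance to stop.
always_on = monthly_warehouse_cost(DBU_PER_HOUR, USD_PER_DBU, active_hours_per_day=24)

# Refreshing hourly: assume about 10 minutes of activity per hour
# (query time plus idle time before it stops), i.e. 4 hours per day.
hourly_refresh = monthly_warehouse_cost(DBU_PER_HOUR, USD_PER_DBU, active_hours_per_day=4.0)

print(f"Always on:      ${always_on:,.2f} per month")
print(f"Hourly refresh: ${hourly_refresh:,.2f} per month")
print(f"Ratio:          {always_on / hourly_refresh:.1f}x")
```

Under these assumed numbers, the always-on configuration costs six times as much as the hourly one. The exact ratio depends entirely on the assumptions, but the direction of the effect is what answer A is pointing at.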

Why the distractors are wrong:

  • B is wrong because gold-level tables in a medallion architecture are, by definition, cleansed and business-ready; that is their purpose.
  • C is wrong because streaming data sources are entirely appropriate for dashboards; near-real-time dashboarding is a standard use case.
  • D is wrong because Structured Streaming is fault-tolerant by design, using checkpointing and write-ahead logs to recover from failures.
  • E is wrong because dashboards can be refreshed that quickly; the limitation is economic, not technical.
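
For context on the pipeline the question describes, and on the checkpointing mentioned under D, a minimal PySpark sketch might look like the following. The table names, the checkpoint path, and the function name are hypothetical; the question does not say how the engineering team actually built the pipeline. The function expects an existing SparkSession to be passed in.

```python
# Hypothetical location; any durable storage path would do.
CHECKPOINT_PATH = "/tmp/checkpoints/gold_orders"


def start_gold_stream(spark, source_table="silver_orders",
                      target_table="gold_orders",
                      checkpoint_path=CHECKPOINT_PATH):
    """Start a micro-batch stream from a source table into a gold table.

    `spark` is an existing SparkSession. Table names are illustrative.
    """
    return (
        spark.readStream.table(source_table)            # read the source as a stream
        .writeStream
        .option("checkpointLocation", checkpoint_path)  # progress is recorded here for recovery
        .trigger(processingTime="1 minute")             # one micro-batch per minute
        .toTable(target_table)                          # starts the query and returns its handle
    )
```

The checkpoint location is what lets the query resume from where it left off after a failure, and the one-minute trigger matches the micro-batch cadence in the question. Neither setting affects how often the dashboard itself re-queries the gold table, which is where the compute cost in answer A comes from.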

Memory tip: In medallion architecture questions, remember Bronze = raw, Silver = validated, Gold = business-ready. When you see a question about gold tables and dashboards, the data quality and appropriateness concerns are already solved. Shift your thinking to operational concerns: continuous compute for frequent refresh cycles is a real cost driver.
