nerdexam
DatabricksDatabricks

CERTIFIED-DATA-ANALYST-ASSOCIATE · Question #53

CERTIFIED-DATA-ANALYST-ASSOCIATE Question #53: Real Exam Question with Answer & Explanation

The correct answer is A: A refresh schedule with an interval of 10 minutes or less. Setting a refresh schedule with an interval of 10 minutes or less directly controls how frequently the dashboard re-queries the gold-level tables - matching the stakeholder requirement means the dashboard will always reflect data no older than 10 minutes after it lands. Option B

Question

A data engineering team has created a Structured Streaming pipeline that processes data in micro- batches and populates gold-level tables. The microbatches are triggered every 10 minutes. A data analyst has created a dashboard based on this gold level data. The project stakeholders want to see the results in the dashboard updated within 10 minutes or less of new data becoming available within the gold-level tables. What is the ability to ensure the streamed data is included in the dashboard at the standard requested by the project stakeholders?

Options

  • AA refresh schedule with an interval of 10 minutes or less
  • BA refresh schedule with an always-on SQL Warehouse (formerly known as SQL Endpoint
  • CA refresh schedule with stakeholders included as subscribers
  • DA refresh schedule with a Structured Streaming cluster

Explanation

Setting a refresh schedule with an interval of 10 minutes or less directly controls how frequently the dashboard re-queries the gold-level tables - matching the stakeholder requirement means the dashboard will always reflect data no older than 10 minutes after it lands. Option B (always-on SQL Warehouse) reduces query startup latency but does nothing to control how often the dashboard refreshes; it would only matter if warehouse cold-start time were the bottleneck. Option C (adding subscribers) simply sends notifications when a refresh runs - it has no effect on the refresh cadence itself. Option D is a category error: Structured Streaming clusters are for processing pipelines, not for serving dashboard queries, which run against SQL Warehouses.

Memory tip: Match the refresh interval to the SLA - if stakeholders want data within X minutes, set the schedule to X minutes or less. Think of it as "the dashboard can only be as fresh as its refresh schedule allows."

Community Discussion

No community discussion yet for this question.

Full CERTIFIED-DATA-ANALYST-ASSOCIATE PracticeBrowse All CERTIFIED-DATA-ANALYST-ASSOCIATE Questions