You are designing a pipeline that publishes application events to a Pub/Sub topic. Although message ordering is not important, you need to be able to aggregate events across disjoint hourly intervals

Sign in or unlock PROFESSIONAL-DATA-ENGINEER to reveal the answer and full explanation for question #252. The question stem and answer options stay visible for context.

Submitted by minji_kr· Mar 30, 2026Designing data processing systems

Question

You are designing a pipeline that publishes application events to a Pub/Sub topic. Although message ordering is not important, you need to be able to aggregate events across disjoint hourly intervals before loading the results to BigQuery for analysis. What technology should you use to process and load this data to BigQuery while ensuring that it will scale with large volumes of events?

Options

ACreate a Cloud Function to perform the necessary data processing that executes using the Pub/Sub trigger every time a new message is published to the topic.
BSchedule a Cloud Function to run hourly, pulling all available messages from the Pub/Sub topic and performing the necessary aggregations.
CSchedule a batch Dataflow job to run hourly, pulling all available messages from the Pub/Sub topic and performing the necessary aggregations.
DCreate a streaming Dataflow job that reads continually from the Pub/Sub topic and performs aggregations using tumbling windows.

Unlock PROFESSIONAL-DATA-ENGINEER to see the answer

You've previewed enough free PROFESSIONAL-DATA-ENGINEER questions. Unlock PROFESSIONAL-DATA-ENGINEER for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Unlock PROFESSIONAL-DATA-ENGINEER - $49.99 / 30 days Sign in

Topics

#Pub/Sub#Dataflow#tumbling windows#streaming aggregation

Full PROFESSIONAL-DATA-ENGINEER Practice