CERTIFIED-DATA-ENGINEER-PROFESSIONAL · Question #23
CERTIFIED-DATA-ENGINEER-PROFESSIONAL Question #23: Real Exam Question with Answer & Explanation
The correct answer is D: Structured Streaming models new data arriving in a data stream as new rows appended to an. The key idea in Structured Streaming is to treat a live data stream as a table that is being continuously appended. This leads to a new stream processing model that is very similar to a batch processing model. You will express your streaming computation as standard batch-like que
Question
Which statement characterizes the general programming model used by Spark Structured Streaming?
Options
- AStructured Streaming leverages the parallel processing of GPUs to achieve highly parallel data
- BStructured Streaming is implemented as a messaging bus and is derived from Apache Kafka.
- CStructured Streaming uses specialized hardware and I/O streams to achieve sub-second latency
- DStructured Streaming models new data arriving in a data stream as new rows appended to an
- EStructured Streaming relies on a distributed network of nodes that hold incremental state values
Explanation
The key idea in Structured Streaming is to treat a live data stream as a table that is being continuously appended. This leads to a new stream processing model that is very similar to a batch processing model. You will express your streaming computation as standard batch-like query as on a static table, and Spark runs it as an incremental query on the unbounded input table. Let’s understand this model in more detail. https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html
Topics
Community Discussion
No community discussion yet for this question.