nerdexam
AmazonAmazon

SAA-C03 · Question #81

SAA-C03 Question #81: Real Exam Question with Answer & Explanation

The correct answer is B: Configure a transient cluster that runs the primary node and core nodes on On-Demand. For cost-effectiveness and high availability in Amazon EMR workloads, the best approach is to configure atransient cluster(which runs for the duration of the job and then terminates) withOn- Demand Instancesfor the primary and core nodes, andSpot Instancesfor the task nodes. Here

Submitted by andreas_gr· Mar 4, 2026Design Cost-Optimized Architectures

Question

A company has a large data workload that runs for 6 hours each day. The company cannot lose any data while the process is running. A solutions architect is designing an Amazon EMR cluster configuration to support this critical data workload. Which solution will meet these requirements MOST cost-effectively?

Options

  • AConfigure a long-running cluster that runs the primary node and core nodes on On-Demand
  • BConfigure a transient cluster that runs the primary node and core nodes on On-Demand
  • CConfigure a transient cluster that runs the primary node on an On-Demand Instance and the core
  • DConfigure a long-running cluster that runs the primary node on an On-Demand Instance, the core

Explanation

For cost-effectiveness and high availability in Amazon EMR workloads, the best approach is to configure atransient cluster(which runs for the duration of the job and then terminates) withOn- Demand Instancesfor the primary and core nodes, andSpot Instancesfor the task nodes. Here's Primary and core nodes on On-Demand Instances: These nodes are critical because they manage the cluster and store data on HDFS. Running them on On-Demand Instances ensures stability and that no data is lost, as Spot Instances can be interrupted. Task nodes on Spot Instances: Task nodes handle additional processing and can be used with Spot Instances to reduce costs. Spot Instances are much cheaper but can be interrupted, which is fine for non-critical tasks as the framework can handle retries. Atransient clusteris more cost-effective than a long-running cluster for workloads that only run for 6 hours a day. Transient clusters automatically terminate after the workload completes, saving costs by not keeping the cluster running when it's not needed. Option A: A long-running cluster may result in unnecessary costs when the cluster isn't being Option C: Running core nodes on Spot Instances risks data loss if the Spot Instances are interrupted, violating the requirement for zero data loss. Option D: Running both core and task nodes on Spot Instances is highly risky for data-critical

Community Discussion

No community discussion yet for this question.

Full SAA-C03 PracticeBrowse All SAA-C03 Questions