nerdexam
Amazon

SAP-C02 Question #125: Real Exam Question with Answer & Explanation

The correct answer is B: Store data in Amazon S3. Use the AWS Glue Data Catalog and Amazon Athena to query data. Redshift Spectrum needs a running Amazon Redshift cluster, which is expensive. The per-query cost is the same in Redshift Spectrum and Athena, although Athena is less suited to complex or highly parallel queries over large data.

Submitted by yousef_jo · Mar 6, 2026 · Continuous Improvement for Existing Solutions

Question

A company is collecting a large amount of data from a fleet of IoT devices. Data is stored as Optimized Row Columnar (ORC) files in the Hadoop Distributed File System (HDFS) on a persistent Amazon EMR cluster. The company's data analytics team queries the data by using SQL in Apache Presto deployed on the same EMR cluster. Queries scan large amounts of data, always run for less than 15 minutes, and run only between 5 PM and 10 PM. The company is concerned about the high cost associated with the current solution. A solutions architect must propose the most cost-effective solution that will allow SQL data queries. Which solution will meet these requirements?

Options

  • A. Store data in Amazon S3. Use Amazon Redshift Spectrum to query data.
  • B. Store data in Amazon S3. Use the AWS Glue Data Catalog and Amazon Athena to query data.
  • C. Store data in EMR File System (EMRFS). Use Presto in Amazon EMR to query data.
  • D. Store data in Amazon Redshift. Use Amazon Redshift to query data.
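
To make option B concrete, here is a minimal Python sketch of how ORC files in S3 might be exposed to Athena. It is an illustration under assumptions, not part of the exam material: the database `iot`, the table `device_readings`, its columns, and the bucket `example-iot-bucket` are all hypothetical. The helper only builds the DDL text and the arguments for boto3's `start_query_execution` call; it does not contact AWS.

```python
# Sketch only: the database, table, column, and bucket names are hypothetical.

def orc_table_ddl(database, table, columns, s3_location):
    """Build Athena DDL for an external table over ORC files in S3."""
    column_sql = ",\n  ".join(f"`{name}` {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS `{database}`.`{table}` (\n"
        f"  {column_sql}\n"
        f")\n"
        f"STORED AS ORC\n"
        f"LOCATION '{s3_location}'"
    )


def athena_request(sql, database, output_location):
    """Keyword arguments for boto3's Athena start_query_execution call."""
    return {
        "QueryString": sql,
        "QueryExecutionContext": {"Database": database},
        "ResultConfiguration": {"OutputLocation": output_location},
    }


ddl = orc_table_ddl(
    database="iot",
    table="device_readings",
    columns=[("device_id", "string"), ("reading", "double"), ("event_time", "timestamp")],
    s3_location="s3://example-iot-bucket/readings/",
)
request = athena_request(ddl, "iot", "s3://example-iot-bucket/athena-results/")
print(ddl)

# With AWS credentials configured, the request could be submitted like this:
#   import boto3
#   boto3.client("athena").start_query_execution(**request)
```

When Athena uses the AWS Glue Data Catalog as its metastore, a table created this way is typically registered there, so no cluster has to stay up to hold the schema.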

Explanation

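Option B is the most cost-effective choice. Athena is serverless and charges per query based on the data scanned, so the company pays nothing for compute during the 19 hours a day when no queries run. Athena reads ORC files directly from Amazon S3 using table definitions in the AWS Glue Data Catalog, and queries that finish in under 15 minutes fit within Athena's default 30-minute query timeout. Redshift Spectrum (option A) has the same per-query cost as Athena, but it needs a running Amazon Redshift cluster, which is expensive. Option C still requires an EMR cluster to run Presto, and option D requires a Redshift cluster with the data loaded into it. The trade-off is that Athena is less suited to complex or highly parallel queries over large data.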
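
The cost argument can be checked with rough arithmetic. The sketch below is only an illustration: the hourly cluster rate, the per-terabyte scan price, and the daily scan volume are assumed placeholder figures, not quoted AWS prices, and should be replaced with current pricing for the Region in use.

```python
# Rough monthly cost comparison. Every figure passed in below is an
# illustrative assumption, not a quoted AWS price.

def persistent_cluster_monthly_cost(hourly_rate_usd, days=30):
    """Cost of a cluster that is billed 24 hours a day, queried or not."""
    return hourly_rate_usd * 24 * days


def pay_per_scan_monthly_cost(tb_scanned_per_day, usd_per_tb, days=30):
    """Cost of a serverless engine that bills only for data scanned."""
    return tb_scanned_per_day * usd_per_tb * days


def idle_fraction(query_hours_per_day):
    """Share of each day during which a persistent cluster sits unused."""
    return (24 - query_hours_per_day) / 24


# Assumed inputs: a 10 USD/hour cluster, 2 TB scanned per day at 5 USD/TB,
# and the 5 PM to 10 PM query window from the question (5 hours).
cluster_cost = persistent_cluster_monthly_cost(hourly_rate_usd=10.0)
scan_cost = pay_per_scan_monthly_cost(tb_scanned_per_day=2.0, usd_per_tb=5.0)
idle = idle_fraction(query_hours_per_day=5)

print(f"Persistent cluster: {cluster_cost:.2f} USD/month")
print(f"Pay per scan:       {scan_cost:.2f} USD/month")
print(f"Cluster idle share: {idle:.0%}")
```

Under these assumed inputs the persistent cluster is billed for roughly 79% of the day without serving any queries, which is the waste that a pay-per-query service avoids. With a much larger daily scan volume the comparison could tip the other way, so the inputs matter.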
