An ecommerce company collects daily customer transaction logs in CSV format and stores the logs in Amazon S3. The company uses Amazon Athena to scan a subset of attributes from the logs on the same da

Sign in or unlock DEA-C01 to reveal the answer and full explanation for question #302. The question stem and answer options stay visible for context.

Data Ingestion and Transformation

Question

An ecommerce company collects daily customer transaction logs in CSV format and stores the logs in Amazon S3. The company uses Amazon Athena to scan a subset of attributes from the logs on the same day the company receives each log. Query times are increasing because of increasing transaction volume. The company wants to improve query performance. Which solution will meet these requirements with the SHORTEST query times?

Options

AConvert the CSV logs into multiple ORC files for better parallelism in Athena. Partition by date in
BConvert the CSV logs to JSON. Partition by date in Amazon S3. Use Athena with dynamic
CConvert the CSV logs to Avro. Partition by date in Amazon S3. Use Athena with projection-based
DConvert the CSV logs to a single Apache Parquet file for each day Partition the data by date in

Unlock DEA-C01 to see the answer

You've previewed enough free DEA-C01 questions. Unlock DEA-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Unlock DEA-C01 - $49.99 / 30 days Sign in

Topics

#Columnar Storage#Amazon Athena Performance#Query Optimization#Data Partitioning

Full DEA-C01 Practice