nerdexam
AmazonAmazon

DEA-C01 · Question #232

DEA-C01 Question #232: Real Exam Question with Answer & Explanation

Sign in or unlock DEA-C01 to reveal the answer and full explanation for question #232. The question stem and answer options stay visible for context.

Data Ingestion and Transformation

Question

A company wants to build a dimension table in an Amazon S3 bucket. The bucket contains historical data that includes 10 million records. The historical data is 1 TB in size. A data engineer needs a solution to update changes for up to 10,000 records in the base table every day. Which solution will meet this requirement with the LOWEST runtime?

Options

  • ADevelop an Apache Spark job in Amazon EMR to read the historical data and the new changes
  • BDevelop an AWS Glue Python job to read the historical data and new changes into two Pandas
  • CDevelop an AWS Glue Apache Spark job to read the historical data and new changes into two
  • DDevelop an Amazon EMR job to read new changes into Apache Spark DataFrames. Use the

Unlock DEA-C01 to see the answer

You've previewed enough free DEA-C01 questions. Unlock DEA-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Topics

#Data Lake Updates#Apache Spark#Amazon EMR#S3 Data Processing
Full DEA-C01 PracticeBrowse All DEA-C01 Questions