nerdexam
AmazonAmazon

MLA-C01 · Question #96

MLA-C01 Question #96: Real Exam Question with Answer & Explanation

Sign in or unlock MLA-C01 to reveal the answer and full explanation for question #96. The question stem and answer options stay visible for context.

Data Preparation for Machine Learning

Question

An ML engineer needs to merge and transform data from two sources to retrain an existing ML model. One data source consists of .csv files that are stored in an Amazon S3 bucket. Each .csv file consists of millions of records. The other data source is an Amazon Aurora DB cluster. The result of the merge process must be written to a second S3 bucket. The ML engineer needs to perform this merge-and-transform task every week. Which solution will meet these requirements with the LEAST operational overhead?

Options

  • ACreate a transient Amazon EMR cluster every week. Use the cluster to run an Apache Spark job
  • BCreate a weekly AWS Glue job that uses the Apache Spark engine. Use DynamicFrame native
  • CCreate an AWS Lambda function that runs Apache Spark code every week to merge and
  • DCreate an AWS Batch job that runs Apache Spark code on Amazon EC2 instances every week.

Unlock MLA-C01 to see the answer

You've previewed enough free MLA-C01 questions. Unlock MLA-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Topics

#AWS Glue#ETL#Apache Spark#Managed Data Processing
Full MLA-C01 PracticeBrowse All MLA-C01 Questions