nerdexam
AmazonAmazon

DEA-C01 · Question #142

DEA-C01 Question #142: Real Exam Question with Answer & Explanation

Sign in or unlock DEA-C01 to reveal the answer and full explanation for question #142. The question stem and answer options stay visible for context.

Data Ingestion and Transformation

Question

A company uploads .csv files to an Amazon S3 bucket. The company's data platform team has set up an AWS Glue crawler to perform data discovery and to create the tables and schemas. An AWS Glue job writes processed data from the tables to an Amazon Redshift database. The AWS Glue job handles column mapping and creates the Amazon Redshift tables in the Redshift database appropriately. If the company reruns the AWS Glue job for any reason, duplicate records are introduced into the Amazon Redshift tables. The company needs a solution that will update the Redshift tables without duplicates. Which solution will meet these requirements?

Options

  • AModify the AWS Glue job to copy the rows into a staging Redshift table. Add SQL commands to
  • BModify the AWS Glue job to load the previously inserted data into a MySQL database. Perform an
  • CUse Apache Spark's DataFrame dropDuplicates() API to eliminate duplicates. Write the data to
  • DUse the AWS Glue ResolveChoice built-in transform to select the value of the column from the

Unlock DEA-C01 to see the answer

You've previewed enough free DEA-C01 questions. Unlock DEA-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Topics

#AWS Glue#Amazon Redshift#Upsert#Data Deduplication
Full DEA-C01 PracticeBrowse All DEA-C01 Questions