nerdexam
AmazonAmazon

DEA-C01 · Question #248

DEA-C01 Question #248: Real Exam Question with Answer & Explanation

Sign in or unlock DEA-C01 to reveal the answer and full explanation for question #248. The question stem and answer options stay visible for context.

Data Ingestion and Transformation

Question

A company uses Amazon S3 and AWS Glue Data Catalog to manage a data lake that contains contact information for customers. The company uses PySpark and AWS Glue jobs with a DynamicFrame to run a workflow that processes data within the data lake. A data engineer notices that the workflow is generating errors as a result of how customer postal codes are stored in the data lake. Some postal codes include unnecessary numbers or invalid characters. The data engineer needs a solution to address the errors and correct the postal codes in the data lake.

Options

  • ACreate a schema definition for PySpark that matches the format the processing workflow requires
  • BUse AWS Glue workflow properties to allow job state sharing. Configure the AWS Glue jobs to
  • CConfigure the column.push_down_predicate setting and the catalogPartitionPredicate settings for
  • DSet the DynamicFrame additional_options parameter `useS3ListImplementation' to True.

Unlock DEA-C01 to see the answer

You've previewed enough free DEA-C01 questions. Unlock DEA-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Topics

#Data Quality#Data Transformation#AWS Glue#PySpark Schema
Full DEA-C01 PracticeBrowse All DEA-C01 Questions