DEA-C01 Question #248: Real Exam Question with Answer & Explanation
Question
A company uses Amazon S3 and AWS Glue Data Catalog to manage a data lake that contains contact information for customers. The company uses PySpark and AWS Glue jobs with a DynamicFrame to run a workflow that processes data within the data lake. A data engineer notices that the workflow is generating errors as a result of how customer postal codes are stored in the data lake. Some postal codes include unnecessary numbers or invalid characters. The data engineer needs a solution to address the errors and correct the postal codes in the data lake.
Options
- A. Create a schema definition for PySpark that matches the format the processing workflow requires
- B. Use AWS Glue workflow properties to allow job state sharing. Configure the AWS Glue jobs to
- C. Configure the column.push_down_predicate setting and the catalogPartitionPredicate settings for
- D. Set the DynamicFrame additional_options parameter `useS3ListImplementation` to True.
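
To make the problem in the question stem concrete, here is a minimal sketch in plain Python of how postal codes with "unnecessary numbers or invalid characters" might be detected and normalized. The question does not state the expected format. The five-digit US ZIP target (with an optional four-digit extension that is discarded), the `normalize_postal_code` name, and the repair policy are all assumptions made for illustration.

```python
import re
from typing import Optional

# Assumption: a valid value is a five-digit US ZIP code, optionally followed
# by a four-digit extension (ZIP+4). The question does not specify a format.
ZIP_PATTERN = re.compile(r"^(\d{5})(?:-?(\d{4}))?$")


def normalize_postal_code(raw: Optional[str]) -> Optional[str]:
    """Return the five-digit ZIP code, or None if the value cannot be repaired.

    Hypothetical helper: name and repair policy are illustrative only.
    """
    if raw is None:
        return None
    # Drop every character that is neither a digit nor a hyphen.
    cleaned = re.sub(r"[^0-9-]", "", raw)
    match = ZIP_PATTERN.match(cleaned)
    if match is None:
        # Wrong number of digits: leave the decision to the caller.
        return None
    # Keep only the five-digit part and discard any ZIP+4 extension.
    return match.group(1)
```

In an AWS Glue job, a function like this could be applied to each record, for example with a record-level transform on the DynamicFrame. Rows that return `None` would then need a separate handling rule, such as routing them to a review location.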