DEA-C01 · Question #44
DEA-C01 Question #44: Real Exam Question with Answer & Explanation
Sign in or unlock DEA-C01 to reveal the answer and full explanation for question #44. The question stem and answer options stay visible for context.
Question
A company receives a daily file that contains customer data in .xls format. The company stores the file in Amazon S3. The daily file is approximately 2 GB in size. A data engineer concatenates the column in the file that contains customer first names and the column that contains customer last names. The data engineer needs to determine the number of distinct customers in the file. Which solution will meet this requirement with the LEAST operational effort?
Options
- ACreate and run an Apache Spark job in an AWS Glue notebook. Configure the job to read the S3
- BCreate an AWS Glue crawler to create an AWS Glue Data Catalog of the S3 file. Run SQL
- CCreate and run an Apache Spark job in Amazon EMR Serverless to calculate the number of
- DUse AWS Glue DataBrew to create a recipe that uses the COUNT_DISTINCT aggregate function
Unlock DEA-C01 to see the answer
You've previewed enough free DEA-C01 questions. Unlock DEA-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.