MLS-C01 · Question #276
MLS-C01 Question #276: Real Exam Question with Answer & Explanation
Sign in or unlock MLS-C01 to reveal the answer and full explanation for question #276. The question stem and answer options stay visible for context.
Question
A data scientist is working on a forecast problem by using a dataset that consists of .csv files that are stored in Amazon S3. The files contain a timestamp variable in the following format: March 1st, 2020, 08:14pm There is a hypothesis about seasonal differences in the dependent variable. This number could be higher or lower for weekdays because some days and hours present varying values, so the day of the week, month, or hour could be an important factor. As a result, the data scientist needs to transform the timestamp into weekdays, month, and day as three separate variables to conduct an analysis. Which solution requires the LEAST operational overhead to create a new dataset with the added features?
Options
- ACreate an Amazon EMR cluster. Develop PySpark code that can read the timestamp variable as
- BCreate a processing job in Amazon SageMaker. Develop Python code that can read the
- CCreate a new flow in Amazon SageMaker Data Wrangler. Import the S3 file, use the Featurize
- DCreate an AWS Glue job. Develop code that can read the timestamp variable as a string,
Unlock MLS-C01 to see the answer
You've previewed enough free MLS-C01 questions. Unlock MLS-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.