MLS-C01 · Question #339
MLS-C01 Question #339: Real Exam Question with Answer & Explanation
Sign in or unlock MLS-C01 to reveal the answer and full explanation for question #339. The question stem and answer options stay visible for context.
Question
A data scientist receives a new dataset in .csv format and stores the dataset in Amazon S3. The data scientist will use the dataset to train a machine learning (ML) model. The data scientist first needs to identify any potential data quality issues in the dataset. The data scientist must identify values that are missing or values that are not valid. The data scientist must also identify the number of outliers in the dataset. Which solution will meet these requirements with the LEAST operational effort?
Options
- ACreate an AWS Glue job to transform the data from .csv format to Apache Parquet format. Use
- BLeave the dataset in .csv format. Use an AWS Glue crawler and Amazon Athena with appropriate
- CCreate an AWS Glue job to transform the data from .csv format to Apache Parquet format. Import
- DLeave the dataset in .csv format. Import the data into Amazon SageMaker Data Wrangler. Use
Unlock MLS-C01 to see the answer
You've previewed enough free MLS-C01 questions. Unlock MLS-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.