MLS-C01 · Question #222
MLS-C01 Question #222: Real Exam Question with Answer & Explanation
Sign in or unlock MLS-C01 to reveal the answer and full explanation for question #222. The question stem and answer options stay visible for context.
Question
A data scientist is working on a model to predict a company's required inventory stock levels. All historical data is stored in .csv files in the company's data lake on Amazon S3. The dataset consists of approximately 500 GB of data The data scientist wants to use SQL to explore the data before training the model. The company wants to minimize costs. Which option meets these requirements with the LEAST operational overhead?
Options
- ACreate an Amazon EMR cluster. Create external tables in the Apache Hive metastore,
- BUse AWS Glue to crawl the S3 bucket and create tables in the AWS Glue Data Catalog. Use
- CCreate an Amazon Redshift cluster. Use the COPY command to ingest the data from Amazon S3.
- DCreate an Amazon Redshift cluster. Create external tables in an external schema, referencing the
Unlock MLS-C01 to see the answer
You've previewed enough free MLS-C01 questions. Unlock MLS-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.