nerdexam
AmazonAmazon

MLS-C01 · Question #45

MLS-C01 Question #45: Real Exam Question with Answer & Explanation

The correct answer is B: Create an AWS Glue crawler to populate the AWS Glue Data Catalog. Then, author an AWS. AWS Glue is the correct answer because this option requires the least amount of setup and maintenance since it is serverless, and it does not require management of the infrastructure. A, C, and D are all solutions that can solve the problem, but require more steps for configurati

Data Engineering

Question

A company is setting up a system to manage all of the datasets it stores in Amazon S3. The company would like to automate running transformation jobs on the data and maintaining a catalog of the metadata concerning the datasets. The solution should require the least amount of setup and maintenance. Which solution will allow the company to achieve its goals?

Options

  • ACreate an Amazon EMR cluster with Apache Hive installed. Then, create a Hive metastore and a
  • BCreate an AWS Glue crawler to populate the AWS Glue Data Catalog. Then, author an AWS
  • CCreate an Amazon EMR cluster with Apache Spark installed. Then, create an Apache Hive
  • DCreate an AWS Data Pipeline that transforms the data. Then, create an Apache Hive metastore

Explanation

AWS Glue is the correct answer because this option requires the least amount of setup and maintenance since it is serverless, and it does not require management of the infrastructure. A, C, and D are all solutions that can solve the problem, but require more steps for configuration, and require higher operational overhead to run and maintain.

Topics

#AWS Glue#Data Catalog#ETL#Serverless Data Processing

Community Discussion

No community discussion yet for this question.

Full MLS-C01 PracticeBrowse All MLS-C01 Questions