PROFESSIONAL-CLOUD-DEVELOPER · Question #19
PROFESSIONAL-CLOUD-DEVELOPER Question #19: Real Exam Question with Answer & Explanation
The correct answer is D: Create a Cloud Dataproc cluster on Google Cloud Platform, and then migrate your Hadoop code. https://cloud.google.com/architecture/hadoop/hadoop-gcp-migration-overview Keeping your data in a persistent HDFS cluster using Dataproc is more expensive than storing your data in Cloud Storage, which is what we recommend, as explained later. Keeping data in an HDFS cluster also
Question
Your company is planning to migrate their on-premises Hadoop environment to the cloud. Increasing storage cost and maintenance of data stored in HDFS is a major concern for your company. You also want to make minimal changes to existing data analytics jobs and existing architecture. How should you proceed with the migration?
Options
- AMigrate your data stored in Hadoop to BigQuery.
- BCreate Compute Engine instances with HDD instead of SSD to save costs.
- CCreate a Cloud Dataproc cluster on Google Cloud Platform, and then migrate your Hadoop
- DCreate a Cloud Dataproc cluster on Google Cloud Platform, and then migrate your Hadoop code
Explanation
https://cloud.google.com/architecture/hadoop/hadoop-gcp-migration-overview Keeping your data in a persistent HDFS cluster using Dataproc is more expensive than storing your data in Cloud Storage, which is what we recommend, as explained later. Keeping data in an HDFS cluster also limits your ability to use your data with other Google Cloud products. Google Cloud includes Dataproc, which is a managed Hadoop and Spark environment. You can use Dataproc to run most of your existing jobs with minimal alteration, so you don't need to move away from all of the Hadoop tools you already know.
Topics
Community Discussion
No community discussion yet for this question.