PROFESSIONAL-DATA-ENGINEER · Question #229
PROFESSIONAL-DATA-ENGINEER Question #229: Real Exam Question with Answer & Explanation
Sign in or unlock PROFESSIONAL-DATA-ENGINEER to reveal the answer and full explanation for question #229. The question stem and answer options stay visible for context.
Question
You want to migrate an on-premises Hadoop system to Cloud Dataproc. Hive is the primary tool in use, and the data format is Optimized Row Columnar (ORC). All ORC files have been successfully copied to a Cloud Storage bucket. You need to replicate some data to the cluster's local Hadoop Distributed File System (HDFS) to maximize performance. What are two ways to start using Hive in Cloud Dataproc? (Choose two.)
Options
- ARun the gsutil utility to transfer all ORC files from the Cloud Storage bucket to HDFS.
- BRun the gsutil utility to transfer all ORC files from the Cloud Storage bucket to any node of the Dataproc cluster.
- CRun the gsutil utility to transfer all ORC files from the Cloud Storage bucket to the master node of the Dataproc cluster.
- DLeverage Cloud Storage connector for Hadoop to mount the ORC files as external Hive tables.
- ELoad the ORC files into BigQuery.
Unlock PROFESSIONAL-DATA-ENGINEER to see the answer
You've previewed enough free PROFESSIONAL-DATA-ENGINEER questions. Unlock PROFESSIONAL-DATA-ENGINEER for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.