PROFESSIONAL-DATA-ENGINEER · Question #357
Question
You created an analytics environment on Google Cloud so that your data scientist team can explore data without impacting the on-premises Apache Hadoop solution. The data in the on-premises Hadoop Distributed File System (HDFS) cluster is in Optimized Row Columnar (ORC) formatted files with multiple columns of Hive partitioning. The data scientist team needs to be able to explore the data in much the same way as they did on the on-premises HDFS cluster, using SQL on the Hive query engine. You need to choose the most cost-effective storage and processing solution. What should you do?
Options
- A. Import the ORC files to Bigtable tables for the data scientist team.
- B. Import the ORC files to BigQuery tables for the data scientist team.
- C. Copy the ORC files on Cloud Storage, then deploy a Dataproc cluster for the data scientist team.
- D. Copy the ORC files on Cloud Storage, then create external BigQuery tables for the data scientist team.
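For context on the mechanism the last option refers to, the following is a minimal sketch of what defining an external BigQuery table over Hive-partitioned ORC files in Cloud Storage might look like. It only builds the DDL text and does not run it. The helper name `external_orc_table_ddl`, the table name `analytics.events`, and the bucket path `gs://example-bucket/warehouse/events` are hypothetical placeholders, not taken from the question. The sketch assumes the files sit under `key=value` directories, so that `WITH PARTITION COLUMNS` without a column list can let BigQuery detect the partition keys from the path.

```python
def external_orc_table_ddl(table: str, uri_prefix: str) -> str:
    """Build BigQuery DDL for an external table over Hive-partitioned ORC files.

    Both arguments are illustrative placeholders (assumptions):
      table      -- "dataset.table" name of the external table to define
      uri_prefix -- Cloud Storage prefix that sits above the key=value folders
    """
    prefix = uri_prefix.rstrip("/")  # avoid a double slash before the wildcard
    return (
        f"CREATE EXTERNAL TABLE `{table}`\n"
        "WITH PARTITION COLUMNS\n"  # no column list: keys are detected from the paths
        "OPTIONS (\n"
        "  format = 'ORC',\n"
        f"  uris = ['{prefix}/*'],\n"
        f"  hive_partition_uri_prefix = '{prefix}'\n"
        ");"
    )


if __name__ == "__main__":
    # Hypothetical dataset, table, and bucket names.
    print(external_orc_table_ddl("analytics.events",
                                 "gs://example-bucket/warehouse/events"))
```

The generated statement could then be pasted into the BigQuery console or submitted through a client library. The data itself stays in Cloud Storage, and queries against the table read the ORC files in place.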