A company uses an Amazon EMR cluster with 50 nodes to process operational data and make the data available for data analysts. These jobs run nightly use Apache Hive with the Apache Jez framework as a

Sign in or unlock DAS-C01 to reveal the answer and full explanation for question #150. The question stem and answer options stay visible for context.

Processing

Question

A company uses an Amazon EMR cluster with 50 nodes to process operational data and make the data available for data analysts. These jobs run nightly use Apache Hive with the Apache Jez framework as a processing model and write results to Hadoop Distributed File System (HDFS) In the last few weeks, jobs are failing and are producing the following error message "File could only be replicated to 0 nodes instead of 1". A data analytics specialist checks the DataNode logs the NameNode logs and network connectivity for potential issues that could have prevented HDFS from replicating data. The data analytics specialist rules out these factors as causes for the issue. Which solution will prevent the jobs from failing'?

Options

AMonitor the HDFSUtilization metric. If the value crosses a user-defined threshold add task nodes
BMonitor the HDFSUtilization metric If the value crosses a user-defined threshold add core nodes
CMonitor the MemoryAllocatedMB metric. If the value crosses a user-defined threshold, add task
DMonitor the MemoryAllocatedMB metric. If the value crosses a user-defined threshold, add core

Unlock DAS-C01 to see the answer

You've previewed enough free DAS-C01 questions. Unlock DAS-C01 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Unlock DAS-C01 - $49.99 / 30 days Sign in

Topics

#EMR Troubleshooting#HDFS Replication#YARN Resource Management#Apache Tez

Full DAS-C01 Practice