CCA-410 Exam Questions
93 real CCA-410 exam questions with expert-verified answers and explanations. Page 1 of 2.
- Question #1
Your Hadoop cluster has 25 nodes with a total of 100 TB (4 TB per node) of raw disk space allocated HDFS storage. Assuming Hadoop's default configuration, how much data will you be...
- Question #2
You set up the Hadoop cluster using NameNode Federation. One NameNode manages the/users namespace and one NameNode manages the/data namespace. What happens when client tries to wri...
- Question #3
Identify two features/issues that MapReduce v2 (MRv2/YARN) is designed to address:
- Question #4
The most important consideration for slave nodes in a Hadoop cluster running production jobs that require short turnaround times is:
- Question #5
The failure of which daemon makes HDFS unavailable on a cluster running MapReduce v1 (MRv1)?
- Question #6
Cluster Summary 45 files and directories, 12 blocks = 57 total. Heap Size is 15.31 MB / 193.38MB(7%) Refer to the above screenshot. You configure the Hadoop cluster with seven Data...
- Question #7
You have cluster running with the FIFO Scheduler enabled. You submit a large job A to the cluster, which you expect to run for one hour. Then, you submit job B to cluster, which yo...
- Question #9
Your developers request that you enable them to use Hive on your Hadoop cluster. What do install and/or configure?
- Question #10
How must you format the underlying filesystem of your Hadoop cluster's slave nodes running on Linux?
- Question #11
Your cluster is running Map v1 (MRv1), with default replication set to 3, and a cluster blocks 64MB. Identify which best describes the file read process when a Client application c...
- Question #12
Identity four pieces of cluster information that are stored on disk on the NameNode?
- Question #13
On a cluster running MapReduce v1 (MRv1), the value of the mapred.tasktracker.map.tasks.maximum configuration parameter in the mapred-site.xml file should be set to:
- Question #14
Which command does Hadoop offer to discover missing or corrupt HDFS data?
- Question #15
What does CDH packaging do on install to facilitate Kerberos security setup?
- Question #16
Your Hadoop cluster contains nodes in three racks. Choose which scenario results if you leave the dfs.hosts property in the NameNode's configuration file empty (blank)?
- Question #17
What metadata is stored on a DataNode when a block is written to it?
- Question #18
Your cluster implements HDFS High Availability (HA). You two NameNodes are named nn01 and nn02. What occurs when you execute the command: Hdfs haadmin -failover nn01 nn02
- Question #19
You've configured your cluster with HDFS Federation. One NameNode manages the /data namesapace and another Name/Node manages the /reports namespace. How do you configure a client m...
- Question #21
Identify which two daemons typically run each slave node in a Hadoop cluster running MapReduce v1 (MRv1)
- Question #22
Identify four characteristics of a 300MB file that has been written to HDFS with block size of 128MB and all other Hadoop defaults unchanged?
- Question #23
For each job, the Hadoop framework generates task log files. Where are Hadoop's task log files stored?
- Question #24
Compare the hardware requirements of the NameNode with that of the DataNodes in a Hadoop cluster running MapReduce v1 (MRv1):
- Question #25
Which scheduler would you deploy to ensure that your cluster allows short jobs to finish within a reasonable time without starving long-running jobs?
- Question #26
Your Hadoop cluster has 12 slave nodes, a block size set to 64MB, and a replication factor of three. Choose which best describes how the Hadoop Framework distributes block writes i...
- Question #27
You has a cluster running with the Fail Scheduler enabled. There are currently no jobs running on the cluster you submit a job A, so that only job A is running on the cluster. A wh...
- Question #28
You are a Hadoop cluster with a NameNode on host mynamenode. What are two ways to determine available HDFS space in your cluster?
- Question #29
In the context of configuring a Hadoop cluster for HDFS High Availability (HA), `fencing' refers to:
- Question #30
You are planning a Hadoop duster, and you expect to be receiving just under 1TB of data per week which will be stored on the cluster, using Hadoop's default replication. You decide...
- Question #31
Under which scenario would it be most appropriate to consider using faster (e.g 10 Gigabit) Ethernet as the network fabric for your Hadoop cluster?
- Question #32
What determines the number of Reduces that run a given MapReduce job on a cluster running MapReduce v1 (MRv1)?
- Question #33
In HDFS, you view a file with rw-r--r-- set as its permissions. What does this tell you about the file?
- Question #34
Your cluster Mode size is set to 128MB. A client application (client application A) is writing a 500MB file to HDFS. After client application A has written 300MB of data, another c...
- Question #35
Your cluster has nodes in seven racks, and you have provided a rack topology script. What is Hadoop's block placement policy, assuming a block replication factor of three?
- Question #36
Which MapReduce v2 (MR2/YARN) daemon is a per-machine slave responsible for launching application containers and monitoring application resources usage?
- Question #37
What happens if a Mapper on one node goes into an infinite loop while running a MapReduce job?
- Question #38
Your cluster Mapreduce V1 (MVR1). What determines where blocks are return into HDFS client application?
- Question #39
You are running two Hadoop clusters(cluster1 and cluster2), they run identical versions of hadoop. You want to copy the data inside /home/foo/cluster1 to cluster2 into the director...
- Question #40
Using cloudera manager on CDH4 cluster running mapreduce V1(MRv1), you delete a tasktracker role instance from a host that also a runs a datanode role instance and a region server...
- Question #41
You have a cluster running 32 slave nodes and 3 master nodes running mapreduce V1 (MRv1). You execute the command: $ hadoop fsck / What four cluster conditions running this command...
- Question #42
Your running a hadoop cluster with a name node on the host mynamenode. What are two ways you can determine available HDFS space in your cluster?
- Question #43
Your developers request that you enable them to use pig on your hadoop cluster. What do you need to configure and / or install?
- Question #44
What is the best disk configuration for slave nodes in hadoop cluster where each node has 6x2TB drives?
- Question #45
MapReduce V2 (MRv2/YARN) splits which two major functions of the jobtracker into separate daemons?
- Question #46
What is the rule governing the formatting of the underlying filesystem in the hadoop cluster?
- Question #47
Your cluster implements hdfs high availability (HA) your two namenodes are named hadoop01 and hadoop02. What occurs when you execute the command: Sudo -u hdfs haadmin -failover had...
- Question #48
Your company stores user profile records in an OLTP database. You want to join these records with webserver logs. You have already ingested into the hadoop file system. What is the...
- Question #49
In the execution of a MapReduce job, where does the mapper place the intermediate data in each map task?
- Question #50
How many mappers required for a map reduce, job determine on a cluster running map reduce V1 (MRv1)?
- Question #51
What occurs when you run a hadoop job specifying output directory for job output which already exists in HDFS?
- Question #52
You have a cluster running with the fair scheduler enabled and configured. You submit multiple jobs to the cluster. Each job is assigned to a pool. What are the two key points to r...