CCA-500 Exam Questions

89 real CCA-500 exam questions with expert-verified answers and explanations. Page 1 of 2.

Question #1 (Configuration)
You need to analyze 60,000,000 images stored in JPEG format, each of which is approximately 25 KB. Because your Hadoop cluster isn't optimized for storing and processing many small...
SequenceFiles, small files problem, data serialization, MapReduce streaming
Question #2
Identify two features/issues that YARN is designed to address:
Question #3 (Monitoring and Logging)
Which YARN daemon or service monitors a Container's per-application resource usage (e.g., memory, CPU)?
YARN NodeManager, resource monitoring, container management, application resources
Question #4
Which is the default scheduler in YARN?
Question #5
Which YARN process runs as "container 0" of a submitted job and is responsible for resource requests?
Question #6
Which scheduler would you deploy to ensure that your cluster allows short jobs to finish within a reasonable time without starving long-running jobs?
Question #7 (Deployment)
Your cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. What is the result when you execute: hadoop jar SampleJar MyClass on a client machine?
YARN job submission, MapReduce execution, HDFS temporary directory, ApplicationMaster
Question #8
You are working on a project where you need to chain together MapReduce and Pig jobs. You also need the ability to use forks, decision points, and path joins. Which ecosystem project...
Question #9
Which process instantiates user code, and executes map and reduce tasks on a cluster running MapReduce v2 (MRv2) on YARN?
Question #10 (Monitoring and Logging)
Cluster Summary: 45 files and directories, 12 blocks = 57 total. Heap size is 15.31 MB / 193.38 MB (7%). Refer to the above screenshot. You configure a Hadoop cluster with seven DataNod...
DataNode monitoring, under-replicated blocks, cluster health, HDFS UI
Question #11 (Security)
Which two features does Kerberos security add to a Hadoop cluster?
Kerberos, RPC authentication, cluster security, central authentication
Question #12 (Deployment)
Assuming a cluster running HDFS and MapReduce version 2 (MRv2) on YARN with all settings at their defaults, what do you need to do when adding a new slave node to the cluster?
cluster scaling, slave node addition, DNS configuration, YARN defaults
Question #13
Which YARN daemon or service negotiates map and reduce Containers from the Scheduler, tracking their status and monitoring progress?
Question #14 (Resource Management)
During the execution of a MapReduce v2 (MRv2) job on YARN, where does the Mapper place the intermediate data of each Map Task?
intermediate data, Map task output, local filesystem, MapReduce shuffle
Question #15
You suspect that your NameNode is incorrectly configured, and is swapping memory to disk. Which Linux commands help you to identify whether swapping is occurring?
Question #16
On a cluster running CDH 5.0 or above, you use the hadoop fs -put command to write a 300MB file into a previously empty directory using an HDFS block size of 64 MB. Just after this...
Question #17 (Troubleshooting)
Which command does Hadoop offer to discover missing or corrupt HDFS data?
hdfs fsck, data integrity, corrupt blocks, HDFS tools
Question #18 (Deployment)
You are planning a Hadoop cluster and considering implementing 10 Gigabit Ethernet as the network fabric. Which workloads benefit the most from a faster network fabric?
10GbE networking, cluster planning, intermediate data, network bandwidth
Question #19 (Configuration)
Your cluster is running MapReduce version 2 (MRv2) on YARN. Your ResourceManager is configured to use the FairScheduler. Now you want to configure your scheduler such that a new us...
FairScheduler, YARN queue, auto queue creation, user queue
Question #20 (Configuration)
A slave node in your cluster has 4 TB hard drives installed (4 x 2TB). The DataNode is configured to store HDFS blocks on all disks. You set the value of the dfs.datanode.du.reserv...
dfs.datanode.du.reserved, disk reservation, DataNode storage, HDFS configuration
Question #21 (Configuration)
What two steps must you take if you are running a Hadoop cluster with a single NameNode and six DataNodes, and you want to change a configuration parameter so that it affects all...
DataNode configuration, daemon restart, configuration propagation, cluster management
Question #22 (Deployment)
You have installed a cluster running HDFS and MapReduce version 2 (MRv2) on YARN. You have no dfs.hosts entry(ies) in your hdfs-site.xml configuration file. You configure a new worker node...
dfs.hosts, DataNode admission, worker node, dfsadmin
Question #23 (Troubleshooting)
You use the hadoop fs -put command to add a file "sales.txt" to HDFS. This file is small enough that it fits into a single block, which is replicated to three nodes in your cluster...
HDFS replication, DataNode failure, under-replication, automatic re-replication
Question #24 (Resource Management)
Given: You want to clean up this list by removing jobs where the State is KILLED. What command do you enter?
YARN application, kill command, job management, application state
Question #25
Assume you have a file named foo.txt in your local directory. You issue the following three commands: hadoop fs -mkdir input hadoop fs -put foo.txt input/foo.txt hadoop fs -put foo...
Question #26 (Deployment)
You are configuring a Linux server running HDFS and MapReduce version 2 (MRv2) on YARN. How must you format the underlying file system of each DataNode?
DataNode filesystem, Linux filesystem, HDFS setup, storage format
Question #27 (Configuration)
You are migrating a cluster from MapReduce version 1 (MRv1) to MapReduce version 2 (MRv2) on YARN. You want to maintain your MRv1 TaskTracker slot capacities when you migrate. What...
MRv1 to MRv2 migration, NodeManager resources, YARN configuration, TaskTracker slots
Question #28 (Resource Management)
On a cluster running MapReduce v2 (MRv2) on YARN, a MapReduce job is given a directory of 10 plain text files as its input directory. Each file is made up of 3 HDFS blocks. How man...
Mapper count, input splits, HDFS blocks, MapReduce input
Question #29
You're upgrading a Hadoop cluster from HDFS and MapReduce version 1 (MRv1) to one running HDFS and MapReduce version 2 (MRv2) on YARN. You want to set and enforce version 1 (MRv1)...
Question #30
Your cluster has the following characteristics: - A rack aware topology is configured and on - Replication is set to 3 - Cluster block size is set to 64MB Which describes the file...
Question #31
Your Hadoop cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. Can you configure a worker node to run a NodeManager daemon but not a DataNode daemon and still...
Question #32 (Configuration)
You have a 20-node Hadoop cluster, with 18 slave nodes and 2 master nodes running HDFS High Availability (HA). You want to minimize the chance of data loss in your cluster. What sh...
HDFS HA, replication factor, data redundancy, NameNode
Question #33
You are running a Hadoop cluster with all monitoring facilities properly configured. Which scenario will go undetected?
Question #34 (Configuration)
You decide to create a cluster which runs HDFS in High Availability mode with automatic failover, using Quorum Storage. What is the purpose of ZooKeeper in such a configuration?
ZooKeeper, HDFS HA, automatic failover, Quorum Storage
Question #35 (Resource Management)
Choose three reasons why you should run the HDFS balancer periodically.
HDFS balancer, data locality, disk utilization, cluster performance
Question #36 (Configuration)
Your cluster implements HDFS High Availability (HA). Your two NameNodes are named nn01 and nn02. What occurs when you execute the command: hdfs haadmin -failover nn01 nn02?
HDFS HA, haadmin, NameNode failover, standby NameNode
Question #37
You have a Hadoop cluster running HDFS, and a gateway machine external to the cluster from which clients submit jobs. What do you need to do in order to run Impala on the cluster and submi...
Question #38
You have just run a MapReduce job to filter user messages to only those of a selected geographical region. The output for this job is in a directory named westUsers, located just b...
Question #39
In CDH4 and later, which file contains a serialized form of all the directory and file inodes in the filesystem, giving the NameNode a persistent checkpoint of the filesystem meta...
Question #40
You are running a Hadoop cluster with a NameNode on host mynamenode. What are two ways to determine available HDFS space in your cluster?
Question #41
You have recently converted your Hadoop cluster from a MapReduce 1 (MRv1) architecture to MapReduce 2 (MRv2) on YARN architecture. Your developers are accustomed to specifying map...
Question #42 (Configuration)
Your Hadoop cluster contains nodes in three racks. You have not configured the dfs.hosts property in the NameNode's configuration file. What results?
dfs.hosts, DataNode, cluster membership, NameNode configuration
Question #43
You are running a Hadoop cluster with MapReduce version 2 (MRv2) on YARN. You consistently see that MapReduce map tasks on your cluster are running slowly because of excessive garb...
Question #44
You have a cluster running with a FIFO scheduler enabled. You submit a large job A to the cluster, which you expect to run for one hour. Then, you submit job B to the cluster, whic...
Question #45
A user comes to you, complaining that when she attempts to submit a Hadoop job, it fails. There is a Directory in HDFS named /data/input. The Jar is named j.jar, and the driver cla...
Question #46
Your company stores user profile records in an OLTP database. You want to join these records with web server logs you have already ingested into the Hadoop file system. What is th...
Question #47
Which two are features of Hadoop's rack topology?
Question #48 (Configuration)
Your cluster's mapred-site.xml includes the following parameters: <name>mapreduce.map.memory.mb</name> <value>4096</value> <name>mapreduce.reduce.memory.mb</name> <value>8192</valu...
YARN memory, virtual memory, vmem-pmem-ratio, map task containers
Question #49
Assuming you're not running HDFS Federation, what is the maximum number of NameNode daemons you should run on your cluster in order to avoid a "split-brain" scenario with your Name...
Question #50 (Configuration)
Table schemas in Hive are:
Hive metastore, table schema, metadata storage
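
Several questions on this page (#1, #16, #28) rest on simple HDFS block arithmetic. The sketch below works through the figures visible in those questions. It is an illustration, not an answer key: the helper name `block_count` is hypothetical, sizes are treated as binary units (1 MB = 1024 × 1024 bytes), and the Question #28 figure assumes the common default of one input split per HDFS block for plain text input.

```python
KB = 1024
MB = 1024 * KB


def block_count(file_size_bytes: int, block_size_bytes: int) -> int:
    """Blocks needed to hold one file; the final block may be partially filled."""
    # Integer ceiling division avoids floating-point rounding.
    return -(-file_size_bytes // block_size_bytes)


# Question #16: a 300 MB file written with a 64 MB block size.
blocks_q16 = block_count(300 * MB, 64 * MB)
last_block_mb_q16 = 300 - (blocks_q16 - 1) * 64

# Question #28: 10 plain text files of 3 blocks each.
# Assumption: one input split (and so one map task) per block.
splits_q28 = 10 * 3

# Question #1: 60,000,000 images of roughly 25 KB each.
total_bytes_q1 = 60_000_000 * 25 * KB

print(blocks_q16, last_block_mb_q16)  # 5 44
print(splits_q28)                     # 30
print(total_bytes_q1)                 # 1536000000000
```

The Question #1 total is only about 1.4 TiB, yet stored as individual files it would mean 60,000,000 separate HDFS files, each far smaller than a block. That file count, not the data volume, is what the "small files problem" tag refers to.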


© 2026 NerdExam. All rights reserved. Built for IT professionals.

NerdExam is a trading name of WADL Solutions Limited, a company incorporated in Hong Kong (CR# 80143234). Registered office: Unit 2904-05, 29/F, Universal Trade Centre, 3 Arbuthnot Road, Central, Hong Kong.

CompTIA, AWS, Cisco, Microsoft, Google Cloud, Oracle, VMware, and other certification names referenced on this site are trademarks of their respective owners. NerdExam is not affiliated with, endorsed by, or sponsored by any certification vendor.