PROFESSIONAL-DATA-ENGINEER · Question #45
PROFESSIONAL-DATA-ENGINEER Question #45: Real Exam Question with Answer & Explanation
The correct answer is B: Rewrite the job in Apache Spark.. Explanation/Reference: Spark performs in-memory processing and faster, which results in optimization of job’s processing time.
Question
Your company has recently grown rapidly and now ingesting data at a significantly higher rate than it was previously. You manage the daily batch MapReduce analytics jobs in Apache Hadoop. However, the recent increase in data has meant the batch jobs are falling behind. You were asked to recommend ways the development team could increase the responsiveness of the analytics without increasing costs. What should you recommend they do?
Options
- ARewrite the job in Pig.
- BRewrite the job in Apache Spark.
- CIncrease the size of the Hadoop cluster.
- DDecrease the size of the Hadoop cluster but also rewrite the job in Hive.
Explanation
Explanation/Reference: Spark performs in-memory processing and faster, which results in optimization of job’s processing time.
Community Discussion
No community discussion yet for this question.