DATABRICKS-CERTIFIED-PROFESSIONAL-DATA-SCIENTIST Question #64: Real Exam Question with Answer & Explanation

Sign in or unlock DATABRICKS-CERTIFIED-PROFESSIONAL-DATA-SCIENTIST to reveal the answer and full explanation for question #64. The question stem and answer options stay visible for context.

Question

You are building a classifier off of a very high-dimensiona data set similar to shown in the image with 5000 variables (lots of columns, not that many rows). It can handle both dense and sparse input. Which technique is most suitable, and why?

Exhibit

Options

ALogistic regression with L1 regularization, to prevent overfitting
BNaive Bayes, because Bayesian methods act as regularlizers
Ck-nearest neighbors, because it uses local neighborhoods to classify examples
DRandom forest because it is an ensemble method

Unlock DATABRICKS-CERTIFIED-PROFESSIONAL-DATA-SCIENTIST to see the answer

You've previewed enough free DATABRICKS-CERTIFIED-PROFESSIONAL-DATA-SCIENTIST questions. Unlock DATABRICKS-CERTIFIED-PROFESSIONAL-DATA-SCIENTIST for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Unlock DATABRICKS-CERTIFIED-PROFESSIONAL-DATA-SCIENTIST - $49.99 / 30 days Sign in

Full DATABRICKS-CERTIFIED-PROFESSIONAL-DATA-SCIENTIST Practice