nerdexam
Cloudera

DS-200 · Question #43

DS-200 Question #43: Real Exam Question with Answer & Explanation

Sign in or unlock DS-200 to reveal the answer and full explanation for question #43. The question stem and answer options stay visible for context.

Question

You want to build a classification model to identify spam comments on a blog. You decide to use the words in the comment text as inputs to your model. Which criteria should you use when deciding which words to use as features in order to contribute to making the correct classification decision?

Options

  • AChoose words for your sample that are most correlated with the Spam label
  • BChoose wordsfor your sample thatoccur most frequently in the text
  • CChoose words, for your sample that have the largest mutual information with the spam label
  • DChoose words for your sample that are least correlated with the spam label

Unlock DS-200 to see the answer

You've previewed enough free DS-200 questions. Unlock DS-200 for full answers, explanations, the timed quiz mode, progress tracking, and the master PDF. Question stem and options stay visible so you can still see what's on the exam.

Full DS-200 Practice
You want to build a classification model to identify spam comments... | DS-200 Q#43 Answer | NerdExam