Amazon


MLS-C01 Question #53: Real Exam Question with Answer & Explanation


Data Engineering

Question

A Machine Learning Specialist is creating a new natural language processing application that processes a dataset comprised of 1 million sentences. The aim is to then run Word2Vec to generate embeddings of the sentences and enable different types of predictions. Here is an example from the dataset: "The quck BROWN FOX jumps over the lazy dog." Which of the following are the operations the Specialist needs to perform to correctly sanitize and prepare the data in a repeatable manner? (Choose three.)

Options

  • A. Perform part-of-speech tagging and keep the action verb and the nouns only.
  • B. Normalize all words by making the sentence lowercase.
  • C. Remove stop words using an English stopword dictionary.
  • D. Correct the typography on "quck" to "quick."
  • E. One-hot encode all words in the sentence.
  • F. Tokenize the sentence into words.
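
Three of the operations named in the options (lowercasing, tokenization, and stop-word removal) can be sketched in a few lines of Python. This is a minimal illustration of what those steps do to the example sentence, not a statement of which options are correct. The `preprocess` function, the regex tokenizer, and the tiny `STOP_WORDS` set are assumptions made for the example; a real pipeline would use a full English stopword dictionary.

```python
import re
from typing import List

# Tiny illustrative stop-word set (an assumption for this sketch;
# a real pipeline would load a complete English stopword dictionary).
STOP_WORDS = {"the", "a", "an", "over", "of", "and", "to", "in"}


def preprocess(sentence: str) -> List[str]:
    """Lowercase, tokenize, and drop stop words from one sentence."""
    # Normalize case so "BROWN" and "brown" map to the same token.
    lowered = sentence.lower()
    # Tokenize: keep runs of letters, digits, and apostrophes; punctuation is dropped.
    tokens = re.findall(r"[a-z0-9']+", lowered)
    # Remove stop words using the dictionary above.
    return [tok for tok in tokens if tok not in STOP_WORDS]


print(preprocess("The quck BROWN FOX jumps over the lazy dog."))
```

Because each step is a deterministic function of the input string, applying `preprocess` to every sentence in the dataset gives the same result on every run. Note that the misspelled token `quck` passes through unchanged, since no spelling correction is applied in this sketch.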


Topics

#NLP #Text Preprocessing #Word Embeddings #Data Preparation