nerdexam
MicrosoftMicrosoft

DP-100 · Question #126

DP-100 Question #126: Real Exam Question with Answer & Explanation

The correct answer is A: Penalize the classification. A: Try Penalized Models You can use the same algorithms but give them a different perspective on the problem. Penalized classification imposes an additional cost on the model for making classification mistakes on the minority class during training. These penalties can bias the mo

Design and prepare a machine learning solution

Question

You are building a binary classification model by using a supplied training set. The training set is imbalanced between two classes. You need to resolve the data imbalance. What are three possible ways to achieve this goal? Each correct answer presents a complete solution. NOTE: Each correct selection is worth one point.

Options

  • APenalize the classification
  • BResample the dataset using undersampling or oversampling
  • CNormalize the training feature set
  • DGenerate synthetic samples in the minority class
  • EUse accuracy as the evaluation metric of the model

Explanation

A: Try Penalized Models You can use the same algorithms but give them a different perspective on the problem. Penalized classification imposes an additional cost on the model for making classification mistakes on the minority class during training. These penalties can bias the model to pay more attention to the minority class. B: You can change the dataset that you use to build your predictive model to have more balanced This change is called sampling your dataset and there are two main methods that you can use to even-up the classes: Consider testing under-sampling when you have an a lot data (tens-or hundreds of thousands of instances or more) Consider testing over-sampling when you don't have a lot of data (tens of thousands of records or D: Try Generate Synthetic Samples A simple way to generate synthetic samples is to randomly sample the attributes from instances in the minority class. https://machinelearningmastery.com/tactics-to-combat-imbalanced-classes-in-your-machine- learning-dataset/

Topics

#Data Imbalance#Classification#Data Preprocessing#Model Training

Community Discussion

No community discussion yet for this question.

Full DP-100 PracticeBrowse All DP-100 Questions