nerdexam
AmazonAmazon

MLS-C01 · Question #251

MLS-C01 Question #251: Real Exam Question with Answer & Explanation

The correct answer is B: Create two different models for different sections of the data.. To improve the accuracy of a linear model whose residual plot shows a non-random pattern, the ML engineer should address the underlying non-linearity by considering multiple models for different data sections, applying non-linear data transformations, or using a non-linear model

Modeling

Question

A retail company wants to create a system that can predict sales based on the price of an item. A machine learning (ML) engineer built an initial linear model that resulted in the following residual plot: Which actions should the ML engineer take to improve the accuracy of the predictions in the next phase of model building? (Choose three.)

Options

  • ADownsample the data uniformly to reduce the amount of data.
  • BCreate two different models for different sections of the data.
  • CDownsample the data in sections where Price < 50.
  • DOffset the input data by a constant value where Price > 50.
  • EExamine the input data, and apply non-linear data transformations where appropriate.
  • FUse a non-linear model instead of a linear model.

Explanation

To improve the accuracy of a linear model whose residual plot shows a non-random pattern, the ML engineer should address the underlying non-linearity by considering multiple models for different data sections, applying non-linear data transformations, or using a non-linear model entirely.

Common mistakes.

  • A. Downsampling data uniformly reduces the available information for training and is unlikely to address issues like non-linearity or heteroscedasticity visible in a residual plot.
  • C. Downsampling data only in specific sections reduces valuable information for those areas and does not inherently resolve the model's inability to capture the correct relationship.
  • D. Offsetting input data by a constant value is a linear transformation and will not resolve non-linear patterns or heteroscedasticity evident in the residual plot.

Concept tested. Residual analysis, improving model fit, non-linear modeling

Reference. https://scikit-learn.org/stable/modules/linear_model.html

Topics

#Residual Analysis#Model Diagnostics#Non-linear Modeling#Feature Transformation

Community Discussion

No community discussion yet for this question.

Full MLS-C01 PracticeBrowse All MLS-C01 Questions