DP-100 · Question #152
DP-100 Question #152: Real Exam Question with Answer & Explanation
The correct answer is A: featurization = 'auto'. {"question_number": 3, "correct_answer": "A", "explanation": "Setting featurization='auto' in the AutoMLConfig class instructs Azure AutoML to automatically apply featurization steps to the training data before model training begins. This includes imputing missing values using st
Question
You plan to use automated machine learning to train a regression model. You have data that has features which have missing values, and categorical features with few distinct values. You need to configure automated machine learning to automatically impute missing values and encode categorical features as part of the training task. Which parameter and value pair should you use in the AutoMLConfig class?
Options
- Afeaturization = 'auto'
- Benable_voting_ensemble = True
- Ctask = 'classification'
- Dexclude_nan_labels = True
- Eenable_tf = True
Explanation
{"question_number": 3, "correct_answer": "A", "explanation": "Setting featurization='auto' in the AutoMLConfig class instructs Azure AutoML to automatically apply featurization steps to the training data before model training begins. This includes imputing missing values using statistical strategies (mean, median, or mode depending on the feature) and encoding categorical features (e.g., one-hot encoding for low-cardinality columns). Option B (enable_voting_ensemble) creates an ensemble of models at the end of a run but does not handle data preprocessing. Option C (task='classification') changes the ML task type from regression to classification, which is incorrect for the stated goal. Option D (exclude_nan_labels) drops rows where the target label is NaN, which is unrelated to feature imputation. Option E (enable_tf) enables TensorFlow-based models and has no effect on featurization.", "generated_by": "claude-sonnet", "llm_judge_score": 4}
Topics
Community Discussion
No community discussion yet for this question.