DA0-001 · Question #342
DA0-001 Question #342: Real Exam Question with Answer & Explanation
The correct answer is C: Outlying. When an analyst conducts measures of central tendency during data cleansing, they are primarily attempting to identify outlying data points. Outliers can significantly skew central tendency measures like the mean, and identifying them is crucial for maintaining data quality.
Question
During data cleansing, an analyst conducts measures of central tendency on a data set. Which of the following data is the analyst attempting to identify?
Options
- ADuplicate
- BMissing
- COutlying
- DInvalid
Explanation
When an analyst conducts measures of central tendency during data cleansing, they are primarily attempting to identify outlying data points. Outliers can significantly skew central tendency measures like the mean, and identifying them is crucial for maintaining data quality.
Common mistakes.
- A. Duplicate data is typically identified using unique identifiers or direct comparison, not primarily through measures of central tendency.
- B. Missing data is usually identified by checking for null values or gaps, not by calculating central tendency measures.
- D. Invalid data refers to data that does not conform to defined rules or formats, which is checked by validation rules rather than central tendency measures.
Concept tested. Data cleansing, central tendency, outlier detection
Reference. https://learn.microsoft.com/en-us/azure/machine-learning/concept-data-quality-issues
Topics
Community Discussion
No community discussion yet for this question.