Cloudera
DS-200 Question #24: Real Exam Question with Answer & Explanation
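The correct answer is B: distributing the updates of the cluster centroids. Each iteration of Lloyd's algorithm assigns every point to its nearest centroid and then recomputes each centroid as the mean of its assigned points. Both steps decompose over the data, so Hadoop can spread the work across the cluster. Distribution does not change the number of iterations needed to converge, the numerical stability of the computation, or the metric space.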
Question
In what way can Hadoop be used to improve the performance of Lloyd's algorithm for k-means clustering on large data sets?
Options
- A. Parallelizing the centroid computations to improve numerical stability
- B. Distributing the updates of the cluster centroids
- C. Reducing the number of iterations required for the centroids to converge
- D. Mapping the input data into a non-Euclidean metric space
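To make option B concrete, here is a minimal sketch of one Lloyd iteration split into a map step and a reduce step. It runs in a single Python process and does not use any Hadoop API; the function names (`nearest`, `map_phase`, `reduce_phase`, `lloyd_iteration`) and the idea of passing input splits as plain lists are assumptions made for illustration only. On a real cluster, each split would be handled by a separate mapper and the per-cluster sums would be combined by reducers.

```python
from collections import defaultdict


def nearest(point, centroids):
    # Index of the centroid with the smallest squared Euclidean distance.
    return min(
        range(len(centroids)),
        key=lambda i: sum((p - c) ** 2 for p, c in zip(point, centroids[i])),
    )


def map_phase(split, centroids):
    # Mapper (hypothetical): emit (cluster_id, (point, 1)) for each point
    # in one input split. Splits are independent, so this step parallelizes.
    for point in split:
        yield nearest(point, centroids), (point, 1)


def reduce_phase(pairs, centroids):
    # Reducer (hypothetical): sum the vectors and counts per cluster,
    # then divide to get the new centroid.
    sums = {}
    counts = defaultdict(int)
    for cid, (vec, n) in pairs:
        if cid in sums:
            sums[cid] = [a + b for a, b in zip(sums[cid], vec)]
        else:
            sums[cid] = list(vec)
        counts[cid] += n
    # A cluster that received no points keeps its previous centroid.
    return [
        tuple(s / counts[i] for s in sums[i]) if i in sums else centroids[i]
        for i in range(len(centroids))
    ]


def lloyd_iteration(splits, centroids):
    # One full iteration: map over every split, then reduce.
    pairs = (pair for split in splits for pair in map_phase(split, centroids))
    return reduce_phase(pairs, centroids)


if __name__ == "__main__":
    splits = [[(0.0, 0.0), (0.0, 2.0)], [(10.0, 10.0), (10.0, 12.0)]]
    print(lloyd_iteration(splits, [(1.0, 1.0), (9.0, 9.0)]))
```

The sketch still needs as many iterations as the single-machine algorithm; what gets distributed is the work inside each iteration, which is why B is correct and C is not.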