Cloudera

DS-200 Question #24: Real Exam Question with Answer & Explanation

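The correct answer is B: distributing the updates of the cluster centroids. Each iteration of Lloyd's algorithm assigns every point to its nearest centroid and then recomputes each centroid as the mean of its assigned points. Both steps are sums over independent points, so Hadoop can spread them across mappers and merge the partial results in reducers. This shortens each iteration on large data sets; it does not change numerical stability (A), the number of iterations needed to converge (C), or the distance metric (D).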
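The distributed centroid update in answer B can be illustrated with a minimal, in-process sketch. It does not use any Hadoop API: the function names (map_split, reduce_cluster, lloyd_iteration, kmeans) and the list-of-splits input are assumptions made for illustration, and a real job would run each split on a different node and repeat the job once per iteration.

```python
from collections import defaultdict

def nearest(point, centroids):
    """Index of the centroid closest to point (squared Euclidean distance)."""
    return min(range(len(centroids)),
               key=lambda i: sum((p - c) ** 2 for p, c in zip(point, centroids[i])))

def map_split(points, centroids):
    """Hypothetical mapper: one (sum vector, count) pair per cluster for this split."""
    partial = {}
    for point in points:
        k = nearest(point, centroids)
        total, count = partial.get(k, ([0.0] * len(point), 0))
        partial[k] = ([t + p for t, p in zip(total, point)], count + 1)
    return list(partial.items())

def reduce_cluster(values):
    """Hypothetical reducer: merge partial sums for one cluster into its new centroid."""
    total, count = [0.0] * len(values[0][0]), 0
    for vec, n in values:
        total = [t + v for t, v in zip(total, vec)]
        count += n
    return [t / count for t in total]

def lloyd_iteration(splits, centroids):
    """One iteration: map every split, group by cluster id, reduce to new centroids."""
    grouped = defaultdict(list)
    for split in splits:  # on a cluster, splits would be processed in parallel
        for k, value in map_split(split, centroids):
            grouped[k].append(value)
    # A cluster that received no points keeps its previous centroid.
    return [reduce_cluster(grouped[k]) if k in grouped else list(centroids[k])
            for k in range(len(centroids))]

def kmeans(splits, centroids, max_iter=100, tol=1e-9):
    """Repeat iterations until no centroid moves by more than tol (squared distance)."""
    for _ in range(max_iter):
        updated = lloyd_iteration(splits, centroids)
        shift = max(sum((a - b) ** 2 for a, b in zip(u, c))
                    for u, c in zip(updated, centroids))
        centroids = updated
        if shift <= tol:
            break
    return centroids

# Two input splits, two initial centroids (illustrative data only).
splits = [[(0.0, 0.0), (0.0, 2.0)], [(10.0, 10.0), (10.0, 12.0)]]
final_centroids = kmeans(splits, [(1.0, 1.0), (9.0, 9.0)])
```

Because each mapper emits only a sum and a count per cluster, the centroids come out the same however the points are divided into splits. That is why the work distributes cleanly, and also why the number of iterations stays the same.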

Question

In what way can Hadoop be used to improve the performance of Lloyd's algorithm for k-means clustering on large data sets?

Options

  • A. Parallelizing the centroid computations to improve numerical stability
  • B. Distributing the updates of the cluster centroids
  • C. Reducing the number of iterations required for the centroids to converge
  • D. Mapping the input data into a non-Euclidean metric space
