Reproducibility Assessment of K-Means Clustering and Applications (K-평균 군집화의 재현성 평가 및 응용)

  • Publisher The Korean Statistical Society (한국통계학회)
  • Publication year 2004
  • Series type Journal
  • UCI G704-000408.2004.17.1.011
  • KCI ID ART000948023

Abstract

We propose a procedure for assessing the reproducibility (validity) of K-means cluster analysis. The data set is randomly partitioned into three parts: two subsets are used to develop clustering rules, and the third is used to test the consistency of those rules. As an alternative to the Rand index and the corrected Rand index, we also propose an entropy-based consistency measure between two clustering rules and apply it to determining the number of clusters in K-means clustering.
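
The three-way split described in the abstract might be sketched as follows. This is an illustration only, not the authors' implementation: the helper names (`kmeans`, `assign`, `rand_index`, `entropy_consistency`, `reproducibility`) are hypothetical, the K-means routine is a plain Lloyd iteration written with NumPy, and the entropy score shown (one minus the normalized conditional entropy of one labeling given the other) is an assumed stand-in, since the abstract does not define the paper's measure.

```python
import numpy as np


def kmeans(X, k, n_iter=50, seed=0):
    """Plain Lloyd iteration; returns the k cluster centers (the 'clustering rule')."""
    rng = np.random.default_rng(seed)
    # Initialize centers at k distinct data points (fancy indexing returns a copy).
    centers = X[rng.choice(len(X), size=k, replace=False)].astype(float)
    for _ in range(n_iter):
        labels = assign(X, centers)
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:  # keep the old center if a cluster is empty
                centers[j] = members.mean(axis=0)
    return centers


def assign(X, centers):
    """Apply a clustering rule: label each row of X by its nearest center."""
    sq_dist = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return sq_dist.argmin(axis=1)


def rand_index(a, b):
    """Fraction of point pairs on which two labelings agree (same/different cluster)."""
    a, b = np.asarray(a), np.asarray(b)
    same_a = a[:, None] == a[None, :]
    same_b = b[:, None] == b[None, :]
    upper = np.triu_indices(len(a), k=1)  # each unordered pair once
    return float(np.mean(same_a[upper] == same_b[upper]))


def entropy_consistency(a, b, k):
    """ASSUMED measure, not the paper's: 1 - H(b | a) / log(k)."""
    if k < 2:
        return 1.0
    table = np.zeros((k, k))
    np.add.at(table, (np.asarray(a), np.asarray(b)), 1)  # contingency counts
    p = table / table.sum()
    row = p.sum(axis=1, keepdims=True)
    mask = p > 0
    cond_entropy = -np.sum(p[mask] * np.log(p[mask] / np.broadcast_to(row, p.shape)[mask]))
    return float(1.0 - cond_entropy / np.log(k))


def reproducibility(X, k, seed=0):
    """Split X into three random parts: two to build rules, one to compare them."""
    rng = np.random.default_rng(seed)
    part_a, part_b, part_test = np.array_split(rng.permutation(len(X)), 3)
    centers_a = kmeans(X[part_a], k, seed=seed)
    centers_b = kmeans(X[part_b], k, seed=seed + 1)
    labels_a = assign(X[part_test], centers_a)
    labels_b = assign(X[part_test], centers_b)
    return rand_index(labels_a, labels_b), entropy_consistency(labels_a, labels_b, k)
```

Under these assumptions, one way to use the sketch for choosing the number of clusters would be to evaluate `reproducibility(X, k)` over a range of `k` values (ideally averaged over several random splits) and prefer values of `k` whose two rules label the test subset most consistently.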
