Evaluation of Clustering Validity

Rudhwan Sideek; Ghaydaa Al-Talib

doi:10.33899/csmj.2008.163987

مجلد 5 عدد 2 (2008), Articles

مجلد 5 عدد 2 (2008)

Evaluation of Clustering Validity

Articles

https://doi.org/10.33899/csmj.2008.163987

منشور 2008-12-01

Rudhwan Sideek
Ghaydaa Al-Talib

Rudhwan Sideek

Ghaydaa Al-Talib

PDF (الإنجليزية)

الكلمات المفتاحية

Data Mining
K_Means
S_Dbw
SD

الملخص

Clustering is a mostly unsupervised procedure and the majority of the clustering algorithms depend on certain assumptions in order to define the subgroups present in a data set. As a consequence, in most applications the resulting clustering scheme requires some sort of evaluation as regards its validity. In this paper, we present a clustering validity procedure, which evaluates the results of clustering algorithms on data sets. We define a validity indexes, S_Dbw & SD, based on well-defined clustering criteria enabling the selection of the optimal input parameters values for a clustering algorithm that result in the best partitioning of a data set. We evaluate the reliability of our indexes experimentally, considering clustering algorithm (K_Means) on real data sets. Our approach is performed favorably in finding the correct number of clusters fitting a data set.

https://doi.org/10.33899/csmj.2008.163987

PDF (الإنجليزية)

مجلة الرافدین لعلوم الحاسوب والریاضیات

Evaluation of Clustering Validity

الكلمات المفتاحية

الملخص

الاشتراك في النشرة الإخبارية

اشترك في النشرة الإخبارية لدينا للحصول على الأخبار والتحديثات الهامة