files/journal/2022-09-02_12-54-44-000000_354.png

Journal of Engineering and Applied Sciences

ISSN: Online 1818-7803
ISSN: Print 1816-949x
106
Views
1
Downloads

Integrated Bisect K-Means and Firefly Algorithm for Hierarchical Text Clustering

Athraa Jasim Mohammed, Yuhanis Yusof and Husniza Husni
Page: 522-527 | Received 21 Sep 2022, Published online: 21 Sep 2022

Full Text Reference XML File PDF File

Abstract

Hierarchical text clustering plays a significant role in systematically browsing, summarizing and organizing documents into structure manner. However, the Bisect K-means which is a well-known hierarchical clustering algorithm is only able to generate local optimal solutions due to the employment of K-means as part of its process. In this study, we propose to replace the K-means with firefly algorithm, hence producing a Bisect FA for hierarchical clustering. At each level of the proposed Bisect FA, firefly algorithm works to produce the best clusters. For evaluation purposes, we performed experiments on 20 newsgroups dataset that is commonly used in text clustering studies. The results demonstrate that Bisect FA obtains more accurate and compact clustering than Bisect K-means, K-means and C-firefly algorithms. Such a result indicates that the proposed Bisect FA is a competitive algorithm for unsupervised learning.


How to cite this article:

Athraa Jasim Mohammed, Yuhanis Yusof and Husniza Husni. Integrated Bisect K-Means and Firefly Algorithm for Hierarchical Text Clustering.
DOI: https://doi.org/10.36478/jeasci.2016.522.527
URL: https://www.makhillpublications.co/view-article/1816-949x/jeasci.2016.522.527