files/journal/2022-09-02_12-54-44-000000_354.png

Journal of Engineering and Applied Sciences

ISSN: Online 1818-7803
ISSN: Print 1816-949x
89
Views
1
Downloads

Features Selection from Data in Order to Improve Classification Methods Performance

Reyhaneh Khademi and Mahdi Afzali
Page: 1859-1865 | Received 21 Sep 2022, Published online: 21 Sep 2022

Full Text Reference XML File PDF File

Abstract

Web pages classification is one of the main and challenging subjects in the field of data mining. Web page classification knowledge helps users to obtain useful information from massive data sets on the Internet automatically and efficiently. Many efforts have been made by researchers for web page classification, however, there is still opportunity to improve current approaches. Source of one of the main challenges in the educational categories is that the current data set is unbalanced. Because the size of pages in one subject is not the same with the other subject and its distribution is not uniform. Standard machine learning algorithms are influenced by main and big classes (groups) and secondary groups are ignored so accuracy standard for grouping is reduced. In this research, for solving this problem and for grouping web page a new approach based on collective grouping of support vector machine is proposed. To reduce and select features, principal components analysis and independent component analysis tools have been used respectively. Results show that proposed methods in better than other methods (which are widely used on web pages categories).


How to cite this article:

Reyhaneh Khademi and Mahdi Afzali. Features Selection from Data in Order to Improve Classification Methods Performance.
DOI: https://doi.org/10.36478/jeasci.2016.1859.1865
URL: https://www.makhillpublications.co/view-article/1816-949x/jeasci.2016.1859.1865