Evaluation of students performance is inevitable in any educational setting, allocating scores to students response is a function of how close the answer supplied to the question is to expected answer. This study delves into analyzing the effectiveness of cosine similarity measure and Euclidean distance which are both used in similarity measures for Automated Essay Type Grading System (AETGS). AETGS involves transcription of the contents of the marking schemes into electronic form to derive a txt file extension using text editor while students answers assumed txt format. The inherent stopwords and stemming in the txt document were pre-processed to address morphological variations using standard stopwords list and porters stemmer algorithm, respectively. N-gram terms were derived for each students response and the Marking Schemes (MS) using the vector space model. A Document Term Matrix (DTM) was generated with N-gram terms of MS and students response representing columns and rows, respectively. Modified principal component analysis algorithm was used to reduce the sparseness of the DTM to obtain a vector representation of the students answers and the marking scheme. The reduced vector representation of the students answers was graded according to the mark assigned to each question in the marking scheme using cosine similarity measure and the Euclidean distance measure. The developed Automated Essay-Type Grading System (AETGS) was implemented in Matrix Laboratory 8.1 (R2013a). The effect of the similarity measures on the developed system was performed using Pearson Correlation coefficient of two courses: CMP401-Organization of Programming Languages and CMP205-Operating System I. The result showed that cosine similarity measure has a high positive correlation than the Euclidean distance.
Olumide O. Obe, Adeleye S. Falohun, Ibrahim A. Adeyanju and Odunayo. E. Oduntan. A Comparative Analysis of Euclidean Distance and Cosine Similarity
Measure for Automated Essay-Type Grading.
DOI: https://doi.org/10.36478/jeasci.2018.4198.4204
URL: https://www.makhillpublications.co/view-article/1816-949x/jeasci.2018.4198.4204