TY - GEN
T1 - To What Extent Can Text Classification Help with Making Inferences About Students’ Understanding
AU - Beaumont, A. J.
AU - Al-Shaghdari, T.
PY - 2020/1/3
Y1 - 2020/1/3
N2 - In this paper we apply supervised machine learning algorithms to automatically classify the text of students' reflective learning journals from an introductory Java programming module with the aim of identifying students who need help with their understanding of the topic they are reflecting on. Such a system could alert teaching staff to students who may need an intervention to support their learning. Several different classifier algorithms have been validated on the training data set to find the best model in two situations; with equal cost for a positive or negative classification and with cost sensitive classification. Methods were used to identify those individual parameters which maximise the performance of each algorithm. Precision, recall and F1-score, as well as confusion matrices were used to understand the behaviour of each classifier and choose the one with the best performance.The classifiers that obtained the best results from the validation were then evaluated on a testing data set containing different data to that used for training. We believe that although the results could be improved with further work, our initial results show that machine learning could be applied to students' reflective writing to assist staff in identifying those students who are struggling to understand the topic.
AB - In this paper we apply supervised machine learning algorithms to automatically classify the text of students' reflective learning journals from an introductory Java programming module with the aim of identifying students who need help with their understanding of the topic they are reflecting on. Such a system could alert teaching staff to students who may need an intervention to support their learning. Several different classifier algorithms have been validated on the training data set to find the best model in two situations; with equal cost for a positive or negative classification and with cost sensitive classification. Methods were used to identify those individual parameters which maximise the performance of each algorithm. Precision, recall and F1-score, as well as confusion matrices were used to understand the behaviour of each classifier and choose the one with the best performance.The classifiers that obtained the best results from the validation were then evaluated on a testing data set containing different data to that used for training. We believe that although the results could be improved with further work, our initial results show that machine learning could be applied to students' reflective writing to assist staff in identifying those students who are struggling to understand the topic.
UR - http://www.scopus.com/inward/record.url?scp=85078463688&partnerID=8YFLogxK
UR - https://link.springer.com/chapter/10.1007%2F978-3-030-37599-7_31
U2 - 10.1007/978-3-030-37599-7_31
DO - 10.1007/978-3-030-37599-7_31
M3 - Conference publication
AN - SCOPUS:85078463688
SN - 9783030375980
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 372
EP - 383
BT - Machine Learning, Optimization, and Data Science - 5th International Conference, LOD 2019, Proceedings
A2 - Nicosia, Giuseppe
A2 - Pardalos, Panos
A2 - Umeton, Renato
A2 - Giuffrida, Giovanni
A2 - Sciacca, Vincenzo
PB - Springer
T2 - 5th International Conference on Machine Learning, Optimization, and Data Science, LOD 2019
Y2 - 10 September 2019 through 13 September 2019
ER -