Large Language Model-Driven Dynamic Assessment of Grammatical Accuracy in English Language Learner Writing

  • Timur Jaganov
  • John Blake
  • Julián Villegas
  • Nicholas Carr

Research output: Contribution to journal › Article › peer-review

Abstract

This study investigates the potential for Large Language Models (LLMs) to scale up Dynamic Assessment (DA). To facilitate this investigation, we first developed DynaWrite, a modular, microservices-based grammatical tutoring application that supports multiple LLMs to generate dynamic feedback for learners of English. Initial testing of 21 LLMs revealed GPT-4o and neural chat to have the most potential to scale up DA in the language learning classroom. Further testing of these two candidates found that both models performed similarly in their ability to accurately identify grammatical errors in user sentences. However, GPT-4o consistently outperformed neural chat in the quality of its DA by generating clear, consistent, and progressively explicit hints. Real-time responsiveness and system stability were also confirmed through detailed performance testing, with GPT-4o exhibiting sufficient speed and stability. This study shows that LLMs can be used to scale up dynamic assessment and thus enable it to be delivered to larger groups than is possible in traditional teacher-learner settings.
Original language: English
Pages (from-to): 151538-151550
Journal: IEEE Access
Volume: 13
Early online date: 27 Aug 2025
Publication status: Published - 3 Sept 2025

Bibliographical note

Copyright © 2025 The Authors. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4.0/

Funding

This work was supported by the Japan Society for the Promotion of Science (JSPS) Grant-in-Aid for Scientific Research (KAKENHI) under Grant 23K00656.

Keywords

  • cs.CL
  • cs.AI
