Pre-trained Models for Grammatical Error Correction in Healthcare-Specific Text

González Mora, José Guillermo; Ponce, Hiram

doi:10.1007/978-3-032-09037-9_27

Pre-trained Models for Grammatical Error Correction in Healthcare-Specific Text

Journal

Advances in Soft Computing : 24th Mexican International Conference on Artificial Intelligence, MICAI 2025, Guanajuato, Mexico, November 3, 2025, Proceedings, Part I

ISSN

0302-9743

1611-3349

Publisher

Springer Nature Switzerland

Date Issued

2025

Author(s)

González Mora, José Guillermo

Facultad de Ingeniería - CampCM

Ponce, Hiram

Facultad de Ingeniería - CampCM

Type

text::book::book part

DOI

10.1007/978-3-032-09037-9_27

URL

https://scripta.up.edu.mx/handle/20.500.12552/12597

Abstract

Grammatical Error Correction (GEC) is a common Natural Language Processing task and has been studied extensively in the field. There is an added complexity when trying to correct domain-specific text that is uncommon in general-purpose corpora. In this paper, we present a comparative study of three different approaches using pre-trained language models for Grammatical Error Correction in healthcare-specific text. We evaluated the performance of all proposals using the GLEU score metric, which allows a quantitative comparison of all methods. We utilized two different problem settings: first, a sequence-to-sequence pre-trained T5 model, followed by a fine-tuning process over a small set of examples. The second model is a pre-trained large language model, for which we test both zero-shot and few-shot in-context learning. The study shows how the fine-tuned T5 model is capable of exceeding the performance shown by the LLM tested. With this result, we show how smaller encoder-decoder models can solve domain-specific tasks with fewer parameters than a purely generative pre-trained LLM. ©The authors ©Springer.

Subjects

Grammatical Error Cor...

Sequence-to-sequence ...

LLM

Zero-shot learning

Few-shot learning

Fine-tuning

License

Acceso Restringido

URL License

https://creativecommons.org/licenses/by-nc-sa/4.0/

How to cite

González Mora, G., Ponce, H. (2026). Pre-trained Models for Grammatical Error Correction in Healthcare-Specific Text. In: Martínez-Villaseñor, L., Vázquez, R.A., Ochoa-Ruiz, G. (eds) Advances in Soft Computing. MICAI 2025. Lecture Notes in Computer Science(), vol 16221. Springer, Cham. https://doi.org/10.1007/978-3-032-09037-9_27