Pre-trained Models for Grammatical Error Correction in Healthcare-Specific Text
Journal
Advances in Soft Computing : 24th Mexican International Conference on Artificial Intelligence, MICAI 2025, Guanajuato, Mexico, November 3, 2025, Proceedings, Part I
ISSN
0302-9743
1611-3349
Publisher
Springer Nature Switzerland
Date Issued
2025
Author(s)
González Mora, José Guillermo
Type
text::book::book part
Abstract
Grammatical Error Correction (GEC) is a common Natural Language Processing task and has been studied extensively in the field. There is an added complexity when trying to correct domain-specific text that is uncommon in general-purpose corpora. In this paper, we present a comparative study of three different approaches using pre-trained language models for Grammatical Error Correction in healthcare-specific text. We evaluated the performance of all proposals using the GLEU score metric, which allows a quantitative comparison of all methods. We utilized two different problem settings: first, a sequence-to-sequence pre-trained T5 model, followed by a fine-tuning process over a small set of examples. The second model is a pre-trained large language model, for which we test both zero-shot and few-shot in-context learning. The study shows how the fine-tuned T5 model is capable of exceeding the performance shown by the LLM tested. With this result, we show how smaller encoder-decoder models can solve domain-specific tasks with fewer parameters than a purely generative pre-trained LLM. ©The authors ©Springer.
License
Acceso Restringido
How to cite
González Mora, G., Ponce, H. (2026). Pre-trained Models for Grammatical Error Correction in Healthcare-Specific Text. In: Martínez-Villaseñor, L., Vázquez, R.A., Ochoa-Ruiz, G. (eds) Advances in Soft Computing. MICAI 2025. Lecture Notes in Computer Science(), vol 16221. Springer, Cham. https://doi.org/10.1007/978-3-032-09037-9_27
