Extracting epilepsy‐related information from unstructured clinic letters using large language models

Shichao Fang; Ben Holgate; Anthony Shek; Joel S. Winston; Matthew McWilliam; Pedro F. Viana; James T. Teo; Mark P. Richardson

doi:10.1111/epi.18475

Journal article · Open Access · Jul 10, 2025

Epilepsia Vol. 66 No. 9 pp. 3369-3384 · Wiley

Abstract

Objective: The emergence of large language models (LLMs) and the increasing prevalence of electronic health records (EHRs) present significant opportunities for advancing health care research and practice. However, little research has compared and applied LLMs to extract key epilepsy‐related information from unstructured medical free text. This study addresses that gap by comparing and applying different open‐source LLMs and methods to extract epilepsy information from unstructured clinic letters, thereby optimizing EHRs as a resource for epilepsy research. We also highlight some limitations of LLMs.

Methods: Using a dataset of 280 annotated clinic letters from King's College Hospital, we explored the efficacy of open‐source LLMs (Llama and Mistral series) for extracting key epilepsy‐related information, including epilepsy type, seizure type, current anti‐seizure medications (ASMs), and associated symptoms. The study used three extraction methods (direct extraction, summarized extraction, and contextualized extraction), complemented by role‐prompting and few‐shot prompting techniques. Performance was evaluated against a gold standard dataset and was also compared to advanced fine‐tuned models and human annotations.

Results: Llama 2 13b (a 13‐billion‐parameter LLM developed by Meta) demonstrated superior extraction capabilities across tasks, consistently outperforming other LLMs (F1 = .80 in epilepsy‐type extraction, F1 = .76 in seizure‐type extraction, and F1 = .90 in current ASMs extraction). Here, the F1 score is a balanced metric (the harmonic mean of precision and recall) indicating how well the model identifies relevant information without excessive false positives or missed items. Direct extraction showed consistently high performance. Comparative analysis showed that LLMs outperformed current approaches such as MedCAT (Medical Concept Annotation Tool) in extracting epilepsy‐related information (F1 higher by .2).

Significance: The results affirm the potential of LLMs in medical information extraction relating to epilepsy, offering insights into leveraging these models for detailed and accurate data extraction from unstructured texts. The study underscores the importance of method selection in optimizing extraction performance and suggests a promising avenue for enhancing medical research and patient care through advanced natural language processing technologies.
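
The F1 values reported in the abstract can be made concrete with a small worked example. The sketch below is illustrative only: it computes F1 for a single letter by comparing a set of extracted labels with a set of gold-standard labels. The function name `f1_score`, the case-insensitive exact matching, and the example medication lists are assumptions made for illustration; the abstract does not state how the study matched or aggregated labels.

```python
from typing import Iterable, Set


def f1_score(predicted: Iterable[str], gold: Iterable[str]) -> float:
    """Return F1 (harmonic mean of precision and recall) for two label sets.

    Matching is case-insensitive exact match, an assumption for this sketch.
    """
    pred: Set[str] = {p.strip().lower() for p in predicted}
    ref: Set[str] = {g.strip().lower() for g in gold}

    true_positives = len(pred & ref)
    if true_positives == 0:
        # No overlap (or an empty set): precision or recall is zero.
        return 0.0

    precision = true_positives / len(pred)  # share of extracted labels that are correct
    recall = true_positives / len(ref)      # share of gold labels that were found
    return 2 * precision * recall / (precision + recall)


# Hypothetical example: the model lists three current ASMs,
# while the annotators recorded two.
extracted = ["lamotrigine", "levetiracetam", "clobazam"]
annotated = ["Lamotrigine", "Levetiracetam"]

score = f1_score(extracted, annotated)
print(round(score, 2))  # 0.8 (precision 2/3, recall 1.0)
```

In this hypothetical case one spurious medication lowers precision to 2/3 while recall stays at 1.0, giving F1 = .80. A corpus-level figure would additionally need an aggregation rule (for example micro- or macro-averaging across letters), which is not specified here.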

Details

Published: Jul 10, 2025
Vol/Issue: 66(9)
Pages: 3369-3384

Authors

Shichao Fang
Department of Basic & Clinical Neuroscience, King's College London, London, UK; King's College Hospital NHS Foundation Trust, London, UK; Guy's and St Thomas' NHS Foundation Trust, London, UK

Ben Holgate
Department of Basic & Clinical Neuroscience, King's College London, London, UK; King's College Hospital NHS Foundation Trust, London, UK; Guy's and St Thomas' NHS Foundation Trust, London, UK

Anthony Shek
King's College Hospital NHS Foundation Trust, London, UK; Guy's and St Thomas' NHS Foundation Trust, London, UK

Joel S. Winston
Department of Basic & Clinical Neuroscience, King's College London, London, UK; King's College Hospital NHS Foundation Trust, London, UK

Matthew McWilliam
Department of Basic & Clinical Neuroscience, King's College London, London, UK; King's College Hospital NHS Foundation Trust, London, UK

Pedro F. Viana
Department of Basic & Clinical Neuroscience, King's College London, London, UK; King's College Hospital NHS Foundation Trust, London, UK

James T. Teo
Department of Basic & Clinical Neuroscience, King's College London, London, UK; King's College Hospital NHS Foundation Trust, London, UK; Guy's and St Thomas' NHS Foundation Trust, London, UK

Mark P. Richardson
Department of Basic & Clinical Neuroscience, King's College London, London, UK; King's College Hospital NHS Foundation Trust, London, UK

Funding

Angelini Pharma

Cite This Article

Shichao Fang, Ben Holgate, Anthony Shek, et al. (2025). Extracting epilepsy‐related information from unstructured clinic letters using large language models. Epilepsia, 66(9), 3369-3384. https://doi.org/10.1111/epi.18475
