Enhancing EHR-based pancreatic cancer prediction with LLM-derived embeddings
Published in npj Digital Medicine, 2025
Pancreatic cancer (PC) is often diagnosed late, as early symptoms and effective screening tools are lacking, and genetic or familial factors explain only ~10% of cases. Leveraging longitudinal electronic health record (EHR) data may offer a promising avenue for early detection. We developed a predictive model using large language model (LLM)-derived embeddings of medical condition names to enhance learning from EHR data.
Recommended citation: Park, J., Patterson, J., Acitores Cortina, J.M. et al. Enhancing EHR-based pancreatic cancer prediction with LLM-derived embeddings. npj Digit. Med. 8, 465 (2025). https://doi.org/10.1038/s41746-025-01869-8
Read paper | Download paper