Integrating a Large Language Model Into Sepsis Predictive Tool Improves Clinical Acceptance Without Delaying Antibiotics in the Emergency Department

Integrating a Large Language Model Into Sepsis Predictive Tool Improves Clinical Acceptance Without Delaying Antibiotics in the Emergency Department

Tuesday, May 19, 2026 2:40 PM to 2:48 PM · 8 min. (America/New_York)
M101: Level M
Abstracts
Informatics/Data Science/AI

Information

Abstract Number
661
Background and Objectives
The use of artificial intelligence (AI), and more recently, large language models (LLM) afford emergency providers novel tools to better identify patients at risk of sepsis. We previously implemented COMPOSER, an AI model to predict sepsis 4-6 hours prior to clinical manifestation. Here, we test the hypothesis that the addition of an LLM to ingest clinical notes and refine predictions in cases of diagnostic uncertainty to COMPOSER would improve clinical acceptance without impacting timeliness of antibiotics.
Methods
We employed a before-after quasi-experimental design to compare the clinical impact of transitioning from an existing sepsis prediction model (COMPOSER) to an LLM-augmented model (COMPOSER-LLM) at two emergency departments (ED) within an academic health system. COMPOSER was used during the pre-intervention period (12/2022-7/2024) and COMPOSER-LLM was used during the post-intervention period (8/2024-3/2025). The outcome of interest was the proportion of alerts in which the clinician selected “no Infection suspected” in each study period as a proxy for model utility and acceptance. We used a Bayesian structural time-series model to a conduct causal impact analysis, adjusting for seasonal effects and confounders (comorbidities and ED volume). The primary outcome was the difference between observed and estimated responses in the intervention period. Additionally, we measured time to antibiotics in both periods to assess if delays in antibiotic administration occurred after the implementation of COMPOSER-LLM.
Results
There were 2539 alert events in the pre-intervention period and 1978 in the post-intervention period. We found a significant drop in rates of “No Infection Suspected” after implementation when comparing the actual vs expected responses in the post-intervention period (25%-30% pre-intervention to 18-22% and post-intervention, with deviations of −5% to −15%, indicating lower observed rates than predicted in the post-intervention period (p < 0.001). Timeliness of antibiotics was not changed after implementation of COMPOSER-LLM (median 140.4 min [IQR 37.3–249.1] to 112.1 min [IQR 32.1–280.3], not significant).
Conclusion
Augmenting a sepsis prediction model with an LLM resulted in a significant decrease in cases where providers selected “no infection suspected,” without negatively impacting antibiotic timeliness.
CME
0.75

Disclosures

Access the following link to view disclosures of session presenters, presenting authors, organizers, moderators, and planners:

Log in

See all the content and easy-to-use features by logging in or registering!