SPEER: Sentence-Level Planning of Long Clinical Summaries via Embedded Entity Retrieval (2401.02369v2)

Published 4 Jan 2024 in cs.CL

Abstract: Clinicians must write a lengthy summary each time a patient is discharged from the hospital. This task is time-consuming due to the sheer number of unique clinical concepts covered in the admission. Identifying and covering salient entities is vital for the summary to be clinically useful. We fine-tune open-source LLMs (Mistral-7B-Instruct and Zephyr-7B-beta) on the task and find that they generate incomplete and unfaithful summaries. To increase entity coverage, we train a smaller, encoder-only model to predict salient entities, which are treated as content plans to guide the LLM. To encourage the LLM to focus on specific mentions in the source notes, we propose SPEER: Sentence-level Planning via Embedded Entity Retrieval. Specifically, we mark each salient entity span with special "{{ }}" boundary tags and instruct the LLM to retrieve marked spans before generating each sentence. Sentence-level planning acts as a form of state tracking in that the model explicitly records the entities it uses. We fine-tune Mistral and Zephyr variants on a large-scale, diverse dataset of ~167k in-patient hospital admissions and evaluate on 3 datasets. SPEER shows gains in both coverage and faithfulness metrics over non-guided and guided baselines.
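The abstract's core mechanism is marking salient entity spans in the source notes with "{{ }}" boundary tags before prompting the LLM. A minimal sketch of that tagging step is below; it is illustrative only, since the paper's actual pipeline uses an encoder-only model to identify salient spans rather than the raw string matching assumed here.

```python
import re


def mark_salient_spans(note: str, salient_entities: list[str]) -> str:
    """Wrap mentions of salient entities in {{ }} boundary tags.

    Hypothetical sketch of the span-marking step described in the
    SPEER abstract. In the paper, salient entities come from a
    fine-tuned encoder-only tagger; here we assume they are given
    as plain strings and matched case-insensitively in the note.
    """
    marked = note
    for entity in salient_entities:
        # Whole-word, case-insensitive match; every occurrence is
        # wrapped so the LLM can retrieve any marked span.
        pattern = re.compile(r"\b" + re.escape(entity) + r"\b", re.IGNORECASE)
        marked = pattern.sub(lambda m: "{{ " + m.group(0) + " }}", marked)
    return marked


note = "Patient admitted with acute pancreatitis; started on IV fluids."
print(mark_salient_spans(note, ["acute pancreatitis", "IV fluids"]))
```

At generation time, the fine-tuned LLM is instructed to copy the relevant marked spans before writing each summary sentence, which is what makes the planning act as explicit state tracking.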

Authors (3)
  1. Griffin Adams (14 papers)
  2. Jason Zucker (4 papers)
  3. Noémie Elhadad (28 papers)
Citations (2)