Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning

Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator settings for each patient, a challenging and time consuming task. Hence, it would be beneficial to develop an automated decision support tool...

Full description

Saved in:

Bibliographic Details
Main Authors:	Kondrup, Flemming, Jiralerspong, Thomas, Lau, Elaine, de Lara, Nathan, Shkrob, Jacob, Tran, My Duc, Precup, Doina, Basu, Sumana
Format:	Journal Article
Language:	English
Published:	05-10-2022
Subjects:	Computer Science - Learning
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Abstract	Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator settings for each patient, a challenging and time consuming task. Hence, it would be beneficial to develop an automated decision support tool to optimize ventilation treatment. We present DeepVent, a Conservative Q-Learning (CQL) based offline Deep Reinforcement Learning (DRL) agent that learns to predict the optimal ventilator parameters for a patient to promote 90 day survival. We design a clinically relevant intermediate reward that encourages continuous improvement of the patient vitals as well as addresses the challenge of sparse reward in RL. We find that DeepVent recommends ventilation parameters within safe ranges, as outlined in recent clinical trials. The CQL algorithm offers additional safety by mitigating the overestimation of the value estimates of out-of-distribution states/actions. We evaluate our agent using Fitted Q Evaluation (FQE) and demonstrate that it outperforms physicians from the MIMIC-III dataset.
AbstractList	Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator settings for each patient, a challenging and time consuming task. Hence, it would be beneficial to develop an automated decision support tool to optimize ventilation treatment. We present DeepVent, a Conservative Q-Learning (CQL) based offline Deep Reinforcement Learning (DRL) agent that learns to predict the optimal ventilator parameters for a patient to promote 90 day survival. We design a clinically relevant intermediate reward that encourages continuous improvement of the patient vitals as well as addresses the challenge of sparse reward in RL. We find that DeepVent recommends ventilation parameters within safe ranges, as outlined in recent clinical trials. The CQL algorithm offers additional safety by mitigating the overestimation of the value estimates of out-of-distribution states/actions. We evaluate our agent using Fitted Q Evaluation (FQE) and demonstrate that it outperforms physicians from the MIMIC-III dataset.
Author	Lau, Elaine Shkrob, Jacob Basu, Sumana Kondrup, Flemming Jiralerspong, Thomas Precup, Doina de Lara, Nathan Tran, My Duc
Author_xml	– sequence: 1 givenname: Flemming surname: Kondrup fullname: Kondrup, Flemming – sequence: 2 givenname: Thomas surname: Jiralerspong fullname: Jiralerspong, Thomas – sequence: 3 givenname: Elaine surname: Lau fullname: Lau, Elaine – sequence: 4 givenname: Nathan surname: de Lara fullname: de Lara, Nathan – sequence: 5 givenname: Jacob surname: Shkrob fullname: Shkrob, Jacob – sequence: 6 givenname: My Duc surname: Tran fullname: Tran, My Duc – sequence: 7 givenname: Doina surname: Precup fullname: Precup, Doina – sequence: 8 givenname: Sumana surname: Basu fullname: Basu, Sumana
BackLink	https://doi.org/10.48550/arXiv.2210.02552$$DView paper in arXiv
BookMark	eNotj8tKxDAYhbPQhY4-gCvzAh1zadJkKeMVKgNadVn-pn800EmHtHh5e2N1deDjcDjfMTmIY0RCzjhbl0YpdgHpK3yshciACaXEEXltxk9I_USfwCN9QPcOMTgY6AvGOQwwhzHSJiHMuwzo8xTiG71C3NOt90OISB8xRD8mh0uhRkgxd07IoYdhwtP_XJHm5rrZ3BX19vZ-c1kXoCtRWC3zD4bWmLIXtjTAveMdY1pKhp2p-i6_NEwiMOO0tlx55MK6sucV5yBX5PxvdjFr9ynsIH23v4btYih_ACQqTMs
ContentType	Journal Article
Copyright	http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml	– notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID	AKY GOX
DOI	10.48550/arxiv.2210.02552
DatabaseName	arXiv Computer Science arXiv.org
DatabaseTitleList
Database_xml	– sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
ExternalDocumentID	2210_02552
GroupedDBID	AKY GOX
ID	FETCH-LOGICAL-a672-9630250e9884d2948a1fc1b006330eb87db552803ea08c66915fe129c4d1711a3
IEDL.DBID	GOX
IngestDate	Mon Jan 08 05:44:09 EST 2024
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-a672-9630250e9884d2948a1fc1b006330eb87db552803ea08c66915fe129c4d1711a3
OpenAccessLink	https://arxiv.org/abs/2210.02552
ParticipantIDs	arxiv_primary_2210_02552
PublicationCentury	2000
PublicationDate	2022-10-05
PublicationDateYYYYMMDD	2022-10-05
PublicationDate_xml	– month: 10 year: 2022 text: 2022-10-05 day: 05
PublicationDecade	2020
PublicationYear	2022
Score	1.859954
SecondaryResourceType	preprint
Snippet	Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator...
SourceID	arxiv
SourceType	Open Access Repository
SubjectTerms	Computer Science - Learning
Title	Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning
URI	https://arxiv.org/abs/2210.02552
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1NTwMhECW2Jy9Go6Z-hoNXIrAU2KOxrb1oE7vR3jYsDMZLbVpr_Pky7DZ68QpceHzMy8zjQciNTIfPF16z2AhgyjvPGq4jG2pvDWjDRcBU9nRunhZ2NEabHLp7C-PW3-9frT9ws7mVEpVXifWmS7YnJUq2HmaLtjiZrbi68b_jEsfMTX-CxOSQHHTsjt61y3FE9mB5TF6rLE3d0LmLQB8B39oiNPQFlTqtFo1WO8E3zUV8OgJY0VmMyALpM2R_U59TebSzRH07IdVkXN1PWfefAXPaSJa2OhIOKK1VQZbKOhG9wG1fFBwaa0KTJmB5AY5br3UphhFSOPYqCCOEK05Jf_mxhAGh1sjIXcASpFSm5E7poTMqRFCFLn1zRgYZhXrVWlbUCFCdATr_v-uC7EsU9-fy-CXpf663cEV6m7C9zrj_APWogDc
link.rule.ids	228,230,782,887
linkProvider	Cornell University
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Towards+Safe+Mechanical+Ventilation+Treatment+Using+Deep+Offline+Reinforcement+Learning&rft.au=Kondrup%2C+Flemming&rft.au=Jiralerspong%2C+Thomas&rft.au=Lau%2C+Elaine&rft.au=de+Lara%2C+Nathan&rft.date=2022-10-05&rft_id=info:doi/10.48550%2Farxiv.2210.02552&rft.externalDocID=2210_02552