Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning
Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator settings for each patient, a challenging and time consuming task. Hence, it would be beneficial to develop an automated decision support tool...
Saved in:
Main Authors: | , , , , , , , |
---|---|
Format: | Journal Article |
Language: | English |
Published: |
05-10-2022
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Abstract | Mechanical ventilation is a key form of life support for patients with
pulmonary impairment. Healthcare workers are required to continuously adjust
ventilator settings for each patient, a challenging and time consuming task.
Hence, it would be beneficial to develop an automated decision support tool to
optimize ventilation treatment. We present DeepVent, a Conservative Q-Learning
(CQL) based offline Deep Reinforcement Learning (DRL) agent that learns to
predict the optimal ventilator parameters for a patient to promote 90 day
survival. We design a clinically relevant intermediate reward that encourages
continuous improvement of the patient vitals as well as addresses the challenge
of sparse reward in RL. We find that DeepVent recommends ventilation parameters
within safe ranges, as outlined in recent clinical trials. The CQL algorithm
offers additional safety by mitigating the overestimation of the value
estimates of out-of-distribution states/actions. We evaluate our agent using
Fitted Q Evaluation (FQE) and demonstrate that it outperforms physicians from
the MIMIC-III dataset. |
---|---|
AbstractList | Mechanical ventilation is a key form of life support for patients with
pulmonary impairment. Healthcare workers are required to continuously adjust
ventilator settings for each patient, a challenging and time consuming task.
Hence, it would be beneficial to develop an automated decision support tool to
optimize ventilation treatment. We present DeepVent, a Conservative Q-Learning
(CQL) based offline Deep Reinforcement Learning (DRL) agent that learns to
predict the optimal ventilator parameters for a patient to promote 90 day
survival. We design a clinically relevant intermediate reward that encourages
continuous improvement of the patient vitals as well as addresses the challenge
of sparse reward in RL. We find that DeepVent recommends ventilation parameters
within safe ranges, as outlined in recent clinical trials. The CQL algorithm
offers additional safety by mitigating the overestimation of the value
estimates of out-of-distribution states/actions. We evaluate our agent using
Fitted Q Evaluation (FQE) and demonstrate that it outperforms physicians from
the MIMIC-III dataset. |
Author | Lau, Elaine Shkrob, Jacob Basu, Sumana Kondrup, Flemming Jiralerspong, Thomas Precup, Doina de Lara, Nathan Tran, My Duc |
Author_xml | – sequence: 1 givenname: Flemming surname: Kondrup fullname: Kondrup, Flemming – sequence: 2 givenname: Thomas surname: Jiralerspong fullname: Jiralerspong, Thomas – sequence: 3 givenname: Elaine surname: Lau fullname: Lau, Elaine – sequence: 4 givenname: Nathan surname: de Lara fullname: de Lara, Nathan – sequence: 5 givenname: Jacob surname: Shkrob fullname: Shkrob, Jacob – sequence: 6 givenname: My Duc surname: Tran fullname: Tran, My Duc – sequence: 7 givenname: Doina surname: Precup fullname: Precup, Doina – sequence: 8 givenname: Sumana surname: Basu fullname: Basu, Sumana |
BackLink | https://doi.org/10.48550/arXiv.2210.02552$$DView paper in arXiv |
BookMark | eNotj8tKxDAYhbPQhY4-gCvzAh1zadJkKeMVKgNadVn-pn800EmHtHh5e2N1deDjcDjfMTmIY0RCzjhbl0YpdgHpK3yshciACaXEEXltxk9I_USfwCN9QPcOMTgY6AvGOQwwhzHSJiHMuwzo8xTiG71C3NOt90OISB8xRD8mh0uhRkgxd07IoYdhwtP_XJHm5rrZ3BX19vZ-c1kXoCtRWC3zD4bWmLIXtjTAveMdY1pKhp2p-i6_NEwiMOO0tlx55MK6sucV5yBX5PxvdjFr9ynsIH23v4btYih_ACQqTMs |
ContentType | Journal Article |
Copyright | http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
Copyright_xml | – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
DBID | AKY GOX |
DOI | 10.48550/arxiv.2210.02552 |
DatabaseName | arXiv Computer Science arXiv.org |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
ExternalDocumentID | 2210_02552 |
GroupedDBID | AKY GOX |
ID | FETCH-LOGICAL-a672-9630250e9884d2948a1fc1b006330eb87db552803ea08c66915fe129c4d1711a3 |
IEDL.DBID | GOX |
IngestDate | Mon Jan 08 05:44:09 EST 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-a672-9630250e9884d2948a1fc1b006330eb87db552803ea08c66915fe129c4d1711a3 |
OpenAccessLink | https://arxiv.org/abs/2210.02552 |
ParticipantIDs | arxiv_primary_2210_02552 |
PublicationCentury | 2000 |
PublicationDate | 2022-10-05 |
PublicationDateYYYYMMDD | 2022-10-05 |
PublicationDate_xml | – month: 10 year: 2022 text: 2022-10-05 day: 05 |
PublicationDecade | 2020 |
PublicationYear | 2022 |
Score | 1.859954 |
SecondaryResourceType | preprint |
Snippet | Mechanical ventilation is a key form of life support for patients with
pulmonary impairment. Healthcare workers are required to continuously adjust
ventilator... |
SourceID | arxiv |
SourceType | Open Access Repository |
SubjectTerms | Computer Science - Learning |
Title | Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning |
URI | https://arxiv.org/abs/2210.02552 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1NTwMhECW2Jy9Go6Z-hoNXIrAU2KOxrb1oE7vR3jYsDMZLbVpr_Pky7DZ68QpceHzMy8zjQciNTIfPF16z2AhgyjvPGq4jG2pvDWjDRcBU9nRunhZ2NEabHLp7C-PW3-9frT9ws7mVEpVXifWmS7YnJUq2HmaLtjiZrbi68b_jEsfMTX-CxOSQHHTsjt61y3FE9mB5TF6rLE3d0LmLQB8B39oiNPQFlTqtFo1WO8E3zUV8OgJY0VmMyALpM2R_U59TebSzRH07IdVkXN1PWfefAXPaSJa2OhIOKK1VQZbKOhG9wG1fFBwaa0KTJmB5AY5br3UphhFSOPYqCCOEK05Jf_mxhAGh1sjIXcASpFSm5E7poTMqRFCFLn1zRgYZhXrVWlbUCFCdATr_v-uC7EsU9-fy-CXpf663cEV6m7C9zrj_APWogDc |
link.rule.ids | 228,230,782,887 |
linkProvider | Cornell University |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Towards+Safe+Mechanical+Ventilation+Treatment+Using+Deep+Offline+Reinforcement+Learning&rft.au=Kondrup%2C+Flemming&rft.au=Jiralerspong%2C+Thomas&rft.au=Lau%2C+Elaine&rft.au=de+Lara%2C+Nathan&rft.date=2022-10-05&rft_id=info:doi/10.48550%2Farxiv.2210.02552&rft.externalDocID=2210_02552 |