Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning

Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator settings for each patient, a challenging and time consuming task. Hence, it would be beneficial to develop an automated decision support tool...

Full description

Saved in:
Bibliographic Details
Main Authors: Kondrup, Flemming, Jiralerspong, Thomas, Lau, Elaine, de Lara, Nathan, Shkrob, Jacob, Tran, My Duc, Precup, Doina, Basu, Sumana
Format: Journal Article
Language:English
Published: 05-10-2022
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator settings for each patient, a challenging and time consuming task. Hence, it would be beneficial to develop an automated decision support tool to optimize ventilation treatment. We present DeepVent, a Conservative Q-Learning (CQL) based offline Deep Reinforcement Learning (DRL) agent that learns to predict the optimal ventilator parameters for a patient to promote 90 day survival. We design a clinically relevant intermediate reward that encourages continuous improvement of the patient vitals as well as addresses the challenge of sparse reward in RL. We find that DeepVent recommends ventilation parameters within safe ranges, as outlined in recent clinical trials. The CQL algorithm offers additional safety by mitigating the overestimation of the value estimates of out-of-distribution states/actions. We evaluate our agent using Fitted Q Evaluation (FQE) and demonstrate that it outperforms physicians from the MIMIC-III dataset.
AbstractList Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator settings for each patient, a challenging and time consuming task. Hence, it would be beneficial to develop an automated decision support tool to optimize ventilation treatment. We present DeepVent, a Conservative Q-Learning (CQL) based offline Deep Reinforcement Learning (DRL) agent that learns to predict the optimal ventilator parameters for a patient to promote 90 day survival. We design a clinically relevant intermediate reward that encourages continuous improvement of the patient vitals as well as addresses the challenge of sparse reward in RL. We find that DeepVent recommends ventilation parameters within safe ranges, as outlined in recent clinical trials. The CQL algorithm offers additional safety by mitigating the overestimation of the value estimates of out-of-distribution states/actions. We evaluate our agent using Fitted Q Evaluation (FQE) and demonstrate that it outperforms physicians from the MIMIC-III dataset.
Author Lau, Elaine
Shkrob, Jacob
Basu, Sumana
Kondrup, Flemming
Jiralerspong, Thomas
Precup, Doina
de Lara, Nathan
Tran, My Duc
Author_xml – sequence: 1
  givenname: Flemming
  surname: Kondrup
  fullname: Kondrup, Flemming
– sequence: 2
  givenname: Thomas
  surname: Jiralerspong
  fullname: Jiralerspong, Thomas
– sequence: 3
  givenname: Elaine
  surname: Lau
  fullname: Lau, Elaine
– sequence: 4
  givenname: Nathan
  surname: de Lara
  fullname: de Lara, Nathan
– sequence: 5
  givenname: Jacob
  surname: Shkrob
  fullname: Shkrob, Jacob
– sequence: 6
  givenname: My Duc
  surname: Tran
  fullname: Tran, My Duc
– sequence: 7
  givenname: Doina
  surname: Precup
  fullname: Precup, Doina
– sequence: 8
  givenname: Sumana
  surname: Basu
  fullname: Basu, Sumana
BackLink https://doi.org/10.48550/arXiv.2210.02552$$DView paper in arXiv
BookMark eNotj8tKxDAYhbPQhY4-gCvzAh1zadJkKeMVKgNadVn-pn800EmHtHh5e2N1deDjcDjfMTmIY0RCzjhbl0YpdgHpK3yshciACaXEEXltxk9I_USfwCN9QPcOMTgY6AvGOQwwhzHSJiHMuwzo8xTiG71C3NOt90OISB8xRD8mh0uhRkgxd07IoYdhwtP_XJHm5rrZ3BX19vZ-c1kXoCtRWC3zD4bWmLIXtjTAveMdY1pKhp2p-i6_NEwiMOO0tlx55MK6sucV5yBX5PxvdjFr9ynsIH23v4btYih_ACQqTMs
ContentType Journal Article
Copyright http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID AKY
GOX
DOI 10.48550/arxiv.2210.02552
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2210_02552
GroupedDBID AKY
GOX
ID FETCH-LOGICAL-a672-9630250e9884d2948a1fc1b006330eb87db552803ea08c66915fe129c4d1711a3
IEDL.DBID GOX
IngestDate Mon Jan 08 05:44:09 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a672-9630250e9884d2948a1fc1b006330eb87db552803ea08c66915fe129c4d1711a3
OpenAccessLink https://arxiv.org/abs/2210.02552
ParticipantIDs arxiv_primary_2210_02552
PublicationCentury 2000
PublicationDate 2022-10-05
PublicationDateYYYYMMDD 2022-10-05
PublicationDate_xml – month: 10
  year: 2022
  text: 2022-10-05
  day: 05
PublicationDecade 2020
PublicationYear 2022
Score 1.859954
SecondaryResourceType preprint
Snippet Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Learning
Title Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning
URI https://arxiv.org/abs/2210.02552
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1NTwMhECW2Jy9Go6Z-hoNXIrAU2KOxrb1oE7vR3jYsDMZLbVpr_Pky7DZ68QpceHzMy8zjQciNTIfPF16z2AhgyjvPGq4jG2pvDWjDRcBU9nRunhZ2NEabHLp7C-PW3-9frT9ws7mVEpVXifWmS7YnJUq2HmaLtjiZrbi68b_jEsfMTX-CxOSQHHTsjt61y3FE9mB5TF6rLE3d0LmLQB8B39oiNPQFlTqtFo1WO8E3zUV8OgJY0VmMyALpM2R_U59TebSzRH07IdVkXN1PWfefAXPaSJa2OhIOKK1VQZbKOhG9wG1fFBwaa0KTJmB5AY5br3UphhFSOPYqCCOEK05Jf_mxhAGh1sjIXcASpFSm5E7poTMqRFCFLn1zRgYZhXrVWlbUCFCdATr_v-uC7EsU9-fy-CXpf663cEV6m7C9zrj_APWogDc
link.rule.ids 228,230,782,887
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Towards+Safe+Mechanical+Ventilation+Treatment+Using+Deep+Offline+Reinforcement+Learning&rft.au=Kondrup%2C+Flemming&rft.au=Jiralerspong%2C+Thomas&rft.au=Lau%2C+Elaine&rft.au=de+Lara%2C+Nathan&rft.date=2022-10-05&rft_id=info:doi/10.48550%2Farxiv.2210.02552&rft.externalDocID=2210_02552