Attributes | Values
type
value
  • BACKGROUND: Reinforcement learning (RL) is a promising technique for solving complex sequential decision-making problems in healthcare domains. Recent years have seen great progress in applying RL to decision-making problems in Intensive Care Units (ICUs). However, because traditional RL algorithms aim to maximize a long-term reward function, exploration during learning may have a fatal impact on the patient. A short-term goal should therefore also be considered to keep the patient stable during treatment. METHODS: We use a Supervised-Actor-Critic (SAC) RL algorithm to address this problem by combining the long-term, goal-oriented characteristics of RL with the short-term goal of supervised learning. We evaluate the differences between SAC and the traditional Actor-Critic (AC) algorithm on the decision-making problems of ventilation and sedative dosing in ICUs. RESULTS: Results show that SAC is much more efficient than the traditional AC algorithm in terms of convergence rate and data utilization. CONCLUSIONS: The SAC algorithm not only aims to cure patients in the long term, but also reduces deviation from the strategies applied by clinicians, thereby improving the therapeutic effect.
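  The abstract describes combining a long-term actor-critic objective with a short-term supervised objective anchored to the clinician's recorded action. As a rough illustration only (not the paper's implementation), the sketch below mixes a deterministic actor-critic update with a supervised imitation term; the mixing weight `lam`, the network sizes, and the `sac_update` helper are illustrative assumptions.

```python
# Minimal sketch of a Supervised-Actor-Critic update: the actor loss blends the
# usual actor-critic term (maximize the critic's value, i.e. the long-term goal)
# with a supervised term penalizing deviation from the clinician's dose (the
# short-term goal). All hyperparameters and shapes are illustrative assumptions.
import torch
import torch.nn as nn

state_dim, action_dim, lam, gamma = 16, 2, 0.5, 0.99  # lam: weight of the supervised term

actor = nn.Sequential(nn.Linear(state_dim, 64), nn.ReLU(),
                      nn.Linear(64, action_dim), nn.Tanh())
critic = nn.Sequential(nn.Linear(state_dim + action_dim, 64), nn.ReLU(),
                       nn.Linear(64, 1))
actor_opt = torch.optim.Adam(actor.parameters(), lr=1e-3)
critic_opt = torch.optim.Adam(critic.parameters(), lr=1e-3)

def sac_update(s, a_clin, r, s_next, done):
    """One update on a batch of recorded ICU transitions.

    s, s_next : patient states [B, state_dim]
    a_clin    : clinician's recorded action (supervision signal) [B, action_dim]
    r, done   : reward and episode-termination flag, float tensors [B, 1]
    """
    # Critic: one-step TD target using the actor's action at the next state.
    with torch.no_grad():
        a_next = actor(s_next)
        target = r + gamma * (1 - done) * critic(torch.cat([s_next, a_next], dim=-1))
    td_loss = nn.functional.mse_loss(critic(torch.cat([s, a_clin], dim=-1)), target)
    critic_opt.zero_grad(); td_loss.backward(); critic_opt.step()

    # Actor: pursue the long-term value while staying close to the clinician.
    a_pred = actor(s)
    rl_loss = -critic(torch.cat([s, a_pred], dim=-1)).mean()
    sup_loss = nn.functional.mse_loss(a_pred, a_clin)
    actor_loss = (1 - lam) * rl_loss + lam * sup_loss
    actor_opt.zero_grad(); actor_loss.backward(); actor_opt.step()
    return td_loss.item(), actor_loss.item()

# Example call with a dummy batch of 32 transitions:
B = 32
s, s_next = torch.randn(B, state_dim), torch.randn(B, state_dim)
a_clin = torch.rand(B, action_dim) * 2 - 1   # clinician doses scaled to [-1, 1]
r, done = torch.randn(B, 1), torch.zeros(B, 1)
print(sac_update(s, a_clin, r, s_next, done))
```

  Setting `lam = 1` would reduce the update to pure imitation of the clinician, while `lam = 0` recovers a standard actor-critic update; intermediate values trade off the two goals.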
Subject
  • Analysis
  • Reinforcement learning
  • Decision-making
  • Markov models
  • Unsolved problems in neuroscience
  • Belief revision
part of
is abstract of
is hasSource of