About: Euclidean Markov decision processes are a powerful tool for modeling control problems under uncertainty over continuous domains. Finite-state imprecise Markov decision processes can be used to approximate the behavior of these infinite models. In this paper we address two questions. First, we investigate what kind of approximation guarantees are obtained when the Euclidean process is approximated by finite-state approximations induced by increasingly fine partitions of the continuous state space. We show that for cost functions over finite time horizons the approximations become arbitrarily precise. Second, we use imprecise Markov decision process approximations as a tool to analyse and validate cost functions and strategies obtained by reinforcement learning. We find that, on the one hand, our new theoretical results validate basic design choices of a previously proposed reinforcement learning approach. On the other hand, the imprecise Markov decision process approximations reveal some inaccuracies in the learned cost functions.
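
The idea of bracketing a finite-horizon cost with a finite-state imprecise Markov decision process can be illustrated with a minimal sketch. It is not the construction used in the paper: it assumes the partition has already been turned into an interval model in which each cell and action has lower and upper bounds on the probability of moving to each other cell, and in which the step cost is constant per cell and action. The names `extreme_expectation` and `finite_horizon_bounds` and the data layout are hypothetical, chosen only for this example.

```python
from typing import List, Sequence, Tuple


def extreme_expectation(lo: Sequence[float], hi: Sequence[float],
                        values: Sequence[float], maximize: bool) -> float:
    """Largest (or smallest) expectation of `values` over all distributions p
    with lo[t] <= p[t] <= hi[t]. Assumes sum(lo) <= 1 <= sum(hi)."""
    # Visit successors from most to least favourable for the chosen direction.
    order = sorted(range(len(values)), key=lambda t: values[t], reverse=maximize)
    probs = list(lo)            # every successor gets at least its lower bound
    budget = 1.0 - sum(lo)      # remaining probability mass to hand out
    for t in order:
        extra = min(hi[t] - lo[t], budget)
        probs[t] += extra
        budget -= extra
    return sum(p * v for p, v in zip(probs, values))


def finite_horizon_bounds(lo: List[List[List[float]]],
                          hi: List[List[List[float]]],
                          cost: List[List[float]],
                          horizon: int) -> Tuple[List[float], List[float]]:
    """Lower and upper bounds on the minimal expected cost accumulated over
    `horizon` steps. lo[s][a][t] and hi[s][a][t] bound the probability of
    moving from cell s to cell t under action a; cost[s][a] is the step cost."""
    n = len(cost)
    lower = [0.0] * n
    upper = [0.0] * n
    for _ in range(horizon):
        # Controller minimises; the unknown transition law is resolved
        # optimistically for the lower bound and pessimistically for the upper.
        lower = [min(cost[s][a] + extreme_expectation(lo[s][a], hi[s][a], lower, False)
                     for a in range(len(cost[s])))
                 for s in range(n)]
        upper = [min(cost[s][a] + extreme_expectation(lo[s][a], hi[s][a], upper, True)
                     for a in range(len(cost[s])))
                 for s in range(n)]
    return lower, upper


# Toy model with two cells: cell 1 is a cost-free absorbing goal.
lo_example = [[[0.4, 0.4], [0.0, 1.0]], [[0.0, 1.0]]]
hi_example = [[[0.6, 0.6], [0.0, 1.0]], [[0.0, 1.0]]]
cost_example = [[1.0, 2.0], [0.0]]
bounds = finite_horizon_bounds(lo_example, hi_example, cost_example, horizon=2)
print(bounds)
```

In this sketch the gap between the two returned vectors plays the role of the approximation error: under the stated assumptions, narrower probability intervals (as one would expect from a finer partition) give a tighter bracket around the cost.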



An Entity of Type: fabio:Abstract, within Data Space: wasabi.inria.fr, associated with source document(s)

Attributes	Values
type	abstract
value	Euclidean Markov decision processes are a powerful tool for modeling control problems under uncertainty over continuous domains. Finite-state imprecise Markov decision processes can be used to approximate the behavior of these infinite models. In this paper we address two questions. First, we investigate what kind of approximation guarantees are obtained when the Euclidean process is approximated by finite-state approximations induced by increasingly fine partitions of the continuous state space. We show that for cost functions over finite time horizons the approximations become arbitrarily precise. Second, we use imprecise Markov decision process approximations as a tool to analyse and validate cost functions and strategies obtained by reinforcement learning. We find that, on the one hand, our new theoretical results validate basic design choices of a previously proposed reinforcement learning approach. On the other hand, the imprecise Markov decision process approximations reveal some inaccuracies in the learned cost functions.
subject	Reinforcement learning; Markov models; Markov processes; Optimal decisions; Dynamic programming; Stochastic control; Loss functions; Belief revision; Finite automata; Reconfiguration
part of	Approximating Euclidean by Imprecise Markov Decision Processes
is abstract of	Approximating Euclidean by Imprecise Markov Decision Processes
is hasSource of	covid:ann/target/873527e9502e49b597e01dd4f64cd9441a8146f2 covid:ann/target/b57e4191be52a1edd9de194ab1de70446fcc5a44 covid:ann/target/e581ba206b6d2e3c2bda60f48dbd24d7933c07cf covid:ann/target/1ef6971cab585267b64973b3da1aa1659fd2a748 covid:ann/target/50f13fda936ca6590e757f527fdb2cea976c7758 covid:ann/target/96cb87588bd1cc637bd2de1f95ae6dbad7acf4ba covid:ann/target/9a6099af63f1ce2ab057b73561ce5db4f316a650 covid:ann/target/b32f376b9b1577da15b952e71a41793c319d1435 covid:ann/target/4aa557d780b64c5f9d2466eac2277955b487eb62 covid:ann/target/31ef59b0e3762912096a01e8a8f0290dd6ad40c9 covid:ann/target/9a7943268c9b57fc629af0b5d4f3d7bfdcfeacfa covid:ann/target/3842e1ca93cb5c329e1c7d1075c9539f446477f5 covid:ann/target/de77b3ec47a2b6d0411232d06d9640bce6000fdd covid:ann/target/50ae9709af221d35e8a2d83ee35f2eb1f906cd9b covid:ann/target/75eaa1f2e8048efebb6b5d3e484a59143413ceba covid:ann/target/931aec615a4eaa186cac663458d7eb66974ecc49 covid:ann/target/af6f66c3bee93f7acf770a0e4eba1b261e79c893 covid:ann/target/d83b7a139fe792afbde6547cf84dfbfa363007a2 covid:ann/target/2804ef50b5d3012c5ff1cb7faac41fa9a1b97f26 covid:ann/target/58511b8ec993fcb6b8e055394bdf906830fe2e04 covid:ann/target/7578a29aca14ec3ea33431ad77344757491e7e85 covid:ann/target/8de8d5815bc622a2e08fe4a5c5af58b269b26919 covid:ann/target/ff9378a8c3e47b471a8b48dcd468d147dbe66acf covid:ann/target/0b4873670b1f09b89a3a49586443e980a7ef8258 covid:ann/target/483d76cf55c47950536df4b85ddb09a03c1bfb37 covid:ann/target/4b45751c43e873d397a6e796eb92cb24d0fe7a5c covid:ann/target/ff2b68e66876bbe166fe40c831f2ee5d350c28c1 covid:ann/target/e92f48721f679ee6c554f8f8a9ec421ce35b5ce9 covid:ann/target/9e084e8d16db6b1ad14c330a62e8daec95f78544 covid:ann/target/81d0c9a32d39ef13266381d52083d20a82f76b72 covid:ann/target/1f264877b8a68544bc5a3bf29022b187f89a8477 covid:ann/target/feee35c0b48d31b06a7efbc22b373623bb6d1c5b covid:ann/target/0bb963b4907046eadce2d748516291938b58d701 covid:ann/target/12b51f4e4383fcc23172f17a1d82fa3f12aa0dfc 
covid:ann/target/bae7384921acac0c0970adbf963a5ed9b433eb34 covid:ann/target/b132e1b6425014c5d79b1d03f9bf1742696da558 covid:ann/target/110518a2f32762b8f0ca244a58c11c0adb54fa72 covid:ann/target/1d1f279e1cbc49c56cea4409a9f2c11b77f9938b covid:ann/target/3b9c3814581a75c6d9ed7545812800577aa87390 covid:ann/target/c3858c0404152defd6984dbe1e67f5a0a971a9ed covid:ann/target/3524cc4850bf25731e12d09dd978f71884d484a0 covid:ann/target/2dde18651d2bb8067384458237e48d486a18a746 covid:ann/target/6910b42ea979640c26d25d09bc1478b3caf071f1 covid:ann/target/46eee5835556892f129bf909d6c5c7ae2ca59ad9 covid:ann/target/842dca969aa2234c318728009a6d3c0c4487abca covid:ann/target/a74b6270a201c6492fab8a80d089b74e19bb7616 covid:ann/target/ed3516e0734c765b828ab7f1cef898ea5cfa56c7 covid:ann/target/72e5a6fe1d46f0c20bf85542eafd3aee9053da25 covid:ann/target/c25dc1bd9d3457db0314e44dbdd682ec1e764e6c covid:ann/target/4f231307f771e7a4986e96e636650c6af934e28b covid:ann/target/8c78cd87a0e2e1cdc9f2553a09fe1834b1218ffe covid:ann/target/bc2e228a3b58f1ae0d6f91243f9d85a63883e2a8 covid:ann/target/c83d5c559afcf165447ec04c0a65106625fc9cfc covid:ann/target/10a8887fc7da4583572f7910b3bf554074835f4a covid:ann/target/1375c8f578e7e32a745fe90eed536a6d0904cd86 covid:ann/target/2739d25615fc94f78661ff53c9562fc8b7f871e1 covid:ann/target/df8bcb6e7fd7e0735c1999d25f58bee937ae6b30 covid:ann/target/3e38d434082b4caf99bb030190570e9967bb5f3b covid:ann/target/a93bb16c5a46e1eb4c8daba1f96b3a2b10b03cef covid:ann/target/1158ac116389104262a07b85f6833e4ef04ae9c7 covid:ann/target/ca9876aa2c3d34c251d65cd286ff29f3a4365a53 covid:ann/target/04d94a9cef1a2b3b822ffdd9514bbb1d95a1edbd covid:ann/target/40c5e73fd6f0756914151c5baa0312558ef033ed covid:ann/target/5835f81082f567f3c8a5659976dcabddacf6aa3c covid:ann/target/5a851732346dea83180f6fe84ef283b452ce7006 covid:ann/target/ead55121c9ac6219146e1e1c07aa05fa3f2986b4 covid:ann/target/b1f9c644054c17f171eac02903878ec0cca2f203
