Bounded Policy Synthesis for POMDPs with Safe-Reachability and Quantitative Objectives

dc.contributor.advisorChaudhuri, Swarat
dc.contributor.committeeMemberKavraki, Lydia E.
dc.creatorWang, Yue
dc.date.accessioned2019-05-17T16:33:23Z
dc.date.available2019-12-01T06:01:12Z
dc.date.created2018-12
dc.date.issued2018-10-05
dc.date.submittedDecember 2018
dc.date.updated2019-05-17T16:33:23Z
dc.description.abstractRobots are being deployed for many real-world applications like autonomous driving, disaster rescue, and personal assistance. Effectively planning robust executions under uncertainty is critical for building these autonomous robots. Partially Observable Markov Decision Processes (POMDPs) provide a standard approach to model many robot applications under uncertainty. A key algorithmic problem for POMDPs is the synthesis of policies that specify the actions to take contingent on all possible events. Policy synthesis for POMDPs with two kinds of objectives is considered in this thesis: (1) boolean objectives for a correctness guarantee of accomplishing tasks and (2) quantitative objectives for optimal behaviors. For boolean objectives, this thesis focuses on a common safe-reachability objective: with a probability above a threshold, a goal state is eventually reached while keeping the probability of visiting unsafe states below a different threshold. Previous results have shown that policy synthesis for POMDPs over infinite horizon is generally undecidable. For decidability, this thesis focuses on POMDPs over a bounded horizon. Solving POMDPs requires reasoning over a vast space of beliefs (probability distributions). To address this, this thesis introduces the notion of a goal-constrained belief space that only contains beliefs reachable under desired executions that can achieve the safe-reachability objectives. Based on this notion, this thesis presents an offline approach that constructs policies over the goal-constrained belief space instead of the entire belief space. Simulation experiments show that this offline approach can scale to large belief spaces by focusing on the goal-constrained belief space. A full policy is generally costly to compute. To improve efficiency, this thesis presents an online approach that interleaves the computation of partial policies and execution. A partial policy is parameterized by a replanning probability and only contain a sampled subset of all possible events. This online approach allows users to specify an appropriate bound on the replanning probability to balance efficiency and correctness. Finally, this thesis presents an approximate policy synthesis approach that combines the safe-reachability objectives with the quantitative objectives. The results demonstrate that the constructed policies not only achieve the safe-reachability objective but also are of high quality concerning the quantitative objective.
dc.embargo.terms2019-12-01
dc.format.mimetypeapplication/pdf
dc.identifier.citationWang, Yue. "Bounded Policy Synthesis for POMDPs with Safe-Reachability and Quantitative Objectives." (2018) Diss., Rice University. <a href="https://hdl.handle.net/1911/105878">https://hdl.handle.net/1911/105878</a>.
dc.identifier.urihttps://hdl.handle.net/1911/105878
dc.language.isoeng
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.
dc.subjectRobotics
dc.subjectFormal Methods
dc.subjectPlanning under Uncertainty
dc.subjectPOMDPs
dc.subjectPolicy
dc.titleBounded Policy Synthesis for POMDPs with Safe-Reachability and Quantitative Objectives
dc.typeThesis
dc.type.materialText
thesis.degree.departmentComputer Science
thesis.degree.disciplineEngineering
thesis.degree.grantorRice University
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
WANG-DOCUMENT-2018.pdf
Size:
115.71 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.6 KB
Format:
Plain Text
Description: