Show simple item record

dc.contributor.advisorPacheco, Jason
dc.contributor.authorMurphy, Ryan
dc.creatorMurphy, Ryan
dc.date.accessioned2023-06-11T16:49:55Z
dc.date.available2023-06-11T16:49:55Z
dc.date.issued2023
dc.identifier.citationMurphy, Ryan. (2023). Policy Improvement via Planning in Maximum Entropy Reinforcement Learning (Master's thesis, University of Arizona, Tucson, USA).
dc.identifier.urihttp://hdl.handle.net/10150/668322
dc.description.abstractThis thesis examines maximum entropy reinforcement learning, an alternative formulation of the traditional reinforcement learning paradigm. Maximum entropy policies prioritize actions leading to states where an agent has more choice between high-value future trajectories. Here we propose two novel algorithms rooted in this framework that use planning-based approaches to improve policy learning. The first model-based algorithm achieves improved value function estimates via Monte Carlo Tree Search. The second algorithm uses a model-free heuristic-based approach to improve deep exploration in challenging environments. Finally, we present a preliminary analysis comparing optimal maximum entropy policies to optimal policies under the traditional reinforcement learning objective.
dc.language.isoen
dc.publisherThe University of Arizona.
dc.rightsCopyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction, presentation (such as public display or performance) of protected items is prohibited except with permission of the author.
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/
dc.subjectMachine Learning
dc.subjectReinforcement Learning
dc.titlePolicy Improvement via Planning in Maximum Entropy Reinforcement Learning
dc.typetext
dc.typeElectronic Thesis
thesis.degree.grantorUniversity of Arizona
thesis.degree.levelmasters
dc.contributor.committeememberJun, Kwang-Sung
dc.contributor.committeememberMorrison, Clayton
thesis.degree.disciplineGraduate College
thesis.degree.disciplineComputer Science
thesis.degree.nameM.S.
refterms.dateFOA2023-06-11T16:49:56Z


Files in this item

Thumbnail
Name:
azu_etd_20520_sip1_m.pdf
Size:
3.487Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record