An echo state model of non-Markovian reinforcement learning

Bush, Keith A., author; Anderson, Charles W., advisor; Draper, Bruce A. (Bruce Austin), 1962-, committee member; Kirby, Michael, 1961-, committee member; Young, Peter M., committee member

An echo state model of non-Markovian reinforcement learning

dc.contributor.author	Bush, Keith A., author
dc.contributor.author	Anderson, Charles W., advisor
dc.contributor.author	Draper, Bruce A. (Bruce Austin), 1962-, committee member
dc.contributor.author	Kirby, Michael, 1961-, committee member
dc.contributor.author	Young, Peter M., committee member
dc.date.accessioned	2007-01-03T04:17:52Z
dc.date.available	2007-01-03T04:17:52Z
dc.date.issued	2008
dc.description	Department Head: Dale H. Grit.
dc.description.abstract	There exists a growing need for intelligent, autonomous control strategies that operate in real-world domains. Theoretically the state-action space must exhibit the Markov property in order for reinforcement learning to be applicable. Empirical evidence, however, suggests that reinforcement learning also applies to domains where the state-action space is approximately Markovian, a requirement for the overwhelming majority of real-world domains. These domains, termed non-Markovian reinforcement learning domains, raise a unique set of practical challenges. The reconstruction dimension required to approximate a Markovian state-space is unknown a priori and can potentially be large. Further, spatial complexity of local function approximation of the reinforcement learning domain grows exponentially with the reconstruction dimension. Parameterized dynamic systems alleviate both embedding length and state-space dimensionality concerns by reconstructing an approximate Markovian state-space via a compact, recurrent representation. Yet this representation extracts a cost; modeling reinforcement learning domains via adaptive, parameterized dynamic systems is characterized by instability, slow-convergence, and high computational or spatial training complexity. The objectives of this research are to demonstrate a stable, convergent, accurate, and scalable model of non-Markovian reinforcement learning domains. These objectives are fulfilled via fixed point analysis of the dynamics underlying the reinforcement learning domain and the Echo State Network, a class of parameterized dynamic system. Understanding models of non-Markovian reinforcement learning domains requires understanding the interactions between learning domains and their models. Fixed point analysis of the Mountain Car Problem reinforcement learning domain, for both local and nonlocal function approximations, suggests a close relationship between the locality of the approximation and the number and severity of bifurcations of the fixed point structure. This research suggests the likely cause of this relationship: reinforcement learning domains exist within a dynamic feature space in which trajectories are analogous to states. The fixed point structure maps dynamic space onto state-space. This explanation suggests two testable hypotheses. Reinforcement learning is sensitive to state-space locality because states cluster as trajectories in time rather than space. Second, models using trajectory-based features should exhibit good modeling performance and few changes in fixed point structure. Analysis of performance of lookup table, feedforward neural network, and Echo State Network (ESN) on the Mountain Car Problem reinforcement learning domain confirm these hypotheses. The ESN is a large, sparse, randomly-generated, unadapted recurrent neural network, which adapts a linear projection of the target domain onto the hidden layer. ESN modeling results on reinforcement learning domains show it achieves performance comparable to lookup table and neural network architectures on the Mountain Car Problem with minimal changes to fixed point structure. Also, the ESN achieves lookup table caliber performance when modeling Acrobot, a four-dimensional control problem, but is less successful modeling the lower dimensional Modified Mountain Car Problem. These performance discrepancies are attributed to the ESN’s excellent ability to represent complex short term dynamics, and its inability to consolidate long temporal dependencies into a static memory. Without memory consolidation, reinforcement learning domains exhibiting attractors with multiple dynamic scales are unlikely to be well-modeled via ESN. To mediate this problem, a simple ESN memory consolidation method is presented and tested for stationary dynamic systems. These results indicate the potential to improve modeling performance in reinforcement learning domains via memory consolidation.
dc.format.medium	doctoral dissertations
dc.identifier	2008_spring_Bush_COMS.pdf
dc.identifier	ETDF2008100001COMS
dc.identifier.uri	http://hdl.handle.net/10217/28682
dc.language	English
dc.language.iso	eng
dc.publisher	Colorado State University. Libraries
dc.relation	Catalog record number (MMS ID): 991009367029703361
dc.relation	Q325.6.B87 2008
dc.relation.ispartof	2000-2019
dc.rights	Copyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright.
dc.subject	reinforcement learning (machine learning)
dc.subject	mountain car problem
dc.subject	reinforcement learning
dc.subject	Markovian
dc.subject	echo state network
dc.subject	ESN
dc.subject	fixed point analysis
dc.subject.lcsh	Hybrid systems
dc.title	An echo state model of non-Markovian reinforcement learning
dc.type	Text
dcterms.rights.dpla	This Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
thesis.degree.discipline	Computer Science
thesis.degree.grantor	Colorado State University
thesis.degree.level	Doctoral
thesis.degree.name	Doctor of Philosophy (Ph.D.)

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 2008_spring_Bush_COMS.pdf
Size:: 5.99 MB
Format:: Adobe Portable Document Format
Description:

Download

Collections

2000-2019
Theses and Dissertations