MARC View

000			02962nam a2200493 i 4500
001			6267343
003			IEEE
005			20190220121646.0
006			m o d
007			cr \|n\|\|\|\|\|\|\|\|\|
008			151223s1998 maua ob 001 eng d
010			_z 97026416 (print)
020			_a9780262257053 _qelectronic
020			_z0262193981 _qalk. paper
020			_z9780262193986 _qprint
035			_a(CaBNVSL)mat06267343
035			_a(IDAMS)0b000064818b431d
040			_aCaBNVSL _beng _erda _cCaBNVSL _dCaBNVSL
050		4	_aQ325.6 _b.S88 1998eb
082	0	0	_a006.3/1 _221
100	1		_aSutton, Richard S., _eauthor.
245	1	0	_aReinforcement learning : _ban introduction / _cRichard S. Sutton and Andrew G. Barto.
264		1	_aCambridge, Massachusetts : _bMIT Press, _cc1998.
264		2	_a[Piscataqay, New Jersey] : _bIEEE Xplore, _c[1998]
300			_a1 PDF (xviii, 322 pages) : _billustrations.
336			_atext _2rdacontent
337			_aelectronic _2isbdmedia
338			_aonline resource _2rdacarrier
490	1		_aAdaptive computation and machine learning series
504			_aIncludes bibliographical references (p. [291]-312) and index.
506	1		_aRestricted to subscribers or individual electronic text purchasers.
520			_aReinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability.The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.
530			_aAlso available in print.
538			_aMode of access: World Wide Web
588			_aDescription based on PDF viewed 12/23/2015.
650		0	_aReinforcement learning.
655		0	_aElectronic books.
700	1		_aBarto, Andrew G.
710	2		_aIEEE Xplore (Online Service), _edistributor.
710	2		_aMIT Press, _epublisher.
776	0	8	_iPrint version _z9780262193986
830		0	_aAdaptive computation and machine learning series
856	4	2	_3Abstract with links to resource _uhttp://ieeexplore.ieee.org/xpl/bkabstractplus.jsp?bkn=6267343
999			_c39257 _d39257