000 02962nam a2200493 i 4500
001 6267343
003 IEEE
005 20190220121646.0
006 m o d
007 cr |n|||||||||
008 151223s1998 maua ob 001 eng d
010 _z 97026416 (print)
020 _a9780262257053
_qelectronic
020 _z0262193981
_qalk. paper
020 _z9780262193986
_qprint
035 _a(CaBNVSL)mat06267343
035 _a(IDAMS)0b000064818b431d
040 _aCaBNVSL
_beng
_erda
_cCaBNVSL
_dCaBNVSL
050 4 _aQ325.6
_b.S88 1998eb
082 0 0 _a006.3/1
_221
100 1 _aSutton, Richard S.,
_eauthor.
245 1 0 _aReinforcement learning :
_ban introduction /
_cRichard S. Sutton and Andrew G. Barto.
264 1 _aCambridge, Massachusetts :
_bMIT Press,
_cc1998.
264 2 _a[Piscataqay, New Jersey] :
_bIEEE Xplore,
_c[1998]
300 _a1 PDF (xviii, 322 pages) :
_billustrations.
336 _atext
_2rdacontent
337 _aelectronic
_2isbdmedia
338 _aonline resource
_2rdacarrier
490 1 _aAdaptive computation and machine learning series
504 _aIncludes bibliographical references (p. [291]-312) and index.
506 1 _aRestricted to subscribers or individual electronic text purchasers.
520 _aReinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability.The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.
530 _aAlso available in print.
538 _aMode of access: World Wide Web
588 _aDescription based on PDF viewed 12/23/2015.
650 0 _aReinforcement learning.
655 0 _aElectronic books.
700 1 _aBarto, Andrew G.
710 2 _aIEEE Xplore (Online Service),
_edistributor.
710 2 _aMIT Press,
_epublisher.
776 0 8 _iPrint version
_z9780262193986
830 0 _aAdaptive computation and machine learning series
856 4 2 _3Abstract with links to resource
_uhttp://ieeexplore.ieee.org/xpl/bkabstractplus.jsp?bkn=6267343
999 _c39257
_d39257