Reinforcement learning : (Record no. 39257)
000 - LEADER | |
---|---|
fixed length control field | 02962nam a2200493 i 4500 |
001 - CONTROL NUMBER | |
control field | 6267343 |
003 - CONTROL NUMBER IDENTIFIER | |
control field | IEEE |
005 - DATE AND TIME OF LATEST TRANSACTION | |
control field | 20190220121646.0 |
006 - FIXED-LENGTH DATA ELEMENTS--ADDITIONAL MATERIAL CHARACTERISTICS | |
fixed length control field | m o d |
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION | |
fixed length control field | cr |n||||||||| |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION | |
fixed length control field | 151223s1998 maua ob 001 eng d |
010 ## - LIBRARY OF CONGRESS CONTROL NUMBER | |
Canceled/invalid LC control number | 97026416 (print) |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 9780262257053 |
Qualifying information | electronic |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
Canceled/invalid ISBN | 0262193981 |
Qualifying information | alk. paper |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
Canceled/invalid ISBN | 9780262193986 |
Qualifying information | |
035 ## - SYSTEM CONTROL NUMBER | |
System control number | (CaBNVSL)mat06267343 |
035 ## - SYSTEM CONTROL NUMBER | |
System control number | (IDAMS)0b000064818b431d |
040 ## - CATALOGING SOURCE | |
Original cataloging agency | CaBNVSL |
Language of cataloging | eng |
Description conventions | rda |
Transcribing agency | CaBNVSL |
Modifying agency | CaBNVSL |
050 #4 - LIBRARY OF CONGRESS CALL NUMBER | |
Classification number | Q325.6 |
Item number | .S88 1998eb |
082 00 - DEWEY DECIMAL CLASSIFICATION NUMBER | |
Classification number | 006.3/1 |
Edition number | 21 |
100 1# - MAIN ENTRY--PERSONAL NAME | |
Personal name | Sutton, Richard S., |
Relator term | author. |
245 10 - TITLE STATEMENT | |
Title | Reinforcement learning : |
Remainder of title | an introduction / |
Statement of responsibility, etc. | Richard S. Sutton and Andrew G. Barto. |
264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE | |
Place of production, publication, distribution, manufacture | Cambridge, Massachusetts : |
Name of producer, publisher, distributor, manufacturer | MIT Press, |
Date of production, publication, distribution, manufacture, or copyright notice | c1998. |
264 #2 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE | |
Place of production, publication, distribution, manufacture | [Piscataway, New Jersey] : |
Name of producer, publisher, distributor, manufacturer | IEEE Xplore, |
Date of production, publication, distribution, manufacture, or copyright notice | [1998] |
300 ## - PHYSICAL DESCRIPTION | |
Extent | 1 PDF (xviii, 322 pages) : |
Other physical details | illustrations. |
336 ## - CONTENT TYPE | |
Content type term | text |
Source | rdacontent |
337 ## - MEDIA TYPE | |
Media type term | electronic |
Source | isbdmedia |
338 ## - CARRIER TYPE | |
Carrier type term | online resource |
Source | rdacarrier |
490 1# - SERIES STATEMENT | |
Series statement | Adaptive computation and machine learning series |
504 ## - BIBLIOGRAPHY, ETC. NOTE | |
Bibliography, etc. note | Includes bibliographical references (p. [291]-312) and index. |
506 1# - RESTRICTIONS ON ACCESS NOTE | |
Terms governing access | Restricted to subscribers or individual electronic text purchasers. |
520 ## - SUMMARY, ETC. | |
Summary, etc. | Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability. The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning. |
530 ## - ADDITIONAL PHYSICAL FORM AVAILABLE NOTE | |
Additional physical form available note | Also available in print. |
538 ## - SYSTEM DETAILS NOTE | |
System details note | Mode of access: World Wide Web |
588 ## - SOURCE OF DESCRIPTION NOTE | |
Source of description note | Description based on PDF viewed 12/23/2015. |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Reinforcement learning. |
655 #0 - INDEX TERM--GENRE/FORM | |
Genre/form data or focus term | Electronic books. |
700 1# - ADDED ENTRY--PERSONAL NAME | |
Personal name | Barto, Andrew G. |
710 2# - ADDED ENTRY--CORPORATE NAME | |
Corporate name or jurisdiction name as entry element | IEEE Xplore (Online Service), |
Relator term | distributor. |
710 2# - ADDED ENTRY--CORPORATE NAME | |
Corporate name or jurisdiction name as entry element | MIT Press, |
Relator term | publisher. |
776 08 - ADDITIONAL PHYSICAL FORM ENTRY | |
Relationship information | Print version |
International Standard Book Number | 9780262193986 |
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE | |
Uniform title | Adaptive computation and machine learning series |
856 42 - ELECTRONIC LOCATION AND ACCESS | |
Materials specified | Abstract with links to resource |
Uniform Resource Identifier | http://ieeexplore.ieee.org/xpl/bkabstractplus.jsp?bkn=6267343 |