Normal view MARC view ISBD view

Reinforcement learning : (Record no. 39257)

000 -LEADER
fixed length control field	02962nam a2200493 i 4500
001 - CONTROL NUMBER
control field	6267343
003 - CONTROL NUMBER IDENTIFIER
control field	IEEE
005 - DATE AND TIME OF LATEST TRANSACTION
control field	20190220121646.0
006 - FIXED-LENGTH DATA ELEMENTS--ADDITIONAL MATERIAL CHARACTERISTICS
fixed length control field	m o d
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION
fixed length control field	cr \|n\|\|\|\|\|\|\|\|\|
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field	151223s1998 maua ob 001 eng d
010 ## - LIBRARY OF CONGRESS CONTROL NUMBER
Canceled/invalid LC control number	97026416 (print)
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number	9780262257053
Qualifying information	electronic
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
Canceled/invalid ISBN	0262193981
Qualifying information	alk. paper
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
Canceled/invalid ISBN	9780262193986
Qualifying information	print
035 ## - SYSTEM CONTROL NUMBER
System control number	(CaBNVSL)mat06267343
035 ## - SYSTEM CONTROL NUMBER
System control number	(IDAMS)0b000064818b431d
040 ## - CATALOGING SOURCE
Original cataloging agency	CaBNVSL
Language of cataloging	eng
Description conventions	rda
Transcribing agency	CaBNVSL
Modifying agency	CaBNVSL
050 #4 - LIBRARY OF CONGRESS CALL NUMBER
Classification number	Q325.6
Item number	.S88 1998eb
082 00 - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number	006.3/1
Edition number	21
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name	Sutton, Richard S.,
Relator term	author.
245 10 - TITLE STATEMENT
Title	Reinforcement learning :
Remainder of title	an introduction /
Statement of responsibility, etc.	Richard S. Sutton and Andrew G. Barto.
264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE
Place of production, publication, distribution, manufacture	Cambridge, Massachusetts :
Name of producer, publisher, distributor, manufacturer	MIT Press,
Date of production, publication, distribution, manufacture, or copyright notice	c1998.
264 #2 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE
Place of production, publication, distribution, manufacture	[Piscataqay, New Jersey] :
Name of producer, publisher, distributor, manufacturer	IEEE Xplore,
Date of production, publication, distribution, manufacture, or copyright notice	[1998]
300 ## - PHYSICAL DESCRIPTION
Extent	1 PDF (xviii, 322 pages) :
Other physical details	illustrations.
336 ## - CONTENT TYPE
Content type term	text
Source	rdacontent
337 ## - MEDIA TYPE
Media type term	electronic
Source	isbdmedia
338 ## - CARRIER TYPE
Carrier type term	online resource
Source	rdacarrier
490 1# - SERIES STATEMENT
Series statement	Adaptive computation and machine learning series
504 ## - BIBLIOGRAPHY, ETC. NOTE
Bibliography, etc. note	Includes bibliographical references (p. [291]-312) and index.
506 1# - RESTRICTIONS ON ACCESS NOTE
Terms governing access	Restricted to subscribers or individual electronic text purchasers.
520 ## - SUMMARY, ETC.
Summary, etc.	Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability.The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.
530 ## - ADDITIONAL PHYSICAL FORM AVAILABLE NOTE
Additional physical form available note	Also available in print.
538 ## - SYSTEM DETAILS NOTE
System details note	Mode of access: World Wide Web
588 ## - SOURCE OF DESCRIPTION NOTE
Source of description note	Description based on PDF viewed 12/23/2015.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element	Reinforcement learning.
655 #0 - INDEX TERM--GENRE/FORM
Genre/form data or focus term	Electronic books.
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name	Barto, Andrew G.
710 2# - ADDED ENTRY--CORPORATE NAME
Corporate name or jurisdiction name as entry element	IEEE Xplore (Online Service),
Relator term	distributor.
710 2# - ADDED ENTRY--CORPORATE NAME
Corporate name or jurisdiction name as entry element	MIT Press,
Relator term	publisher.
776 08 - ADDITIONAL PHYSICAL FORM ENTRY
Relationship information	Print version
International Standard Book Number	9780262193986
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE
Uniform title	Adaptive computation and machine learning series
856 42 - ELECTRONIC LOCATION AND ACCESS
Materials specified	Abstract with links to resource
Uniform Resource Identifier	http://ieeexplore.ieee.org/xpl/bkabstractplus.jsp?bkn=6267343

No items available.

IIITB Library

Reinforcement learning : (Record no. 39257)