An edition of Algorithms for reinforcement learning (2010)

Algorithms for reinforcement learning

by Csaba Szepesvári

0 Ratings
1 Want to read
0 Currently reading
0 Have read

Not in Library

My Reading Lists:

Use this Work

Create a new list

0 Ratings
1 Want to read
0 Currently reading
0 Have read

Check nearby libraries

Buy this book

Last edited by ImportBot

February 26, 2022 | History

Edit

An edition of Algorithms for reinforcement learning (2010)

Algorithms for reinforcement learning

by Csaba Szepesvári

0 Ratings
1 Want to read
0 Currently reading
0 Have read

Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book, we focus on those algorithms of reinforcement learning that build on the powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning problems, describe the core ideas, note a large number of state of the art algorithms, followed by the discussion of their theoretical properties and limitations.

Publish Date

2010

Publisher

Morgan & Claypool

Language

English

Check nearby libraries

Buy this book

Previews available in: English

Showing 1 featured edition. View all 1 editions?

Edition	Availability
1 Algorithms for reinforcement learning 2010, Morgan & Claypool electronic resource / in English 1608454932 9781608454938	aaaa Not in Library Libraries near you: WorldCat

Add another edition?

Book Details

1. Markov decision processes

Preliminaries

Markov decision processes

Value functions

Dynamic programming algorithms for solving MDPs

2. Value prediction problems

Temporal difference learning in finite state spaces

Tabular TD(0)

Every-visit Monte-Carlo

TD([lambda]): unifying Monte-Carlo and TD(0)

Algorithms for large state spaces

TD([lambda]) with function approximation

Gradient temporal difference learning

Least-squares methods

The choice of the function space

3. Control

A catalog of learning problems

Closed-loop interactive learning

Online learning in bandits

Active learning in bandits

Active learning in Markov decision processes

Online learning in Markov decision processes

Direct methods

Q-learning in finite MDPs

Q-learning with function approximation

Actor-critic methods

Implementing a critic

Implementing an actor

4. For further exploration

Edition Notes

Part of: Synthesis digital library of engineering and computer science.

Title from PDF t.p. (viewed on July 13, 2010).

Series from website.

Includes bibliographical references (p. 73-88).

Abstract freely available; full-text restricted to subscribers or individual document purchasers.

Also available in print.

Mode of access: World Wide Web.

System requirements: Adobe Acrobat Reader.

Published in: San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA)
Series: Synthesis lectures on artificial intelligence and machine learning -- # 9
Other Titles: Synthesis digital library of engineering and computer science.

Classifications

Dewey Decimal Class: 006.31
Library of Congress: Q325.6 .S942 2010

The Physical Object

Format: [electronic resource] /

ID Numbers

Open Library: OL25556666M
Internet Archive: algorithmsforrei00szep
ISBN 13: 9781608454938, 9781608454921

Source records

Internet Archive item record
Better World Books record
Better World Books record

Community Reviews (0)

Feedback?

No community reviews have been submitted for this work.

Lists

This work does not appear on any lists.

History

Created July 29, 2014
2 revisions

Download catalog record: RDF / JSON

February 26, 2022	Edited by ImportBot	import existing book
July 29, 2014	Created by ImportBot	import new book

Algorithms for reinforcement learning

by Csaba Szepesvári