Algorithms for reinforcement learning

  • 1 Want to read

My Reading Lists:

Create a new list

Check-In

×Close
Add an optional check-in date. Check-in dates are used to track yearly reading goals.
Today

  • 1 Want to read


Download Options

Buy this book

Last edited by ImportBot
February 26, 2022 | History

Algorithms for reinforcement learning

  • 1 Want to read

Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book, we focus on those algorithms of reinforcement learning that build on the powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning problems, describe the core ideas, note a large number of state of the art algorithms, followed by the discussion of their theoretical properties and limitations.

Publish Date
Publisher
Morgan & Claypool
Language
English

Buy this book

Previews available in: English

Edition Availability
Cover of: Algorithms for reinforcement learning
Algorithms for reinforcement learning
2010, Morgan & Claypool
electronic resource / in English

Add another edition?

Book Details


Table of Contents

1. Markov decision processes
Preliminaries
Markov decision processes
Value functions
Dynamic programming algorithms for solving MDPs
2. Value prediction problems
Temporal difference learning in finite state spaces
Tabular TD(0)
Every-visit Monte-Carlo
TD([lambda]): unifying Monte-Carlo and TD(0)
Algorithms for large state spaces
TD([lambda]) with function approximation
Gradient temporal difference learning
Least-squares methods
The choice of the function space
3. Control
A catalog of learning problems
Closed-loop interactive learning
Online learning in bandits
Active learning in bandits
Active learning in Markov decision processes
Online learning in Markov decision processes
Direct methods
Q-learning in finite MDPs
Q-learning with function approximation
Actor-critic methods
Implementing a critic
Implementing an actor
4. For further exploration
Further reading
Applications
Software
A. The theory of discounted Markovian decision processes
A.1. Contractions and Banach's fixed-point theorem
A.2. Application to MDPs
Bibliography
Author's biography.

Edition Notes

Part of: Synthesis digital library of engineering and computer science.

Title from PDF t.p. (viewed on July 13, 2010).

Series from website.

Includes bibliographical references (p. 73-88).

Abstract freely available; full-text restricted to subscribers or individual document purchasers.

Also available in print.

Mode of access: World Wide Web.

System requirements: Adobe Acrobat Reader.

Published in
San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA)
Series
Synthesis lectures on artificial intelligence and machine learning -- # 9
Other Titles
Synthesis digital library of engineering and computer science.

Classifications

Dewey Decimal Class
006.31
Library of Congress
Q325.6 .S942 2010

The Physical Object

Format
[electronic resource] /

ID Numbers

Open Library
OL25556666M
Internet Archive
algorithmsforrei00szep
ISBN 13
9781608454938, 9781608454921

Community Reviews (0)

No community reviews have been submitted for this work.

Lists

This work does not appear on any lists.

History

Download catalog record: RDF / JSON
February 26, 2022 Edited by ImportBot import existing book
July 29, 2014 Created by ImportBot import new book