Record ID | marc_columbia/Columbia-extract-20221130-030.mrc:219393010:2519 |
Source | marc_columbia |
Download Link | /show-records/marc_columbia/Columbia-extract-20221130-030.mrc:219393010:2519?format=raw |
LEADER: 02519cam a2200469 i 4500
001 14980872
005 20201012111641.0
008 190829t20192019maua b 001 0 eng
035 $a(OCoLC)on1114332024
040 $aAAA$beng$erda$cAAA$dAAA$dOCLCO$dYDX$dIXA$dOCLCF$dEAU$dOSU
019 $a1112277566
020 $a9781886529397$q(hardcover)
020 $a1886529396$q(hardcover)
035 $a(OCoLC)1114332024$z(OCoLC)1112277566
042 $apcc
050 4 $aQA402.5$b.B465 2019
050 4 $aQ325.6$b.B47 2019
049 $aZCUA
100 1 $aBertsekas, Dimitri P.,$eauthor.
245 10 $aReinforcement learning and optimal control /$cby Dimitri P. Bertsekas.
264 1 $aBelmont, Massachusetts :$bAthena Scientific,$c[2019]
264 4 $c©2019
300 $axiii, 373 pages :$billustrations ;$c25 cm.
336 $atext$btxt$2rdacontent
337 $aunmediated$bn$2rdamedia
338 $avolume$bnc$2rdacarrier
490 1 $aAthena Scientific optimization and computation series
504 $aIncludes bibliographical references (pages 345-367) and index.
505 0 $a1. Exact Dynamic Programming -- 2. Approximation in Value Space -- 3. Parametric Approximation -- 4. Infinite Horizon Dynamic Programming -- 5. Infinite Horizon Reinforcement Learning -- 6. Aggregation.
520 $aThis book explores the common boundary between optimal control and artificial intelligence, as it relates to reinforcement learning and simulation-based neural network methods. These are popular fields with many applications, which can provide approximate solutions to challenging sequential decision problems and large-scale dynamic programming (DP). The aim of the book is to organize coherently the broad mosaic of methods in these fields, which have a solid analytical and logical foundation, and have also proved successful in practice--back cover.
650 0 $aReinforcement learning.
650 0 $aDynamic programming.
650 0 $aNeural networks (Computer science)
650 0 $aMathematical optimization.
650 0 $aArtificial intelligence.
650 7 $aArtificial intelligence.$2fast$0(OCoLC)fst00817247
650 7 $aDynamic programming.$2fast$0(OCoLC)fst00900291
650 7 $aMathematical optimization.$2fast$0(OCoLC)fst01012099
650 7 $aNeural networks (Computer science)$2fast$0(OCoLC)fst01036260
650 7 $aReinforcement learning.$2fast$0(OCoLC)fst01732553
830 0 $aAthena Scientific optimization and computation series.
852 0 $bsci$hQ325.6$i.B47 2019g