Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi

Reinforcement learning for control : Performance, stability, and deep approximators (CROSBI ID 255667)

Prilog u časopisu | pregledni rad (znanstveni) | međunarodna recenzija

Buşoniu, Lucian ; de Bruin, Tim ; Tolić, Domagoj ; Kober, Jens ; Palunko, Ivana Reinforcement learning for control : Performance, stability, and deep approximators // Annual reviews in control, 46 (2018), 8-28. doi: 10.1016/j.arcontrol.2018.09.005

Podaci o odgovornosti

Buşoniu, Lucian ; de Bruin, Tim ; Tolić, Domagoj ; Kober, Jens ; Palunko, Ivana

engleski

Reinforcement learning for control : Performance, stability, and deep approximators

Reinforcement learning (RL) offers powerful algorithms to search for optimal controllers of systems with nonlinear, possibly stochastic dynamics that are unknown or highly uncertain. This review mainly cov- ers artificial-intelligence approaches to RL, from the viewpoint of the control engineer. We explain how approximate representations of the solution make RL feasible for problems with continuous states and control actions. Stability is a central concern in control, and we argue that while the control-theoretic RL subfield called adaptive dynamic programming is dedicated to it, stability of RL largely remains an open question. We also cover in detail the case where deep neural networks are used for approxima- tion, leading to the field of deep RL, which has shown great success in recent years. With the control practitioner in mind, we outline opportunities and pitfalls of deep RL ; and we close the survey with an outlook that –among other things –points out some avenues for bridging the gap between control and artificial-intelligence RL techniques.

Reinforcement learning ; Optimal control ; Deep learning ; Stability ; Function approximation ; Adaptive dynamic programming

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o izdanju

46

2018.

8-28

objavljeno

1367-5788

10.1016/j.arcontrol.2018.09.005

Povezanost rada

Elektrotehnika, Računarstvo

Poveznice
Indeksiranost