Q-learning by the nth step state and multi-agent negotiation in unknown environment

Job, Josip; Jović, Franjo; Livada, Časlav

izvor podataka: crosbi ✓

Q-learning by the nth step state and multi-agent negotiation in unknown environment (CROSBI ID 186495)

Prilog u časopisu | izvorni znanstveni rad | međunarodna recenzija

Job, Josip ; Jović, Franjo ; Livada, Časlav Q-learning by the nth step state and multi-agent negotiation in unknown environment // Tehnički vjesnik : znanstveno-stručni časopis tehničkih fakulteta Sveučilišta u Osijeku, 19 (2012), 3; 529-534

Podaci o odgovornosti

Autori

Job, Josip ; Jović, Franjo ; Livada, Časlav

Osnovni podaci na izvornom jeziku
Osnovni podaci na ostalim jezicima

Jezik

engleski

Naslov

Q-learning by the nth step state and multi-agent negotiation in unknown environment

Sažetak

This work will show a new procedure of Q-learning in which the agent’s decision, regarding the next step, is not based on the optimal action at that moment but on the usefulness of a future state. A near agent communication has been implemented so that the agents signal each other their future actions which contribute to a better choice of actions for each of the agents. The new method is named Q-learning by the nth step and multi-agent negotiation. The results of the testing of this algorithm are compared with the basic QL algorithm which is also graphically demonstrated and the advantages of the new algorithm are listed too. An average of 40 % collision decrease is obtained during learning procedure.

Ključne riječi

agent; learning from reward and punishment; q-learning; reinforcement learning

Napomena

nije evidentirano

Jezik

nije evidentirano

Naslov

nije evidentirano

Sažetak

nije evidentirano

Ključne riječi

nije evidentirano

Napomena

nije evidentirano

Podaci o izdanju

Časopis

Tehnički vjesnik : znanstveno-stručni časopis tehničkih fakulteta Sveučilišta u Osijeku

Volumen (broj)

19 (3)

Godina

2012.

Stranice rada

529-534

Status objave rada

objavljeno

ISSN

1330-3651

Povezanost rada

Povezane osobe

Franjo Jović (autor/i)

Časlav Livada (autor/i)

Josip Job (autor/i)

Povezane ustanove

Fakultet elektrotehnike, računarstva i informacijskih tehnologija Osijek (165) (autorova ustanova)

Područje

Računarstvo

Poveznice

hrcak.srce.hr

Indeksiranost

Scopus

Web of Science Core Collection, Science Citation Index Expanded (WoSCC-SCI-Exp)

Web of Science Core Collection, SCI-Exp, SSCI & A&HCI (WoSCC-SCI-Exp, SSCI, A&HCI)