An intrinsic motivation based artificial goal generation in on-policy continuous control

Sağlam, Baturay; Mutlu, Furkan B.; Gönç, Kaan; Dalmaz, Onat; Kozat, Süleyman S.

An intrinsic motivation based artificial goal generation in on-policy continuous control

buir.contributor.author	Sağlam, Baturay
buir.contributor.author	Mutlu, Furkan B.
buir.contributor.author	Gönç, Kaan
buir.contributor.author	Dalmaz, Onat
buir.contributor.author	Kozat, Süleyman S.
buir.contributor.orcid	Sağlam, Baturay\|0000-0002-8324-5980
buir.contributor.orcid	Kozat, Süleyman S.\|0000-0002-6488-3848
dc.citation.epage	[4]	en_US
dc.citation.spage	[1]	en_US
dc.contributor.author	Sağlam, Baturay
dc.contributor.author	Mutlu, Furkan B.
dc.contributor.author	Gönç, Kaan
dc.contributor.author	Dalmaz, Onat
dc.contributor.author	Kozat, Süleyman S.
dc.coverage.spatial	Safranbolu, Turkey	en_US
dc.date.accessioned	2023-02-15T11:06:21Z
dc.date.available	2023-02-15T11:06:21Z
dc.date.issued	2022-08-29
dc.department	Department of Computer Engineering	en_US
dc.department	Department of Electrical and Electronics Engineering	en_US
dc.description	Conference Name: 2022 30th Signal Processing and Communications Applications Conference (SIU)	en_US
dc.description	Date of Conference: 15-18 May 2022	en_US
dc.description.abstract	This work adapts the existing theories on animal motivational systems into the reinforcement learning (RL) paradigm to constitute a directed exploration strategy in on-policy continuous control. We introduce a novel and scalable artificial bonus reward rule that encourages agents to visit useful state spaces. By unifying the intrinsic incentives in the reinforcement learning paradigm under the introduced deterministic reward rule, our method forces the value function to learn the values of unseen or less-known states and prevent premature behavior before sufficiently learning the environment. The simulation results show that the proposed algorithm considerably improves the state-of-the-art on-policy methods and improves the inherent entropy-based exploration.	en_US
dc.description.abstract	Bu çalışma, politikaya dayalı sürekli kontrolde yönlendirilmiş bir keşif stratejisi oluşturmak için hayvan motivasyon sistemleri hakkındaki mevcut teorileri pekiştirmeli ögrenme (RL) paradigmasına uyarlamaktadır. Ajanları faydalı durum alanlarını ziyaret etmeye teşvik eden yeni ve ölçeklenebilir bir yapay bonus ödül kuralı sunulmaktadır. Pekiştirmeli ögrenme paradigmasındaki içsel teşvikleri, tanıtılan deterministik ödül kuralı altında birleştirerek değer işlevini, görülmeyen veya daha az bilinen durum degerlerini öğrenmeye ve çevreyi yeterince öğrenmeden önce erken davranışı önlemeye zorlamaktadır. Simülasyon sonuçları, önerilen algoritmanın literatürdeki en iyi sonuçları veren politikaya dayalı yöntemleri önemli ölçüde geliştirdiğini ve içsel entropi tabanlı keşfi iyileştirdiğini göstermektedir.
dc.identifier.doi	10.1109/SIU55565.2022.9864957	en_US
dc.identifier.eisbn	978-1-6654-5092-8	en_US
dc.identifier.issn	2165-0608	en_US
dc.identifier.uri	http://hdl.handle.net/11693/111333	en_US
dc.language.iso	Turkish	en_US
dc.publisher	IEEE	en_US
dc.relation.isversionof	https://www.doi.org/10.1109/SIU55565.2022.9864957	en_US
dc.source.title	Signal Processing and Communications Applications Conference (SIU)	en_US
dc.subject	Deep reinforcement learning	en_US
dc.subject	Exploration	en_US
dc.subject	Intrinsic motivation	en_US
dc.subject	Continuous control	en_US
dc.subject	On-policy learning	en_US
dc.subject	Derin pekiştirmeli öğrenme	en_US
dc.subject	Keşif	en_US
dc.subject	İçsel motivasyon	en_US
dc.subject	Sürekli kontrol	en_US
dc.subject	Politikaya dayalı öğrenme	en_US
dc.title	An intrinsic motivation based artificial goal generation in on-policy continuous control	en_US
dc.title.alternative	Politikaya dayalı sürekli kontrolde içsel motivasyona dayalı yapay hedef oluşturma	en_US
dc.type	Conference Paper	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: An_Intrinsic_Motivation_Based_Artificial_Goal_Generation_in_On-Policy_Continuous_Control.pdf
Size:: 3.3 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.69 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Scholarly Publications - Electrical and Electronics Engineering
Scholarly Publications - Computer Engineering