Obstacle Tower Without Human Demonstrations: How Far A Deep Feed-Forward Network Goes With Reinforcement Learning. (arXiv:2004.00567v1 [cs.LG])

(Ithunyelwe ngomhla ka-1 Ephreli 2020)

Abstract: The Obstacle Tower Challenge is the task to master a procedurally generated
chain of levels that subsequently get harder to complete. Whereas the top 6
performing entries of last year’s competition all used human demonstrations to
learn how to cope with the challenge, we present an approach that performed
competitively (placed 7th) but starts completely from scratch by means of Deep
Reinforcement Learning with a relatively simple feed-forward deep network
structure. We especially look at the generalization performance of the taken
approach concerning different seeds and various visual themes that have become
available after the competition, and investigate where the agent fails and why.
Note that our approach does not possess a short-term memory like employing
recurrent hidden states. With this work, we hope to contribute to a better
understanding of what is possible with a relatively simple, flexible solution
that can be applied to learning in environments featuring complex 3D visual
input where the abstract task structure itself is still fairly simple.

Umlando wokuhambisa

From: Marco Pleines [buka i-imeyili]
[v1]
Wed, 1 Apr 2020 16:55:51 UTC (6,535 KB)

Umthombo: https://arxiv.org/abs/2004.00567

I-Generative Data Intelligence

I-Obstacle Tower Ngaphandle Kwemiboniso Yomuntu: Inethiwekhi Yokuphakela Phambili Ejulile Ihamba Kangakanani Ngokufunda Okuqiniswayo. (arXiv:2004.00567v1 [cs.LG])

Umlando wokuhambisa

Ukubuyekezwa kwe-BingX 2024: Izici, Ama-Cryptos Asekelwe Nezimali Ezikhokhwayo!

I-UK Iqinisa I-Crypto Grip: Umthetho Omusha Ku-Staking kanye ne-Stablecoins, Kanye Nokuhambisana Ne-Web3 kanye Nokushintshashintsha Kwemakethe

Latest Intelligence

I-UK Iqinisa Umthetho We-Crypto: Imithetho Emisha Ye-Staking kanye Ne-Stablecoins Imiselwe uJulayi, Ngenkathi Imboni Isungula Futhi Ijwayela

I-UK Gears Up For Crypto Evolution: Imithetho Emisha Ye-Staking kanye Ne-Stablecoin Imiselwe Ukukhishwa KwangoJulayi

I-UK Iqinisa Umthetho We-Crypto: Imithetho Emisha Ye-Staking kanye Ne-Stablecoins Imiselwe Ukukhishwa KwangoJulayi

I-UK Iqinisa I-Crypto Grip: Imithetho Emisha Ye-Staking kanye Ne-Stablecoin Izosebenza NgoJulayi Phakathi Nokuhambisana Kwe-Web3, I-Memecoin Surges, kanye ne-Global Crypto Dynamics

I-UK Ibeka Izifundo Zokulawula ze-Crypto: Umthetho Omusha We-Staking kanye ne-Stablecoins Uzoqala Ukusebenza ngoJulayi

I-UK Iqinisa I-Crypto Grip: Imithetho Emisha Ye-Staking kanye Ne-Stablecoins Isethelwe uJulayi njengokuhambisana ne-Web3 kanye nezimakethe ze-Memecoin Yenza Izihloko