USAK method for the reinforcement learning

Novotarskyi, Mykhailo; Kuzmich, Valentin

USAK method for the reinforcement learning

Файли

216042-488193-1-10-20201111.pdf (799.66 KB)

Дата

2020

Автори

Novotarskyi, Mykhailo

Kuzmich, Valentin

Видавець

National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"

Анотація

In the field of reinforcement learning, tabular methods have become widespread. There are many important scientific results, which significantly improve their performance in specific applications. However, the application of tabular methods is limited due to the large amount of resources required to store value functions in tabular form under high-dimensional state spaces. A natural solution to the memory problem is to use parameterized function approximations. However, conventional approaches to function approximations, in most cases, have ceased to give the desired result of memory reduction in solving real world problems. This fact became the basis for the application of new approaches, one of which is the use of Sparse Distributed Memory (SDM) based on Kanerva coding. A further development of this direction was the method of Similarity-Aware Kanerva (SAK). In this paper, a modification of the SAK method is proposed, the Uniform Similarity-Aware Kanerva (USAK) method, which is based on the uniform distribution of prototypes in the state space. This approach has reduced the use of RAM required to store prototypes. In addition, reducing the receptive distance of each of the prototypes made it possible to increase the learning speed by reducing the number of calculations in the linear approximator.

Ключові слова

reinforcement learning, Kanerva coding, function approximation, prototype, value function

Бібліографічний опис

Novotarskyi, M. USAK method for the reinforcement learning / Novotarskyi Mykhailo, Valentin Kuzmich // Information, Computing and Intelligent systems. – 2020. – No. 1. – Pp. 4–14. – Bibliogr.: 19 ref.