Adaptive Multifactorial Evolutionary Optimization for Multitask Reinforcement Learning

Aritz D. Martinez*, Javier Del Ser, Eneko Osaba, Francisco Herrera

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

26 Citations (Scopus)

Abstract

Evolutionary computation has largely exhibited its potential to complement conventional learning algorithms in a variety of machine learning tasks, especially those related to unsupervised (clustering) and supervised learning. It has not been until lately when the computational efficiency of evolutionary solvers has been put in prospective for training reinforcement learning models. However, most studies framed so far within this context have considered environments and tasks conceived in isolation, without any exchange of knowledge among related tasks. In this manuscript we present A-MFEA-RL, an adaptive version of the well-known MFEA algorithm whose search and inheritance operators are tailored for multitask reinforcement learning environments. Specifically, our approach includes crossover and inheritance mechanisms for refining the exchange of genetic material, which rely on the multilayered structure of modern deep-learning-based reinforcement learning models. In order to assess the performance of the proposed approach, we design an extensive experimental setup comprising multiple reinforcement learning environments of varying levels of complexity, over which the performance of A-MFEA-RL is compared to that furnished by alternative nonevolutionary multitask reinforcement learning approaches. As concluded from the discussion of the obtained results, A-MFEA-RL not only achieves competitive success rates over the simultaneously addressed tasks, but also fosters the exchange of knowledge among tasks that could be intuitively expected to keep a degree of synergistic relationship.

Original languageEnglish
Pages (from-to)233-247
Number of pages15
JournalIEEE Transactions on Evolutionary Computation
Volume26
Issue number2
DOIs
Publication statusPublished - 1 Apr 2022

Funding

The work of Aritz D. Martinez and Eneko Osaba was supported by the Basque Government through the ELKARTEK Program (3KIA Project) under Grant KK-2020/00049. The work of Javier Del Ser was supported in part by the Basque Government through the ELKARTEK Program (3KIA Project) under Grant KK-2020/00049, and in part by the Consolidated Research Group MATHMODE (IT1294-19) granted by the Department of Education of the Basque Government. The work of Francisco Herrera was supported in part by the Spanish Government through (SMART-DaSCI) under Grant TIN2017-89517-P, and in part by the BBVA Foundation through Ayudas Fundacion BBVA a Equipos de Investigacion Científica 2018 call (DeepSCOP)

FundersFunder number
Department of Education of the Basque Government
SMART-DaSCITIN2017-89517-P
Spanish Government
Fundación BBVA
Eusko JaurlaritzaIT1294-19, KK-2020/00049

    Keywords

    • Evolutionary multitasking
    • multifactorial optimization (MFO)
    • multitask reinforcement learning
    • neuroevolution (NE)

    Fingerprint

    Dive into the research topics of 'Adaptive Multifactorial Evolutionary Optimization for Multitask Reinforcement Learning'. Together they form a unique fingerprint.

    Cite this