[R] Towards A Unified Agent with Foundation Models – Google DeepMind, ICLR23, July 2023 – LLM + RL leads to substantial performance improvements!

Jul 22, 2023

—

[ad_1]

Abstract:

Language Models and Vision Language Models have recently demonstrated unprecedented capabilities in terms of understanding human intentions, reasoning, scene understanding, and planning-like behaviour, in text form, among many others. In this work, we investigate how to embed and leverage such abilities in Reinforcement Learning (RL) agents. We design a framework that uses language as the core reasoning tool, exploring how this enables an agent to tackle a series of fundamental RL challenges, such as efficient exploration, reusing experience data, scheduling skills, and learning from observations, which traditionally require separate, vertically designed algorithms. We test our method on a sparse-reward simulated robotic manipulation environment, where a robot needs to stack a set of objects. We demonstrate substantial performance improvements over baselines in exploration efficiency and ability to reuse data from offline datasets, and illustrate how to reuse learned skills to solve novel tasks or imitate videos of human experts.

https://preview.redd.it/voehn3aa3ddb1.jpg?width=1101&format=pjpg&auto=webp&s=c367c7b1042d11b3e2a2b2109c95482f8555747b

https://preview.redd.it/6ei186aa3ddb1.jpg?width=617&format=pjpg&auto=webp&s=10e1928769da9552aabdcf084b45f5e6be2ec97e

https://preview.redd.it/umg3b7aa3ddb1.jpg?width=1353&format=pjpg&auto=webp&s=2be83b87e6b3553c6d1770a579f9a9aa69c238dd

https://preview.redd.it/ushea8aa3ddb1.jpg?width=1661&format=pjpg&auto=webp&s=67edddd76c0cdde67c0e9502fd76fbc1a9247946

submitted by /u/Singularian2501
[comments]

[ad_2]

Source link