How Crucial is Transformer in Decision Transformer?

11/26/2022
by Max Siebenborn, et al.

Decision Transformer (DT) is a recently proposed architecture for Reinforcement Learning that frames the decision-making process as an auto-regressive sequence modeling problem and uses a Transformer model to predict the next action in a sequence of states, actions, and rewards. In this paper, we analyze how crucial the Transformer model is in the complete DT architecture on continuous control tasks. Namely, we replace the Transformer by an LSTM model while keeping the other parts unchanged to obtain what we call a Decision LSTM model. We compare it to DT on continuous control tasks, including pendulum swing-up and stabilization, in simulation and on physical hardware. Our experiments show that DT struggles with continuous control problems, such as inverted pendulum and Furuta pendulum stabilization. On the other hand, the proposed Decision LSTM is able to achieve expert-level performance on these tasks, in addition to learning a swing-up controller on the real system. These results suggest that the strength of the Decision Transformer for continuous control tasks may lie in the overall sequential modeling architecture and not in the Transformer per se.
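The sequence-modeling framing described above can be illustrated with a minimal sketch of how a trajectory might be turned into an autoregressive token sequence. This is an assumption-laden illustration, not the authors' code: conditioning on returns-to-go (the sum of future rewards) follows the original Decision Transformer formulation rather than anything spelled out in this abstract, and the names `returns_to_go`, `interleave`, and `training_pairs` are hypothetical. The sequence model itself (a Transformer in DT, an LSTM in Decision LSTM) is deliberately left out, since the point of the comparison is that this surrounding pipeline stays the same.

```python
from typing import List, Sequence, Tuple

# A token is a (kind, value) pair; kind is "rtg", "state", or "action".
Token = Tuple[str, object]


def returns_to_go(rewards: Sequence[float]) -> List[float]:
    """Sum of rewards from each timestep to the end of the trajectory."""
    rtg: List[float] = []
    running = 0.0
    for r in reversed(rewards):
        running += r
        rtg.append(running)
    rtg.reverse()
    return rtg


def interleave(
    states: Sequence[object],
    actions: Sequence[object],
    rewards: Sequence[float],
) -> List[Token]:
    """Build the sequence (rtg_0, s_0, a_0, rtg_1, s_1, a_1, ...)."""
    if not (len(states) == len(actions) == len(rewards)):
        raise ValueError("states, actions, and rewards must have equal length")
    tokens: List[Token] = []
    for g, s, a in zip(returns_to_go(rewards), states, actions):
        tokens.extend([("rtg", g), ("state", s), ("action", a)])
    return tokens


def training_pairs(tokens: Sequence[Token]) -> List[Tuple[List[Token], object]]:
    """Pair each action with the context that precedes it in the sequence.

    The context for action a_t ends at the state token s_t, so a sequence
    model trained on these pairs learns to predict the next action.
    """
    pairs: List[Tuple[List[Token], object]] = []
    for i, (kind, value) in enumerate(tokens):
        if kind == "action":
            pairs.append((list(tokens[:i]), value))
    return pairs
```

Under this sketch, swapping the Transformer for an LSTM changes only the model that maps each context to a predicted action; the token construction and the prediction targets are untouched.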

