Reinforcement Learning

MDP(Markov Decision Process)

Today, let's talk about the foundation stone of RL. You can refer to this to grasp basic understanding of RL. Introduction to MDP Markov Decision Process (MDP) is the fundamental of

Transformer: Self-Attention

Recently, one of my friends used LSTM with PPO to train a robot in a simulation aimed at solving a collection task. With a basic understanding of RNNs and LSTMs—an optimized form