Reinforcement Learning - CSYYYSC (Page 4)

Bellman Equation - Policy Iteration

06 Jun 2025 4 min read Markov Decision Process

This morning, let's talk how to find the optimal policy of MDP(Markov Decision Process) problems. Yet, There is one important thing to note is that the problem can be divided

MDP(Markov Decision Process)

04 Jun 2025 3 min read Reinforcement Learning

Today, let's talk about the foundation stone of RL. You can refer to this to grasp basic understanding of RL. Introduction to MDP Markov Decision Process (MDP) is the fundamental of

Transformer: Self-Attention

04 Jun 2025 4 min read Transformer

Recently, one of my friends used LSTM with PPO to train a robot in a simulation aimed at solving a collection task. With a basic understanding of RNNs and LSTMs—an optimized form

Reinforcement Learning Introduction

03 Jun 2025 2 min read Reinforcement Learning

The first time I heard about Reinforcement Learning (RL) was at an orientation meeting a few years ago. Back then, I barely understood it, especially while listening to others ask questions. Only recently