One commonly used algorithm for solving MDP problems, alongside Policy Iteration, is Value Iteration. Like Policy Iteration, it builds on the Bellman Equation, but the two methods differ in their internal process.
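Since Value Iteration repeatedly applies the Bellman optimality backup until the state values stop changing, a minimal sketch may make the idea concrete. The tiny MDP below (its states, actions, transition probabilities, and rewards) is purely illustrative and not taken from any specific problem:

```python
# A minimal sketch of Value Iteration on a hand-made toy MDP.
# P[s][a] is a list of (probability, next_state, reward) triples.
P = {
    0: {"stay": [(1.0, 0, 0.0)], "go": [(0.8, 1, 1.0), (0.2, 0, 0.0)]},
    1: {"stay": [(1.0, 1, 0.0)], "go": [(1.0, 2, 10.0)]},
    2: {"stay": [(1.0, 2, 0.0)]},  # absorbing state with no reward
}
gamma = 0.9   # discount factor
theta = 1e-6  # convergence threshold

V = {s: 0.0 for s in P}
while True:
    delta = 0.0
    for s in P:
        # Bellman optimality backup: max over actions of expected return
        best = max(
            sum(p * (r + gamma * V[s2]) for p, s2, r in P[s][a])
            for a in P[s]
        )
        delta = max(delta, abs(best - V[s]))
        V[s] = best
    if delta < theta:
        break

# Extract the greedy policy from the converged values
policy = {
    s: max(P[s], key=lambda a: sum(p * (r + gamma * V[s2])
                                   for p, s2, r in P[s][a]))
    for s in P
}
```

Note that unlike Policy Iteration, there is no separate policy-evaluation loop here: the values are improved directly, and a policy is only read off at the end.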
This morning, let's talk about how to find the optimal policy of MDP (Markov Decision Process) problems. Yet there is one important thing to note: the problem can be divided
Today, we’re going to dive deeper into the Transformer. However, before discussing its architecture, there's one important concept we need to cover: Multi-Head Attention. If you're not familiar
Today, let's talk about the cornerstone of RL. You can refer to this post to gain a basic understanding of RL.
Introduction to MDP
A Markov Decision Process (MDP) is the foundation of
Overview
A whole year has passed, and yet I haven't written anything. As time went on, I often thought about putting my exam prep experience into words, but those thoughts would always