Reinforcement Learning

Asynchronous Advantage Actor Critic (A3C)

02 Sep 2025 3 min read Reinforcement Learning

We have discussed the details of the Vanilla Actor-Critic (VAC) and implemented an application for stock prediction. Today, I would like to explore another variation of the Actor-Critic family called Asynchronous Advantage Actor-Critic

Vanilla Actor-Critic (VAC)

31 Aug 2025 4 min read Reinforcement Learning

We have previously discussed the Actor Critic and Soft Actor Critic (SAC) frameworks. Today, I would like to revisit its most fundamental form in order to build a deeper understanding and connect it

Advantage Actor Critic (A2C)

22 Jun 2025 5 min read Reinforcement Learning

It seems we've overlooked an important method in policy-based approaches: the Actor-Critic algorithm. In fact, we've already discussed a more advanced variant built upon it— Soft Actor Critic (SAC)

PPO-NES

20 Jun 2025 4 min read Reinforcement Learning

Today, we will explore Natural Evolution Strategies (NES) and investigate how they can be combined with Proximal Policy Optimization (PPO) to potentially enhance its performance. To evaluate the effectiveness of this approach, we

Soft Actor Critic (SAC)

16 Jun 2025 8 min read Reinforcement Learning

In the previous post, we discussed Proximal Policy Optimization (PPO) and its strengths, which have made it a popular choice in recent years. However, as an on-policy method, PPO suffers from a key