Posted 2025-04-19 | Updated 2025-04-19 | 6 minutes read (About 941 words)

Of Q‑Tables and Three‑in‑a‑Rows: Training an RL Knight in Tic‑Tac‑Toe

Reinforcement Learning is fairly popular at the moment. In this chronicle, we shall embark on a quest to forge a Reinforcement Learning model for the noble game of Tic‑Tac‑Toe. We’ll write our own environment, summon a DQN sorcerer from Stable Baselines3, and ultimately witness our AI crush the humblest of human challengers (or at least draw more than half the time).

Style Transfer: in a Way of Semi-Ancient English-Chinese Translation

You may have heard of style transfer, a technique that carries the characteristics of one thing over to another. People might first think of it as a machine learning problem, but sometimes the classic approach is still fun to play with.

Ars longa, vita brevis.

Some random guy writes about nothing.
