TicTacToeArtist: CV·로보틱스 기반 틱택토 AI

1. 🔗 프로젝트 개요 & 링크

CNC 플로터(로봇) - 틱택토 판 인식(CV) - 듀얼 제로섬 게임 에이전트(RL)를 개발해 시스템화시켰다.

학습, 실험 코드 저장소

역할: 팀원 · 강화시스터즈(24-2)

듀얼 에이전트 제로섬 게임 알고리즘 이론 강의 · 로보틱스

듀얼 에이전트 기초부터 심화까지 개인 공부 레파지토리

AI와 인간이 현실 세계에서 상호작용하게 만들고자 했고, 그림을 그리는 로봇인 CNC Plotter을 제작했다.

C++을 이용해 CNC Plotter을 제어하는 라이브러리를 구현했다.

틱택토는 승리 패턴이 적기 때문에 두 번째 플레이어가 불리합니다. 알파제로를 적용한 틱택토 에이전트는 두 번째 플레이어일 때도 낮은 패배율을 보였으며 다른 고전 듀얼 제로섬 에이전트보다 높은 승률 을 보였습니다.

싱글 에이전트와 달리 듀얼 에이전트는 상대까지 고려해 행동해야하기 때문에 사용하는 이론도, 난이도도 상이했습니다. 제로섬 게임 플레이어 이론이 어떻게 발전되어 왔고, 강화학습에서 어떻게 구현되었는지 배우는 과정이 새로웠습니다.
전체 구조를 짤 때는 어렵지 않을 거라 생각하고 호기롭게 시작한 프로젝트였습니다. 하지만 로보틱스에 대한 지식이 아예 없어 아주 단순한 현상이라도 문제를 명확히 규정하는 것부터가 많은 시간이 소요됐습니다. 문제를 인지하는 것부터가 해결의 시발점이라는 걸 알게 되었습니다.

Built an end-to-end loop that blends CNC plotter robotics, TicTacToe board perception, and dual zero-sum reinforcement learning agents.

Source code for robotics, CV pipeline, and RL agents

Role: Team Member

Led dual-agent zero-sum algorithm sessions and engineered the robotics stack

Mentored club members through the progression from Minimax/MCTS to AlphaZero, distilling the theory into practical agent implementations.
Completed the robotics layer early so teammates could focus on CV and AI, integrating hardware, firmware, and calibration.
Eliminated Arduino instability by authoring a C/C++ motion library tailored to the CNC plotter’s mechanics.

Personal repository covering dual-agent theory and experiments

How we constructed a drawing robot so the AI could interact with humans in the physical world.

Implemented a C++ library to drive the plotter precisely for turn-by-turn play.

Because TicTacToe offers few winning lines, the second player is typically disadvantaged. The AlphaZero-inspired agent maintained a low loss rate even when playing second and outperformed classical zero-sum baselines.

Unlike single-agent RL, dual-agent systems must reason about an opponent. Tracing how zero-sum game theory evolved and how it translates into modern RL architectures was eye-opening.
With no prior robotics experience, even diagnosing simple mechanical issues took significant time. Recognizing and articulating the problem turned out to be the critical first step toward a fix.