MineSolver: 지뢰찾기 DQN 에이전트

1. 🔗 프로젝트 개요 & 링크

고성능 지뢰찾기 에이전트를 개발하고, 누구나 활용할 수 있도록 웹 인터페이스를 구현했습니다.

지뢰찾기 에이전트를 직접 체험해보세요!

학습 및 실험 코드 저장소

Dev.

개발 과정과 성능 분석을 기록한 일지

기반부터 직접 구현하는 과정을 통해 많은 오류와 시행착오를 겪었고, 이를 극복하며 코딩 실력이 크게 향상되었습니다.
처음 진행한 강화학습 프로젝트로, 지뢰찾기라는 명확한 의사결정 구조를 구현하면서 사람과 AI 플레이어의 시야 차이를 state와 보상에 반영하는 경험을 했습니다.
stride, padding, 정규화 등 세부 요소가 성능에 큰 영향을 미친다는 점을 체감했고, Conv-only 구조에서 성능 향상을 확인하며 ML 개발자로서 섬세한 구현의 중요성을 배웠습니다.

High-performance Minesweeper agent developed with a web interface for easy access and testing.

Try the Minesweeper agent online

Code repository for training and experiments

Designed the Minesweeper environment and game logic as an MDP from scratch.
Implemented DQN without external libraries to fully understand the algorithm.
Improved performance from ~40% to an average of 84% by optimizing model architecture, state representation, and reward design.
Applied BFS for faster inference and minimized for-loops for efficiency.
Used Monte Carlo evaluation to provide confidence intervals and ensure reliable performance metrics.
Achieved the highest-performing model among 9 club members and guided peers by sharing methods and results.

Documented the problem-solving process, performance analysis, and post-project review.

Dev.

Development notes and performance analysis

Building the agent from scratch required overcoming numerous errors, which significantly improved my coding skills.
As my first reinforcement learning project, it reshaped how I think about problem-solving. Modeling Minesweeper required careful consideration of the visibility gap between human and AI players, reflected in state and reward design.
Experienced firsthand how details like normalization, stride, and padding can make or break performance, and observed a major boost when shifting from Conv+FC to Conv-only structures — reinforcing the importance of fine-grained implementation as an ML developer.