tonnonssi

이지민 ML 엔지니어

Jimin Lee ML Engineer

소개

About Me

저는 깊이 있는 문제 이해를 바탕으로 정확한 해결책을 설계하는 사람입니다. 단순히 주어진 과제를 수행하는 데 그치지 않고, 상황을 정의하고 핵심 원인을 규명하여 최적의 해법을 탐색하고자 합니다. 또한 리더십과 책임감을 바탕으로 동아리를 개설해, 금융·게임·로보틱스 등 다양한 프로젝트를 관리하고 성과를 이끌어냈습니다. 특히 여러 강화학습 중심의 프로젝트를 진행하며, 환경을 설계하고, 보상 구조를 정의하며, 에이전트가 실제로 학습하고 검증되는 전 과정을 직접 주도해왔습니다.

Enjoy the full lifecycle of code, from ideation to implementation, experimentation, and refinement. Thrives on designing new architectures and building complete systems, which aligns well with a focus on reinforcement learning (RL). Explores RL across diverse domains like gaming, robotics, and finance, while continuously expanding knowledge in deep learning and machine learning.

협업과 새로운 아이디어를 항상 환영해요.

Always open to fresh ideas and collaboration.

과거 진행한 여러 프로젝트와 저에 대해 관심이 있다면 편하게 메시지를 남겨주세요.
tonnonssi@gmail.com · GitHub · LinkedIn

Feel free to reach out if you're interested in RL, applied ML, or any of my past projects.
tonnonssi@gmail.com · GitHub · LinkedIn

최근에는 선물 트레이더 에이전트 고도화 · 졸업 논문을 준비 중이에요.

Currently upgrading my futures trader agent and developing an efficient bike-sharing repositioning system.

프로젝트 Projects

KRAFT: KOSPI 200 Reinforcement-learning Agent for Futures Trading thumbnail

Python
Pytorch
Pandas
NumPy
Custom Futures Market Simulator

PPO 알고리즘을 활용해 KOSPI 200 미니 선물 시장에서 스윙 트레이더 에이전트를 개발했습니다. 실제 거래를 반영하기 위해 슬리피지와 수수료를 포함한 강화학습 환경을 구축하고, Sharpe Ratio 0.8과 최대 낙폭 -12%를 기록했습니다. 복합 보상 설계와 KL Penalty를 통해 정책 안정화를 구현했으며, 현재는 xAI 적용 및 성능 향상을 위한 개선을 진행 중입니다.

Developed a swing trader agent for the KOSPI 200 mini futures market using the PPO algorithm. Built a reinforcement learning environment that incorporates slippage and commissions to reflect actual trading conditions, achieving a Sharpe Ratio of 0.8 and a maximum drawdown of -12%. Implemented complex reward design and KL Penalty for policy stabilization, and currently working on xAI application and performance improvement.

GitHub

AiGO: End-to-End AI for Gomoku with Vision and Robotics thumbnail

Python
C++
Pytorch
OpenCV
Arduino
Flask

4축 로봇팔(end-effector: suction)을 제어해 인간과 AI가 함께 오목(Gomoku)을 플레이하는 듀얼 에이전트 시스템을 개발했습니다. 오목 학습에는 AlphaZero 방법론을 적용하고, 로봇 제어에는 정·역기구학을 활용했습니다. 데이터 증강, 다양한 신경망 구조, 노이즈 기반 탐험을 통해 성능을 개선해 9×9 환경에서 평균 25스텝 이상의 플레이를 달성했습니다.

Developed a dual-agent system where humans and AI play Gomoku using a 4-axis robotic arm (end-effector: suction). Applied the AlphaZero methodology for Gomoku learning and utilized forward and inverse kinematics for robot control. Enhanced performance through data augmentation, various neural network architectures, and noise-based exploration, achieving an average of over 25 steps in a 9x9 environment.

GitHub Report

MineSolver: DQN Agent for Minesweeper thumbnail

Python
Pytorch
Javascript
matplotlib

DQN 방법론을 활용해 지뢰찾기 문제를 해결하는 에이전트를 개발했습니다. 초급 난이도에서 테스트 기준 평균 승률 84%를 달성한 모델을 학습시켰으며, 실제 활용을 위해 웹 배포까지 구현했습니다. 동아리 내에서 가장 높은 성능을 기록한 강화학습 에이전트로 개발 성공 사례를 만들었습니다.

Developed an agent to solve the Minesweeper problem using the DQN methodology. Trained a model that achieved an average win rate of 84% on beginner difficulty, and implemented web deployment for practical use. Successfully created a high-performance reinforcement learning agent, marking a development success story within the club.

GitHub

TicTacToeArtist: End-to-End AI for TicTacToe with Vision and Robotics thumbnail

Python
C++
Pytorch
OpenCV
Arduino

CNC 플로터 제어해 AI와 인간이 직접 상호작용하며 틱택토를 플레이할 수 있는 시스템을 구축했습니다. MCS, MCTS, Min-Max, AlphaBeta 등 다양한 방법론을 구현해 비교했고, 특히 AlphaZero 기반 강화학습 모델을 통해 최적의 플레이어 신경망을 학습했습니다. 실험 결과, AlphaZero 기반 AI가 다른 방법론 대비 가장 높은 승률을 기록했습니다.

Built a system where AI and humans can play TicTacToe interactively using a CNC plotter. Implemented and compared various methodologies including MCS, MCTS, Min-Max, and AlphaBeta, with a focus on training an optimal player neural network using an AlphaZero-based reinforcement learning model. Experimental results showed that the AlphaZero-based AI achieved the highest win rate compared to other methodologies.

GitHub Report

더 많은 프로젝트 보기 Show more projects

2023년 2023 리그오브레전드 전략 패턴 시각화 LoL-StrategyInsight: Visual Analytics of League of Legends Strategy Patterns Python · BeautifulSoup · Selenium · R · Tidyverse · Shiny · Visualization

2023년 2023 DigitFusion: 필기 숫자 분류기 DigitFusion: Handwritten Digit Classifier Python · Pytorch · PyQt5

모든 프로젝트의 자세한 내용은 상단의 'Project' 메뉴에서 확인하실 수 있습니다.

You can find detailed information about all projects in the 'Project' menu at the top.

스킬 Tech Stack

Core Python

Python 3.11+, Conda, uv
Typing, Dataclasses
Jupyter Notebook

ML / RL

PyTorch, TorchRL
Gymnasium, PettingZoo
Hydra, Pydantic

Data & Visualization

NumPy, Pandas
Matplotlib, Seaborn, Plotly
Scikit-learn, Statsmodels

보조 언어로, 통계 분석과 시각화에 사용합니다.

Used for statistical analysis and visualization.

Tidyverse
Shiny · RMarkdown

Web

프로토타입이나 내부 도구 배포를 위해 가벼운 웹 인터페이스를 제작합니다.

Build lightweight web frontends for prototypes or internal tools.

Flask · Javascript
Jekyll · GitHub Pages

C / C++

로보틱스 제어와 성능 최적화가 필요한 구간에 한정해 사용합니다.

Reserved for robotics control and performance-critical extensions.

Arduino SDK
Pybind11 integration

활동·리더십 Clubs & Leadership / Activities

Founder & Leader

강화시스터즈 KangHwaSisters

이화여자대학교 AI 동아리 · 2024년 3월 – 2026년 2월 Ewha Womans University AI Club · Mar 2024 – Feb 2026

동아리 창립 후 2년 간 회장으로 활동하며 기초부터 심화까지 아우르는 커리큘럼을 설계·운영했습니다.
4학기 간 15명의 팀원들을 이끌며, 매학기 주요 프로젝트(게임, 금융, 로보틱스)에서 팀 리더로 활동했습니다.
논문 스터디·코드 리뷰 문화 정착했고, 동아리 내 프로젝트 결과물을 정리해 블로그에 게시했습니다.
동아리 내 대회 및 세미나를 기획·운영했습니다.

Founded and led the club for 2 years, designing and running a curriculum covering basics to advanced topics.
Led 15 members over 4 semesters, serving as team lead on key projects (Game RL, Finance RL, Robotics RL).
Established a culture of paper study and code review, documenting project outcomes on our blog.
Organized internal competitions and seminars.

GitHub Blog

Mentor

스터디 멘토 Study Mentor

이화여자대학교 통계수학 · 2023년 3월 – 2023년 6월 Ewha Statistical Mathematics · Mar 2023 — Jun 2023

선형대수학 이론과 실습을 배우는 학부생 스터디 그룹을 만들고, 파이썬에 익숙하지 않은 학생들을 위해 기초 문법과 Numpy 활용법을 지도했습니다.

Created an undergrad study group on linear algebra theory and practice, coaching students unfamiliar with Python on basic syntax and NumPy usage.

Curriculum Notes

학력 Education

이화여자대학교 · 통계학 & 계산과학 복수전공 Ewha Womans University · Double Major in Statistics & Computational Science

대한민국 · GPA 3.97/4.3 South Korea · GPA 3.97/4.3

2022년 2월 입학, 2026년 2월 졸업 예정

Feb 2022 – Feb 2026(Expected)

최근 글 Recent Posts

[xAI] LIME [xAI] LIME (2025-11-01)

[DL/Timeseries] 트랜스포머 [DL/Timeseries] Transformer (2025-09-26)

[RL] Actor Critic [RL] Actor Critic (2025-05-06)

[MARL/01] Multi Agent Reinforcement Learning 기초 [MARL/01] Multi Agent Reinforcement Learning Basic (2025-05-05)

[DonkeyGym/01] mac m1 환경 설정 [DonkeyGym/01] mac m1 환경 설정 (2025-04-11)

소개