Reinforcement Learning on AI Tech Blog

https://jesamkim.github.io/ai-tech-blog/tags/reinforcement-learning/
Recent content in Reinforcement Learning on AI Tech Blog

RLVR and Agentic RL: Reinforcement Learning Reclaims LLM Agents
https://jesamkim.github.io/ai-tech-blog/posts/2026-05-10-rlvr-agentic-rl-papers-review/
Sun, 10 May 2026 09:00:00 +0900
This post traces the RL revival sparked by DeepSeek-R1 through five recent papers, following the progression from GRPO to DAPO and on to training tool-use agents.