Jack's blog

强化学习进阶教程

2025-08-11

本文深入探讨强化学习的核心算法，包括Q-Learning、DQN等...

强化学习入门指南 (Reinforcement Learning Primer) 📚 学习目标：掌握强化学习的核心概念、主要算法分类等，为深入学习强化学习奠定坚实基础。本文面向初学者，系统介绍强化学习的核心概念与算法。强化学习广泛应用于游戏 AI、机器人控制、推荐系统与自动驾驶等领域。 1234import numpy as npa = np.ones(1) 123#include<iostream>using namespace std; 📋 目录导航 1. 什么是强化学习 (Reinforcement Learning)？ 2. 强化学习的核心要素 3. 强化学习的分类 4. 探索 (Exploration) 与利用 (Exploitation) 5. 强化学习的关键方程：贝尔..

Reinforcement

2025-08-04

Reinforcement Learning Primer 📚 Learning Objectives: Master the core concepts of reinforcement learning, main algorithm categories, etc., to lay a solid foundation for in-depth study of reinforcement learning. This document aims to provide beginners with a systematic introduction to reinforcement learning. As an important branch of artificial intelligen..

claude

test

2025-08-03

Learning Notes: AI “Persona Vectors” Research Study Date: Today Paper Source: Anthropic Research Team Keywords: persona vectors, AI safety, neural networks, behavior control Today’s Key Takeaways 🤔 Questions That Sparked My Thinking I’ve been wondering: why does AI sometimes act friendly and other times say bizarre things? Think of Microsoft Bing’s “Syd..

Hello World

2025-08-03

Welcome to Hexo! This is your very first post. Check documentation for more info. If you get any problems when using Hexo, you can find the answer in troubleshooting or you can ask me on GitHub. Quick Start Create a new post 1hexo new "My New Post" More info: Writing Run server 1hexo server More info: Server Generate static files 1hexo generate More info: ..

Jack's blog

Archives · All

Home

About

Archives

Jujutsu Manual

强化学习进阶教程

RL-demo

Reinforcement

test

Hello World