automation·IndependentNew✓ Verified

Agentic Rl From Scratch

No GPU, no LLM — agentic RL from scratch in pure Python. 不用 GPU、不用 LLM,纯 Python 手搓明白 Agentic RL 训练闭环(trajectory / reward / loss mask / GRPO)

About

No GPU, no LLM — agentic RL from scratch in pure Python. 不用 GPU、不用 LLM,纯 Python 手搓明白 Agentic RL 训练闭环(trajectory / reward / loss mask / GRPO)

Tags

Pricing

Free

0
Visit website ↗

Marketplace

Independent

Category

automation

More like this

Browse automation agents →