No GPU, no LLM — agentic RL from scratch in pure Python. 不用 GPU、不用 LLM,纯 Python 手搓明白 Agentic RL 训练闭环(trajectory / reward / loss mask / GRPO)
No GPU, no LLM — agentic RL from scratch in pure Python. 不用 GPU、不用 LLM,纯 Python 手搓明白 Agentic RL 训练闭环(trajectory / reward / loss mask / GRPO)
Marketplace
Independent
Category
automation
More like this
Browse automation agents →