automation·Independent✓ Verified

GDPO

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

About

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Tags

Pricing

Free

0
Visit website ↗

Marketplace

Independent

Category

automation

More like this

Browse automation agents →