April 25, 2024
2024
Our paper “REBEL: Reinforcement Learning via Regressing Relative Rewards” is now on arXiv.