Our paper “REBEL: Reinforcement Learning via Regressing Relative Rewards” is now on arXiv.