Vanishing Gradients in Reinforcement Finetuning of Language Models Paper • 2310.20703 • Published Oct 31, 2023 • 1
Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization Paper • 2410.08847 • Published Oct 11, 2024