- Leo; Schulman; Hilton,
Jacob (2022). "Scaling Laws for
Reward Model Overoptimization". arXiv:2210.10760 [cs.LG]. "ChatGPT can now
access up to date information"...
- Finn, Chelsea; Niekum,
Scott (2024). "Scaling Laws for
Reward Model Overoptimization in
Direct Alignment Algorithms". arXiv:2406.02900 [cs.LG]. Shi, Zhengyan;...
- John; Hilton,
Jacob (October 19, 2022). "Scaling Laws for
Reward Model Overoptimization". arXiv:2210.10760 [cs.LG]. Anderson,
Martin (April 5, 2022). "The...
- John; Hilton,
Jacob (2022-10-19). "Scaling Laws for
Reward Model Overoptimization". ICML. arXiv:2210.10760. Yu, Sihyun; Ahn, Sungsoo; Song, Le; Shin...