Claudio Mayrink Verdun
  • about
  • publications

Announcement_18

June 1, 2025

2025

Two new papers about sampling from LLMs and AI alignment. Soft Best-of-n and Inference-Time Reward Hacking in LLMs.

© Copyright 2025 Claudio Mayrink Verdun. Powered by Jekyll with al-folio theme. Hosted by GitHub Pages. Photos from Unsplash.