Claudio Mayrink Verdun

Announcement_18

June 1, 2025

Two new papers on sampling from LLMs and AI alignment: Soft Best-of-n and Inference-Time Reward Hacking in LLMs.

© 2026 Claudio Mayrink Verdun.