Claudio Mayrink Verdun

about
papers
mentorship
library

Announcement_18

June 1, 2025

Two new papers about sampling from LLMs and AI alignment. Soft Best-of-n and Inference-Time Reward Hacking in LLMs.

© 2026 Claudio Mayrink Verdun.