My LLM optimization loop reward-hacked its own benchmark (and other lessons) [pdf] | Heykuki News