Heykuki News

1. My LLM optimization loop reward-hacked its own benchmark (and other lessons) [pdf] (github.com/CodeReclaimers)
1 point by CodeReclaimers 10 days ago | 2 comments