Heykuki News

1.
Launch HN: RunRL (YC X25) – Reinforcement learning as a service (runrl.com)
71 points
ag8
9 months ago
22 comments
2.
Training Qwen to answer briefly yet intelligently using feedback control (runrl.com)
4 points
ag8
9 months ago
discuss
3.
Why Run RL? How specialized models can outperform the biggest LLMs (runrl.com)
4 points
-_-
a year ago
discuss
4.
Scaling pretraining affects RL sample efficiency (runrl.com)
1 point
ag8
7 months ago
discuss
5.
Generating the Funniest Joke with RL (runrl.com)
1 point
ag8
a year ago
discuss