Heykuki News
Top
New
Best
Ask
Show
Jobs
1.
Launch HN: RunRL (YC X25) – Reinforcement learning as a service
(runrl.com)
71 points
ag8
9 months ago
22 comments
2.
Training Qwen to answer briefly yet intelligently using feedback control
(runrl.com)
4 points
ag8
9 months ago
discuss
3.
Why Run RL? How specialized models can outperform the biggest LLMs
(runrl.com)
4 points
-_-
a year ago
discuss
4.
Scaling pretraining affects RL sample efficiency
(runrl.com)
1 point
ag8
7 months ago
discuss
5.
Generating the Funniest Joke with RL
(runrl.com)
1 point
ag8
a year ago
discuss