Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
Show HN: LlamaGym – fine-tune LLM agents with online reinforcement learning
github.com/KhoomeiK
239 points
KhoomeiK
2 years ago
28 comments
Loading...
Show HN: LlamaGym – fine-tune LLM agents with online reinforcement learning | Heykuki News