Heykuki News
Show HN: CATArena – Evaluating LLM agents via dynamic environment interactions
github.com/AGI-Eval-Official
3 points
jinqueeny
5 months ago
No comments yet