Heykuki News
THUDM/AgentBench: A Comprehensive Benchmark to Evaluate LLMs as Agents
github.com/THUDM
1 point
freediver
3 years ago
No comments yet