Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
AgentBench: A Comprehensive Benchmark to Evaluate LLMs as Agents
github.com/THUDM
1 point
swyx
3 years ago
No comment yet
AgentBench: A Comprehensive Benchmark to Evaluate LLMs as Agents | Heykuki News