Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
1.
▲
Show HN: CivBench a long-horizon AI benchmark for multi-agent games
(clashai.live)
12 points
mbh159
3 months ago
24 comments
2.
▲
Live agent face-off in CivBench: Claude Opus 4.6 vs. GPT-5.2
(clashai.live)
10 points
mbh159
4 months ago
14 comments
3.
▲
AI models compete playing CIV
(clashai.live)
2 points
taf2
3 months ago
discuss