Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
1.
▲
Show HN: OpenClaw Arena – Benchmark models on real tasks, rank by perf and cost
(app.uniclaw.ai)
2 points
skysniper
2 months ago
discuss
2.
▲
StepFun 3.5 Flash is #1 cost-effective model for OpenClaw tasks (300 battles)
(app.uniclaw.ai)
175 points
skysniper
2 months ago
84 comments
3.
▲
GLM-5.1 matches Opus 4.6 in agentic performance, at ~1/3 actual cost
(app.uniclaw.ai)
22 points
skysniper
2 months ago
2 comments
4.
▲
Opus 4.7 dominates agentic benchmark, 15% more expensive than Opus 4.6
(app.uniclaw.ai)
3 points
skysniper
2 months ago
1 comment