Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
1.
Show HN: OpenClaw Arena – Benchmark models on real tasks, rank by perf and cost (app.uniclaw.ai)
2 points
skysniper
2 months ago
discuss
2.
StepFun 3.5 Flash is #1 cost-effective model for OpenClaw tasks (300 battles) (app.uniclaw.ai)
175 points
skysniper
2 months ago
84 comments
3.
GLM-5.1 matches Opus 4.6 in agentic performance, at ~1/3 actual cost (app.uniclaw.ai)
22 points
skysniper
2 months ago
2 comments
4.
Opus 4.7 dominates agentic benchmark, 15% more expensive than Opus 4.6 (app.uniclaw.ai)
3 points
skysniper
2 months ago
1 comment