Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
61.
▲
Show HN: Playground for comparing embedding models on Wikipedia+book retrieval
(embeds.ai)
5 points
davidtsong
3 years ago
11 comments
62.
▲
Show HN: New eval from SWE-bench team evalutes LMs based on goals not tickets
(codeclash.ai)
5 points
lieret
7 months ago
1 comment
63.
▲
Show HN: ClawSoc – Observe Your AI Agent in an AI Society
(clawsoc.io)
5 points
benjosaur
3 months ago
discuss
64.
▲
Show HN: PromptOptimizer – Minimize LLM token complexity to save cost
(github.com/vaibkumr)
4 points
vaibkumr
3 years ago
2 comments
65.
▲
Show HN: Fast360 – A web tool to benchmark open-source OCR models side-by-side
(fast360.xyz)
4 points
yanaimngvov
10 months ago
1 comment
66.
▲
Show HN: We unified LLMs, vector memory, ranking, pruning models in one process
4 points
levkk
3 years ago
1 comment
67.
▲
Show HN: Agent Runner – open-source agent harness to benchmark real coding
(designarena.ai)
4 points
grace77
6 months ago
discuss
68.
▲
Show HN: Competition to write the most efficient bot to be King Of The Grid
(kingofthegrid.com)
4 points
desertkun
a year ago
discuss
69.
▲
Show HN: Elf – A CLI Helper for Advent of Code
(github.com/cak)
3 points
cak
6 months ago
7 comments
70.
▲
Ask HN: What are some strategies to create positivity in the engineering cycle?
3 points
MattyRad
5 years ago
5 comments
71.
▲
Show HN: Flappy Gopher with Online Ranking – A Go/WebAssembly Browser Game
(flappy-ranking.pages.dev)
3 points
ponyo877
a year ago
4 comments
72.
▲
Show HN: I blind-tested 14 LLMs on a WP plugin task. Surprising Findings
(github.com/guilamu)
3 points
guilamu
a month ago
2 comments
73.
▲
Show HN: Using AI to judge a drinking game – SplitTheG.dev
(splittheg.dev)
3 points
BitNibbleByte
a year ago
2 comments
74.
▲
Show HN: Claude/OpenAI/Gemini agents compete as investors with $100K each
(github.com/upstash)
3 points
enesakar
2 months ago
1 comment
75.
▲
Show HN: We open-sourced our internal tool for scoring PRs with Claude AI
(github.com/MergeMint)
3 points
textcortex
6 months ago
1 comment
76.
▲
Show HN: Predict team ranks in sports and video games with openskill.py
(github.com/OpenDebates)
3 points
daegontaven
3 years ago
1 comment
77.
▲
Ask HN: Should i publish my fork?
3 points
LeanderK
9 years ago
1 comment
78.
▲
Show HN: FC-Eval – CLI to Benchmark Local or Cloud LLMs on Function Calling
(github.com/gauravvij)
3 points
gauravvij137
3 months ago
discuss
79.
▲
Show HN: Auto LLM Ranker – Describe a task in English and get ranked models
(github.com/gauravvij)
3 points
gauravvij137
3 months ago
discuss
80.
▲
Show HN: Open-source Go Challenges – Interactive practice for interviews
(github.com/RezaSi)
3 points
RezaSi
a year ago
discuss
81.
▲
Show HN: Android port of 3D Space Cadet Pinball
(github.com/fexed)
3 points
fexed
3 years ago
discuss
82.
▲
Show HN: AICoderList, a way to keep track of all those AI code editors
(aicoderlist.vercel.app)
2 points
treexs
a year ago
6 comments
83.
▲
Show HN: Open dataset of real-world LLM performance on Apple Silicon
(devpadapp.com)
2 points
uncSoft
3 months ago
4 comments
84.
▲
Show HN: I built a tool that summarizes web articles with AI
(getgistr.com)
2 points
spicy_ranch
a year ago
4 comments
85.
▲
Autonomous Bug Bounty Agent: Reached #86 on HackerOne, DoD Triage
2 points
Layer_8
4 months ago
2 comments
86.
▲
Show HN: LoKey Typer – A calm typing practice app with ambient soundscapes
(mcp-tool-shop-org.github.io)
2 points
mikeyfrilot
4 months ago
2 comments
87.
▲
Show HN: Click the Number Reaction Game
(projects.marcnitzsche.de)
2 points
mrccc
3 years ago
1 comment
88.
▲
Show HN: Single-Instruction (Subleq) Programming Game
(jaredkrinke.itch.io)
2 points
schemescape
4 years ago
1 comment
89.
▲
Show HN: Compare AutoML frameworks on 10 Tabular Kaggle competitions
2 points
pplonski86
5 years ago
1 comment
90.
▲
Show HN: OpenClaw Arena – Benchmark models on real tasks, rank by perf and cost
(app.uniclaw.ai)
2 points
skysniper
2 months ago
discuss
More