Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
271.
▲
Show HN: I built an MCP server to recruit employees for free
(github.com/Himalayas-App)
2 points
walshhub
3 months ago
discuss
272.
▲
Show HN: Lar-JEPA – A Testbed for Orchestrating Predictive World Models
(github.com/snath-ai)
2 points
axdithya
3 months ago
discuss
273.
▲
Show HN: BreakMyAgent – Open-source red-teaming sandbox for LLM system prompts
2 points
breakmyagent
3 months ago
discuss
274.
▲
Show HN: Agentic Gatekeeper – AI pre-commit hook to auto-patch logic errors
(github.com/revanthpobala)
2 points
revanth1108
4 months ago
discuss
275.
▲
Show HN: DACP – governance gateway for AI coding agents
(github.com/elliot35)
2 points
elliot35
4 months ago
discuss
276.
▲
Show HN: Measuring how AI agent teams improve issue resolution on SWE-Verified
(arxiv.org)
2 points
NBenkovich
4 months ago
discuss
277.
▲
Show HN: sc-membench for modern memory bandwidth and latency benchmarks
(github.com/spareCores)
2 points
daroczig
5 months ago
discuss
278.
▲
Ask HN: Critical review of a spec-first economic protocol
2 points
AGsist
5 months ago
discuss
279.
▲
Show HN: Epistemic Summary Line for ChatGPT
(github.com/il-b)
2 points
il-b
5 months ago
discuss
280.
▲
Show HN: Episteme – Aggregating and critiquing retail investor theses with NLP
(episteme.cloud)
2 points
amstrdm
5 months ago
discuss
281.
▲
Show HN: ZK-auctions – experimenting with zero-knowledge sealed-bid auctions
(github.com/ndrwnaguib)
2 points
ndrwnaguib
5 months ago
discuss
282.
▲
Show HN: Runtime Kubernetes Compliance Engine (Policy as Data, No SCAP XML)
(github.com/scanset)
2 points
scanset
5 months ago
discuss
283.
▲
Show HN: KARMA – An evaluation framework for Medical AI systems
(karma.eka.care)
2 points
k2so
10 months ago
discuss
284.
▲
Show HN: Dingo 1.9.0 released: With enhanced hallucination detection
(github.com/MigoXLab)
2 points
e06084
10 months ago
discuss
285.
▲
Show HN: New SWE-bench leaderboard compares LMs without fancy agent scaffolds
(swebench.com)
2 points
lieret
10 months ago
discuss
286.
▲
Show HN: Kritikos – Ready to use Go back end for LLM-as-a-critique
(github.com/michelelacorte-quinck)
2 points
MicheleLacorte
a year ago
discuss
287.
▲
Show HN: I made an open-source synthetic text datasets generator
(github.com/patrickfleith)
2 points
astropat
a year ago
discuss
288.
▲
Show HN: I made an open-source synthetic text datasets generator
(github.com/patrickfleith)
2 points
astropat
a year ago
discuss
289.
▲
Show HN: Nebula – A DSL for scripting TestContainers-based demos
(github.com/orbitalapi)
2 points
martypitt
a year ago
discuss
290.
▲
Show HN: Botwell – A Framework for LLM Comparative Analysis Using AI Peer Review
(github.com/alanwilhelm)
2 points
shakezooola
a year ago
discuss
291.
▲
Show HN: OptiLLMBench – Test how inference optimization tricks scale up LLMs
2 points
codelion
a year ago
discuss
292.
▲
Show HN: Mandoline – Custom LLM Evaluations for Real-World Use Cases
(mandoline.ai)
2 points
kmckiern
2 years ago
discuss
293.
▲
Show HN: KubeFox – Open-Source At-Runtime Versioning and Virtual Environments
(github.com/xigxog)
2 points
smh812xyz
2 years ago
discuss
294.
▲
Show HN: [OSS] Taking a Systematic Approach to Improving LLM Accuracy
(github.com/palico-ai)
2 points
shikdernyc
2 years ago
discuss
295.
▲
Show HN: Claude 3.5 Sonnet beats GPT-4o at Competitive Programming
(github.com/juvi21)
2 points
juv121
2 years ago
discuss
296.
▲
Show HN: A GitHub Action for helping RAG apps with CI/CD
(github.com/marketplace)
2 points
akamor
2 years ago
discuss
297.
▲
Show HN: Open Source, Splitscreen Prompt Engineering
(github.com/benguz)
2 points
BenGuz
2 years ago
discuss
298.
▲
Show HN: Reference-free evaluation of LLM-powered chatbots
(github.com/parea-ai)
2 points
Joschkabraun
3 years ago
discuss
299.
▲
Show HN: Open-source alternative to OpenAI Assistants API
(superflows.ai)
2 points
henry_pulver
3 years ago
discuss
300.
▲
Show HN: Play Euchre with AI Bots
(euchre.fewworddotrick.com)
2 points
swpecht
3 years ago
discuss
More