Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
121.
▲
Job postings evaluator against your resume (Chrome extension)
(github.com/alikh31)
1 point
alikhoramshahi
5 months ago
discuss
122.
▲
Policy Evaluation in Grid World
(github.com/elliotvilhelm)
1 point
monadicmonad
2 years ago
discuss
123.
▲
Tracking an LLM Evaluator Using Comet
(github.com/dair-ai)
1 point
omarsar
3 years ago
discuss
124.
▲
Propositional Logic Calculator
(github.com/lion137)
1 point
tu7001
7 years ago
discuss
125.
▲
Parsing Mitre EDR Evaluation Results
(github.com/zshehri)
1 point
based2
7 years ago
discuss
126.
▲
Go Expression Evaluation Comparison
(github.com/antonmedv)
1 point
zdw
7 years ago
discuss
127.
▲
Eval.js – A JavaScript interpreter written in JavaScript
(github.com/marten-de-vries)
1 point
hugs
10 years ago
discuss
128.
▲
Show HN: Fine-tuned Llama 3.2 3B to match 70B models for local transcripts
(bilawal.net)
31 points
phantompeace
9 months ago
8 comments
129.
▲
Show HN: Sup AI, a confidence-weighted ensemble (52.15% on Humanity's Last Exam)
(sup.ai)
26 points
supai
2 months ago
24 comments
130.
▲
Show HN: Cognee – Open-Source AI Memory Layer That Remembers Context
(github.com/topoteretes)
9 points
vasa_
a year ago
2 comments
131.
▲
Show HN: PromptOptimizer – Minimize LLM token complexity to save cost
(github.com/vaibkumr)
4 points
vaibkumr
3 years ago
2 comments
132.
▲
Show HN: See – searchable JSON compression, smaller than ZSTD (on our data)
(github.com/kodomonocch1)
3 points
Tetsuro
4 months ago
1 comment
133.
▲
Show HN: Legal Action Boundary Eval for agentic legal workflows
(github.com/bigkan8)
2 points
kankouadio_vx
a month ago
2 comments
134.
▲
Show HN: BenchFlow – Open-Source Benchmark Hub and Eval Infra for AI Devs
(docs.benchflow.ai)
1 point
www_xiangyi_li
a year ago
discuss
135.
▲
Show HN: AI Product Hunter – GenAI reviews/scores "all"of Producthunt everyday
(ai-producthunt.com)
1 point
tokiyaabe
2 years ago
discuss
136.
▲
Eval($_POST[cmd])
(github.com)
12 points
brevis
11 years ago
8 comments
137.
▲
Evaluating Technical Arguments
(swanson.github.com)
4 points
swanson
13 years ago
discuss
138.
▲
Engineering JavaScript's eval
(brownplt.github.com)
3 points
p4bl0
14 years ago
discuss
139.
▲
In Go, some evaluation orders in multi-value assignments are unspecified
(github.com/go101)
3 points
tapirl
8 years ago
discuss
140.
▲
Show HN: Dbt-LLM-evals – Monitor LLM quality in your data warehouse
(github.com/paradime-io)
2 points
fdileta
5 months ago
1 comment
141.
▲
Show HN: Synthetic Data Generation Using LangChain for IR and RAG Evaluation
(github.com/mddunlap924)
2 points
tdunlap607
2 years ago
discuss
142.
▲
Automated evaluation of coding round interviews
(github.com/shekhargulati)
2 points
java4all
9 years ago
discuss
143.
▲
Evaluating Technical Arguments
(swanson.github.com)
1 point
swanson
13 years ago
discuss
144.
▲
Show HN: Social proof works 2-7x better on AI shopping agents than humans
(github.com/aaronbatchelder)
1 point
aaronmb7
3 months ago
discuss
145.
▲
defer-import-eval: proposal for introducing a way to defer evaluate of a module
(github.com/tc39)
1 point
tilt
10 months ago
discuss
146.
▲
Evaluation of Covid-19 Models
(github.com/youyanggu)
1 point
sebg
6 years ago
discuss
147.
▲
Show HN: I forced Claude to play Tetris in Emacs
(imgur.com)
13 points
iLemming
2 months ago
3 comments
148.
▲
Show HN: Skyvern 2.0 – open-source AI Browser Agent scoring 85.8% on WebVoyager
(eval.skyvern.com)
9 points
suchintan
a year ago
3 comments
149.
▲
Ask HN: Survey: Scripting languages for realtime applications
2 points
schoetbi
9 years ago
2 comments
150.
▲
The Car Wash Problem: A variable isolation study on prompt architecture
2 points
midmost44
4 months ago
1 comment
More