Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
1.
▲
Judge0 – the most advanced open-source online code execution system in the world
(github.com/judge0)
5 points
digitalnalogika
2 years ago
discuss
2.
▲
Judge0 API Goes Freemium
(github.com/judge0)
1 point
_q35k
6 years ago
discuss
3.
▲
Ask HN: Help me improve my C-like language, C3
12 points
Nuoji
6 years ago
7 comments
4.
▲
Show HN: The actual registry price of 246 TLD's
(github.com/judge2020)
4 points
judge2020
8 years ago
1 comment
5.
▲
Show HN: Fast open-source autograding library written in Django
(github.com/arthtyagi)
4 points
arthtyagi
6 years ago
discuss
6.
▲
The real cost of TLDs
(github.com/judge2020)
3 points
hexene
6 years ago
1 comment
7.
▲
Show HN: Fast open-source autograding library written in Django
(github.com/arthtyagi)
3 points
arthtyagi
6 years ago
discuss
8.
▲
Show HN: Fast Open-source autograder for coding problems (Django)
(github.com/arthtyagi)
2 points
arthtyagi
6 years ago
1 comment
9.
▲
Show HN: Blazing fast open-source autograder for coding problems (Django)
(github.com/arthtyagi)
2 points
arthtyagi
6 years ago
1 comment
10.
▲
Show HN: Open-source autograder for coding problems (Django)
(github.com/arthtyagi)
2 points
arthtyagi
6 years ago
discuss
11.
▲
Show HN: Fast open-source autograding library written in Django
(github.com/arthtyagi)
1 point
arthtyagi
6 years ago
discuss
12.
▲
Show HN: The real registration cost of TLDs
(github.com/judge2020)
1 point
judge2020
7 years ago
discuss
13.
▲
Composo open-sources its LLM-as-Judge technique (83.6% on RewardBench 2)
(github.com/composo-ai)
5 points
mlukewizard
2 months ago
discuss
14.
▲
Awesome-LLM-Judges
(github.com/haizelabs)
2 points
leonardtang
a year ago
discuss
15.
▲
LLM Judges
(github.com/haizelabs)
2 points
leonardtang
a year ago
discuss
16.
▲
UVa Online Judge Solutions Repo (Work in Progress)
(github.com/jcbages)
2 points
jcbages
9 years ago
discuss
17.
▲
Show HN: Lightweight LLM-as-a-Judge Tool
(github.com/frequena)
2 points
frequena
9 months ago
discuss
18.
▲
Show HN: OpenClaw Arena – Benchmark models on real tasks, rank by perf and cost
(app.uniclaw.ai)
2 points
skysniper
2 months ago
discuss
19.
▲
The Divine Judgement: Enforce TypeScript Types at Runtime
(github.com/Divine-Software)
1 point
LeviticusMB
3 years ago
discuss
20.
▲
Ranking 1k ShowHN posts by estimated merit using an LLM judge and TrueSkill
(github.com/kouhxp)
7 points
mrkn1
25 days ago
2 comments
21.
▲
Type-challenges: Collection of TypeScript type challenges with online judge
(github.com/type-challenges)
4 points
olalonde
3 years ago
discuss
22.
▲
LoCoMo AI Benchmark: 6.4% of answer key wrong, judge accepts 63% of fake answers
(github.com/dial481)
3 points
dial481
2 months ago
3 comments
23.
▲
Show HN: Using AI to judge a drinking game – SplitTheG.dev
(splittheg.dev)
3 points
BitNibbleByte
a year ago
2 comments
24.
▲
Show HN: Signals – finding the most informative agent traces without LLM judges
(arxiv.org)
3 points
sparacha
2 months ago
discuss
25.
▲
Justice: Yet Another Online Judge
(github.com)
3 points
liumangchao
7 years ago
discuss
26.
▲
Show HN: Grading Notes for LLM-as-Judge
(github.com/shabie)
2 points
shabie
2 years ago
3 comments
27.
▲
Show HN: pg_roast – A Postgres extension that harshly judges your database
(github.com/samirketema)
2 points
samirketema
a month ago
1 comment
28.
▲
Open-source LLM-as-judge eval suite with root cause analysis and failure mining
(github.com/colingfly)
2 points
colinfly
3 months ago
1 comment
29.
▲
Show HN: Yet Another Online Judge Implementation
(github.com)
2 points
zsgsdesign
7 years ago
1 comment
30.
▲
Codejudge: A lightweight online judge
(github.com/sankha93)
2 points
sankha93
13 years ago
discuss
More