Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
1.
Judge0 – the most advanced open-source online code execution system in the world (github.com/judge0)
5 points
digitalnalogika
2 years ago
discuss
2.
Judge0 API Goes Freemium (github.com/judge0)
1 point
_q35k
6 years ago
discuss
3.
Ask HN: Help me improve my C-like language, C3
12 points
Nuoji
6 years ago
7 comments
4.
Show HN: The actual registry price of 246 TLD's (github.com/judge2020)
4 points
judge2020
8 years ago
1 comment
5.
Show HN: Fast open-source autograding library written in Django (github.com/arthtyagi)
4 points
arthtyagi
6 years ago
discuss
6.
The real cost of TLDs (github.com/judge2020)
3 points
hexene
6 years ago
1 comment
7.
Show HN: Fast open-source autograding library written in Django (github.com/arthtyagi)
3 points
arthtyagi
6 years ago
discuss
8.
Show HN: Fast Open-source autograder for coding problems (Django) (github.com/arthtyagi)
2 points
arthtyagi
6 years ago
1 comment
9.
Show HN: Blazing fast open-source autograder for coding problems (Django) (github.com/arthtyagi)
2 points
arthtyagi
6 years ago
1 comment
10.
Show HN: Open-source autograder for coding problems (Django) (github.com/arthtyagi)
2 points
arthtyagi
6 years ago
discuss
11.
Show HN: Fast open-source autograding library written in Django (github.com/arthtyagi)
1 point
arthtyagi
6 years ago
discuss
12.
Show HN: The real registration cost of TLDs (github.com/judge2020)
1 point
judge2020
7 years ago
discuss
13.
Composo open-sources its LLM-as-Judge technique (83.6% on RewardBench 2) (github.com/composo-ai)
5 points
mlukewizard
2 months ago
discuss
14.
Awesome-LLM-Judges (github.com/haizelabs)
2 points
leonardtang
a year ago
discuss
15.
LLM Judges (github.com/haizelabs)
2 points
leonardtang
a year ago
discuss
16.
UVa Online Judge Solutions Repo (Work in Progress) (github.com/jcbages)
2 points
jcbages
9 years ago
discuss
17.
Show HN: Lightweight LLM-as-a-Judge Tool (github.com/frequena)
2 points
frequena
9 months ago
discuss
18.
Show HN: OpenClaw Arena – Benchmark models on real tasks, rank by perf and cost (app.uniclaw.ai)
2 points
skysniper
2 months ago
discuss
19.
The Divine Judgement: Enforce TypeScript Types at Runtime (github.com/Divine-Software)
1 point
LeviticusMB
3 years ago
discuss
20.
Ranking 1k ShowHN posts by estimated merit using an LLM judge and TrueSkill (github.com/kouhxp)
7 points
mrkn1
25 days ago
2 comments
21.
Type-challenges: Collection of TypeScript type challenges with online judge (github.com/type-challenges)
4 points
olalonde
3 years ago
discuss
22.
LoCoMo AI Benchmark: 6.4% of answer key wrong, judge accepts 63% of fake answers (github.com/dial481)
3 points
dial481
2 months ago
3 comments
23.
Show HN: Using AI to judge a drinking game – SplitTheG.dev (splittheg.dev)
3 points
BitNibbleByte
a year ago
2 comments
24.
Show HN: Signals – finding the most informative agent traces without LLM judges (arxiv.org)
3 points
sparacha
2 months ago
discuss
25.
Justice: Yet Another Online Judge (github.com)
3 points
liumangchao
7 years ago
discuss
26.
Show HN: Grading Notes for LLM-as-Judge (github.com/shabie)
2 points
shabie
2 years ago
3 comments
27.
Show HN: pg_roast – A Postgres extension that harshly judges your database (github.com/samirketema)
2 points
samirketema
a month ago
1 comment
28.
Open-source LLM-as-judge eval suite with root cause analysis and failure mining (github.com/colingfly)
2 points
colinfly
3 months ago
1 comment
29.
Show HN: Yet Another Online Judge Implementation (github.com)
2 points
zsgsdesign
7 years ago
1 comment
30.
Codejudge: A lightweight online judge (github.com/sankha93)
2 points
sankha93
13 years ago
discuss
More