Heykuki News
Top
New
Best
Ask
Show
Jobs
91.
▲
Show HN: Using AI to judge a drinking game – SplitTheG.dev
(splittheg.dev)
3 points
BitNibbleByte
a year ago
2 comments
92.
▲
Apply for the Judicial Innovation Fellowship
(github.com/JIFGeorgetown)
3 points
epicfaace
3 years ago
1 comment
93.
▲
Gavel is a project expo judging system
(github.com/anishathalye)
3 points
mnem
10 years ago
1 comment
94.
▲
Show HN: NyaayWatch – Observability layer for the Indian judiciary
(nyaaywatch.in)
3 points
Rudraksh06
a month ago
discuss
95.
▲
Show HN: Signals – finding the most informative agent traces without LLM judges
(arxiv.org)
3 points
sparacha
2 months ago
discuss
96.
▲
Show HN: Cognition-wheel – parallel LLM fusion with bias masking and judging
(github.com/Hormold)
3 points
Hormold
a year ago
discuss
97.
▲
Justice: Yet Another Online Judge
(github.com)
3 points
liumangchao
7 years ago
discuss
98.
▲
Show HN: Grading Notes for LLM-as-Judge
(github.com/shabie)
2 points
shabie
2 years ago
3 comments
99.
▲
Show HN: pg_roast – A Postgres extension that harshly judges your database
(github.com/samirketema)
2 points
samirketema
2 months ago
1 comment
100.
▲
Open-source LLM-as-judge eval suite with root cause analysis and failure mining
(github.com/colingfly)
2 points
colinfly
3 months ago
1 comment
101.
▲
Show HN: Yet Another Online Judge Implementation
(github.com)
2 points
zsgsdesign
7 years ago
1 comment
102.
▲
Ask HN: Criteria for judging JavaScript project?
2 points
octref
11 years ago
1 comment
103.
▲
Hey Jude as a vbScript
(github.com/mockmyberet)
2 points
tommybecker
13 years ago
discuss
104.
▲
Codejudge: A lightweight online judge
(github.com/sankha93)
2 points
sankha93
13 years ago
discuss
105.
▲
Show HN: CoJudge – open-source, offline judge for studying LC-style problems
(github.com/cojudge)
2 points
ansliy
7 months ago
discuss
106.
▲
Evaluating Large Language Models Using LLM-as-a-Judge
(github.com/aws-samples)
2 points
mooreds
2 years ago
discuss
107.
▲
Scruples: Corpus of ethical judgments extracted from Reddit
(github.com/allenai)
2 points
nikochiko
6 years ago
discuss
108.
▲
JHU CSSE Covid-19 Data Repo Removes Information on Palestine
(github.com/CSSEGISandData)
2 points
jnmandal
6 years ago
discuss
109.
▲
Novel Coronavirus (Covid-19) Cases, Provided by JHU CSSE
(github.com/CSSEGISandData)
2 points
itbeho
6 years ago
discuss
110.
▲
Covid-19: Novel Coronavirus (Covid-19) Cases, Provided by JHU CSSE
(github.com/CSSEGISandData)
2 points
DyslexicAtheist
6 years ago
discuss
111.
▲
Coderunner – A judge for your programs, run and test your programs through Python
(github.com/codeclassroom)
2 points
bhupesh
7 years ago
discuss
112.
▲
Show HN: A command line interface to UVA online judge (competitive programming)
(github.com/scvalencia)
2 points
scvalencia
10 years ago
discuss
113.
▲
Show HN: Meaning-Based Judgment Simulation for LLM Interfaces
1 point
GENIXUS
a year ago
2 comments
114.
▲
Show HN: Judgment Boundary – Stop as a First-Class Outcome for AI Systems
(github.com/Nick-heo-eg)
1 point
echoos
4 months ago
1 comment
115.
▲
LLM Position Bias Benchmark: Swapped-Order Pairwise Judging
(github.com/lechmazur)
1 point
zone411
a month ago
discuss
116.
▲
Show HN: Claude-relais – A plan/build/judge loop mixing Claude with Cursor
(github.com/clementrog)
1 point
crog
4 months ago
discuss
117.
▲
Local Agent Bench: Test 11 small LLMs on tool-calling judgment, on CPU, no GPU
(github.com/MikeVeerman)
1 point
MikeVeerman
4 months ago
discuss
118.
▲
Precision-Based Sampling of LLM Judges
(sunnybak.net)
1 point
sunny-bak
a year ago
discuss
119.
▲
Show HN: Lone Arena – Self-hosted LLM human evaluation, you be the judge
(github.com/Contextualist)
1 point
Contextualist
2 years ago
discuss
120.
▲
Collection of TypeScript type challenges with online judge
(github.com/type-challenges)
1 point
max-m
2 years ago
discuss