Heykuki News

Top New Best Ask Show Jobs
91.
Show HN: Using AI to judge a drinking game – SplitTheG.dev (splittheg.dev)
3 points
BitNibbleByte
a year ago
2 comments
92.
Apply for the Judicial Innovation Fellowship (github.com/JIFGeorgetown)
3 points
epicfaace
3 years ago
1 comment
93.
Gavel is a project expo judging system (github.com/anishathalye)
3 points
mnem
10 years ago
1 comment
94.
Show HN: NyaayWatch – Observability layer for the Indian judiciary (nyaaywatch.in)
3 points
Rudraksh06
a month ago
discuss
95.
Show HN: Signals – finding the most informative agent traces without LLM judges (arxiv.org)
3 points
sparacha
2 months ago
discuss
96.
Show HN: Cognition-wheel – parallel LLM fusion with bias masking and judging (github.com/Hormold)
3 points
Hormold
a year ago
discuss
97.
Justice: Yet Another Online Judge (github.com)
3 points
liumangchao
7 years ago
discuss
98.
Show HN: Grading Notes for LLM-as-Judge (github.com/shabie)
2 points
shabie
2 years ago
3 comments
99.
Show HN: pg_roast – A Postgres extension that harshly judges your database (github.com/samirketema)
2 points
samirketema
2 months ago
1 comment
100.
Open-source LLM-as-judge eval suite with root cause analysis and failure mining (github.com/colingfly)
2 points
colinfly
3 months ago
1 comment
101.
Show HN: Yet Another Online Judge Implementation (github.com)
2 points
zsgsdesign
7 years ago
1 comment
102.
Ask HN: Criteria for judging JavaScript project?
2 points
octref
11 years ago
1 comment
103.
Hey Jude as a vbScript (github.com/mockmyberet)
2 points
tommybecker
13 years ago
discuss
104.
Codejudge: A lightweight online judge (github.com/sankha93)
2 points
sankha93
13 years ago
discuss
105.
Show HN: CoJudge – open-source, offline judge for studying LC-style problems (github.com/cojudge)
2 points
ansliy
7 months ago
discuss
106.
Evaluating Large Language Models Using LLM-as-a-Judge (github.com/aws-samples)
2 points
mooreds
2 years ago
discuss
107.
Scruples: Corpus of ethical judgments extracted from Reddit (github.com/allenai)
2 points
nikochiko
6 years ago
discuss
108.
JHU CSSE Covid-19 Data Repo Removes Information on Palestine (github.com/CSSEGISandData)
2 points
jnmandal
6 years ago
discuss
109.
Novel Coronavirus (Covid-19) Cases, Provided by JHU CSSE (github.com/CSSEGISandData)
2 points
itbeho
6 years ago
discuss
110.
Covid-19: Novel Coronavirus (Covid-19) Cases, Provided by JHU CSSE (github.com/CSSEGISandData)
2 points
DyslexicAtheist
6 years ago
discuss
111.
Coderunner – A judge for your programs, run and test your programs through Python (github.com/codeclassroom)
2 points
bhupesh
7 years ago
discuss
112.
Show HN: A command line interface to UVA online judge (competitive programming) (github.com/scvalencia)
2 points
scvalencia
10 years ago
discuss
113.
Show HN: Meaning-Based Judgment Simulation for LLM Interfaces
1 point
GENIXUS
a year ago
2 comments
114.
Show HN: Judgment Boundary – Stop as a First-Class Outcome for AI Systems (github.com/Nick-heo-eg)
1 point
echoos
4 months ago
1 comment
115.
LLM Position Bias Benchmark: Swapped-Order Pairwise Judging (github.com/lechmazur)
1 point
zone411
a month ago
discuss
116.
Show HN: Claude-relais – A plan/build/judge loop mixing Claude with Cursor (github.com/clementrog)
1 point
crog
4 months ago
discuss
117.
Local Agent Bench: Test 11 small LLMs on tool-calling judgment, on CPU, no GPU (github.com/MikeVeerman)
1 point
MikeVeerman
4 months ago
discuss
118.
Precision-Based Sampling of LLM Judges (sunnybak.net)
1 point
sunny-bak
a year ago
discuss
119.
Show HN: Lone Arena – Self-hosted LLM human evaluation, you be the judge (github.com/Contextualist)
1 point
Contextualist
2 years ago
discuss
120.
Collection of TypeScript type challenges with online judge (github.com/type-challenges)
1 point
max-m
2 years ago
discuss