Search: github.com/judge0 | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

31.

Show HN: CoJudge – open-source, offline judge for studying LC-style problems (github.com/cojudge)

2 points

7 months ago

32.

Evaluating Large Language Models Using LLM-as-a-Judge (github.com/aws-samples)

2 points

2 years ago

33.

Coderunner – A judge for your programs,run and test your programs through Python (github.com/codeclassroom)

2 points

7 years ago

34.

Show HN: A command line interface to UVA online judge (competitive programming) (github.com/scvalencia)

2 points

10 years ago

35.

Show HN: Claude-relais – A plan/build/judge loop mixing Claude with Cursor (github.com/clementrog)

1 point

4 months ago

36.

Precision-Based Sampling of LLM Judges (sunnybak.net)

1 point

a year ago

37.

Show HN: Lone Arena – Self-hosted LLM human evaluation, you be the judge (github.com/Contextualist)

1 point

2 years ago

38.

Collection of TypeScript type challenges with online judge (github.com/type-challenges)

1 point

2 years ago

39.

Show HN: A self hosted online judge for meetups and workshops, written in Go (github.com/MohamedBassem)

1 point

9 years ago

40.

Show HN: Minimal, self-hosted exercise tracker (github.com/bmtwl)

127 points

a year ago

41.

Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL (github.com/Danau5tin)

125 points

10 months ago

42.

Launch HN: Confident AI (YC W25) – Open-source evaluation framework for LLM apps

117 points

a year ago

43.

Show HN: SirixDB – Bitemporal binary JSON database system and event store (github.com/sirixdb)

109 points

3 years ago

44.

Launch HN: Traceloop (YC W23) – Detecting LLM Hallucinations with OpenTelemetry

101 points

2 years ago

45.

Show HN: Index – New Open Source browser agent (github.com/lmnr-ai)

98 points

a year ago

46.

Show HN: RULER – Easily apply RL to any agent (openpipe.ai)

81 points

a year ago

47.

Show HN: Torrix, self hosted, LLM Observability,(no Postgres, no Redis) (github.com/torrix-ai)

74 points

23 days ago

48.

Show HN: OCR Benchmark Focusing on Automation (nanonets.com)

58 points

a year ago

49.

Show HN: TensorZero – open-source data and learning flywheel for LLMs (github.com/tensorzero)

49 points

GabrielBianconi

2 years ago

50.

Show HN: Helicone (YC W23) – OSS LLM Observability and Development Platform (github.com/Helicone)

29 points

a year ago

51.

Show HN: Create LLM graders and run evals in JavaScript with one file (github.com/bolt-foundry)

28 points

a year ago

52.

Show HN: OSS sustain guard – Sustainability signals for OSS dependencies (onukura.github.io)

21 points

5 months ago

53.

Show HN: Anytype – a local and collaborative database with API and MCP server (zhanna.any.org)

20 points

a year ago

54.

Show HN: I built an open-source AI data layer that connects any LLM to any data (github.com/bagofwords1)

18 points

8 months ago

55.

Show HN: TinyFish Web Agent (82% on hard tasks vs. Operator's 43%) (tinyfish.ai)

17 points

4 months ago

56.

Show HN: Meta-agent: self-improving agent harnesses from live traces (github.com/canvas-org)

14 points

2 months ago

57.

Show HN: Ebiose – A Darwin‑Style Playground for Self‑Evolving AI Agents (github.com/ebiose-ai)

12 points

a year ago

58.

Show HN: OpenTiger – Autonomous dev orchestration that never stops (github.com/Andyyyy64)

11 points

3 months ago

59.

Show HN: Kiln – AI Boilerplate with Evals, Fine-Tuning, Synthetic Data, and Git (github.com/Kiln-AI)

10 points

10 months ago

60.

Show HN: Unsiloed AI – #1 on olmOCR-Bench

9 points

10 days ago