Search: github.com/eval | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

121.

Job postings evaluator against your resume (Chrome extension) (github.com/alikh31)

1 point

5 months ago

122.

Policy Evaluation in Grid World (github.com/elliotvilhelm)

1 point

2 years ago

123.

Tracking an LLM Evaluator Using Comet (github.com/dair-ai)

1 point

3 years ago

124.

Propositional Logic Calculator (github.com/lion137)

1 point

7 years ago

125.

Parsing Mitre EDR Evaluation Results (github.com/zshehri)

1 point

7 years ago

126.

Go Expression Evaluation Comparison (github.com/antonmedv)

1 point

7 years ago

127.

Eval.js – A JavaScript interpreter written in JavaScript (github.com/marten-de-vries)

1 point

10 years ago

128.

Show HN: Fine-tuned Llama 3.2 3B to match 70B models for local transcripts (bilawal.net)

31 points

9 months ago

129.

Show HN: Sup AI, a confidence-weighted ensemble (52.15% on Humanity's Last Exam) (sup.ai)

26 points

2 months ago

130.

Show HN: Cognee – Open-Source AI Memory Layer That Remembers Context (github.com/topoteretes)

9 points

a year ago

131.

Show HN: PromptOptimizer – Minimize LLM token complexity to save cost (github.com/vaibkumr)

4 points

3 years ago

132.

Show HN: See – searchable JSON compression, smaller than ZSTD (on our data) (github.com/kodomonocch1)

3 points

4 months ago

133.

Show HN: Legal Action Boundary Eval for agentic legal workflows (github.com/bigkan8)

2 points

a month ago

134.

Show HN: BenchFlow – Open-Source Benchmark Hub and Eval Infra for AI Devs (docs.benchflow.ai)

1 point

a year ago

135.

Show HN: AI Product Hunter – GenAI reviews/scores "all"of Producthunt everyday (ai-producthunt.com)

1 point

2 years ago

136.

Eval($_POST[cmd]) (github.com)

12 points

11 years ago

137.

Evaluating Technical Arguments (swanson.github.com)

4 points

13 years ago

138.

Engineering JavaScript's eval (brownplt.github.com)

3 points

14 years ago

139.

In Go, some evaluation orders in multi-value assignments are unspecified (github.com/go101)

3 points

8 years ago

140.

Show HN: Dbt-LLM-evals – Monitor LLM quality in your data warehouse (github.com/paradime-io)

2 points

5 months ago

141.

Show HN: Synthetic Data Generation Using LangChain for IR and RAG Evaluation (github.com/mddunlap924)

2 points

2 years ago

142.

Automated evaluation of coding round interviews (github.com/shekhargulati)

2 points

9 years ago

143.

Evaluating Technical Arguments (swanson.github.com)

1 point

13 years ago

144.

Show HN: Social proof works 2-7x better on AI shopping agents than humans (github.com/aaronbatchelder)

1 point

3 months ago

145.

defer-import-eval: proposal for introducing a way to defer evaluate of a module (github.com/tc39)

1 point

10 months ago

146.

Evaluation of Covid-19 Models (github.com/youyanggu)

1 point

6 years ago

147.

Show HN: I forced Claude to play Tetris in Emacs (imgur.com)

13 points

2 months ago

148.

Show HN: Skyvern 2.0 – open-source AI Browser Agent scoring 85.8% on WebVoyager (eval.skyvern.com)

9 points

a year ago

149.

Ask HN: Survey: Scripting languages for realtime applications

2 points

9 years ago

150.

The Car Wash Problem: A variable isolation study on prompt architecture

2 points

4 months ago