Search: github.com/eval | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

211.

Show HN: Tonic Validate Metrics – an open-source RAG evaluation metrics package (github.com/TonicAI)

40 points

3 years ago

212.

Generic engine to evaluate logical circuits on homomorphic encryption (github.com/virtualsecureplatform)

38 points

5 years ago

213.

Stop Evaluating LLMs on Vibes (github.com/truera)

35 points

3 years ago

214.

Show HN: Create LLM graders and run evals in JavaScript with one file (github.com/bolt-foundry)

28 points

a year ago

215.

Show HN: SumEval – Multi-language evaluation framework for text summarization (github.com/chakki-works)

25 points

9 years ago

216.

λ-calculus evaluator (zaach.github.com)

24 points

16 years ago

217.

Evaluate Scheme in Ruby's virtual machine (gist.github.com)

24 points

14 years ago

218.

Numexpr: Fast numerical array expression evaluator for Python, NumPy, Pandas (github.com/pydata)

23 points

19 days ago

219.

Show HN: Phoenix OSS – Applying LLM Spans, Traces, and Evals for AI Insights (github.com/Arize-ai)

23 points

3 years ago

220.

Show HN: I implemented evals metrics for LLMs that runs locally on your machine (github.com/confident-ai)

22 points

2 years ago

221.

Utility to estimate tasks using PERT (Program evaluation and review technique) (github.com/arzzen)

22 points

10 years ago

222.

Thorn in a HaizeStack test for evaluating long-context adversarial robustness (github.com/haizelabs)

19 points

2 years ago

223.

Math.mk - GNUmake eval gone wild (github.com/adam-f)

19 points

14 years ago

224.

Show HN: DeepEval – Evaluation and Unit Testing for LLMs (github.com/confident-ai)

18 points

3 years ago

225.

Python Search – eval(raw_input()) (github.com)

17 points

12 years ago

226.

Show HN: Ragas – Open-source library for evals and testing RAG systems (github.com/explodinggradients)

15 points

2 years ago

227.

Show HN: An Empirical Evaluation of Linear Probing Algorithms (github.com/senderista)

14 points

7 years ago

228.

Show HN: Evaluate LLM-based RAG Applications with automated test set generation (github.com/Giskard-AI)

13 points

2 years ago

229.

Common Expression Language (CEL); lightweight expression evaluation (github.com/google)

12 points

Wxc2jjJmST9XWWL

5 years ago

230.

How Erlang evaluates funs (i.e. lambdas) (gist.github.com)

12 points

17 years ago

231.

Show HN: UpTrain (YC W23) – open-source tool to evaluate LLM response quality (demo.uptrain.ai)

12 points

3 years ago

232.

Show HN: Open-source toolkit for ML model evaluation and active learning (github.com/encord-team)

11 points

3 years ago

233.

Fexl – Highly robust functional evaluation (github.com/chkoreff)

10 points

12 years ago

234.

Show HN: Kiln – AI Boilerplate with Evals, Fine-Tuning, Synthetic Data, and Git (github.com/Kiln-AI)

10 points

10 months ago

235.

Pixar just open sourced their high-performance subdivision evaluator (github.com/PixarAnimationStudios)

10 points

14 years ago

236.

Show HN: C++ Mathematical Expression Parser and Evaluation Benchmark (github.com/ArashPartow)

10 points

8 years ago

237.

Can ELO tournaments be used to evaluate LLMs and RAG? (github.com/zetaalphavector)

9 points

3 years ago

238.

Show HN: Evolve expressions that evaluate to a target number (github.com/yati-sagade)

8 points

11 years ago

239.

Rllab – framework for developing and evaluating reinforcement learning algorithms (github.com/rllab)

8 points

10 years ago

240.

Show HN: Code-Knack – A code evaluator on your web page (github.com/lyricat)

8 points

7 years ago