Search: github.com/eval | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

421.

Show HN: Paramount – OSS package for *Human* Evals of AI support (github.com/ask-fini)

2 points

2 years ago

422.

SDMetrics: Library for evaluating synthetic data quality (github.com/sdv-dev)

2 points

2 years ago

423.

Promptfoo – Testing and Evaluation for LLMs (github.com/promptfoo)

2 points

2 years ago

424.

Google DeepMind's research on uncertain ground truth in AI eval (github.com/google-deepmind)

2 points

3 years ago

425.

Show HN: Reference-free evaluation of LLM-powered chatbots (github.com/parea-ai)

2 points

3 years ago

426.

Ragas – Framework for RAG Evaluation (github.com/explodinggradients)

2 points

3 years ago

427.

AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models (github.com/ruixiangcui)

2 points

3 years ago

428.

RAGElo: Toolkit for evaluating RAG agents using tournament-style Elo ranking (github.com/zetaalphavector)

2 points

3 years ago

429.

Starwhale: A new MLOps platform for Model Evaluation (github.com/star-whale)

2 points

3 years ago

430.

ChainForge now supports chat evaluation (github.com/ianarawjo)

2 points

3 years ago

431.

Show HN: CLI for testing and evaluating LLM prompts and outputs (github.com/promptfoo)

2 points

3 years ago

432.

OSS for training, serving, and evaluating LLM based ChatBots (github.com/lm-sys)

2 points

3 years ago

433.

Show HN: XV - Expression Evaluator for C (github.com/tidwall)

2 points

3 years ago

434.

Croner: Trigger functions or evaluate cron expressions in JavaScript or TS (github.com/Hexagon)

2 points

3 years ago

435.

Haskell library for evaluating whether chess moves are allowed (github.com/ArnoVanLumig)

2 points

3 years ago

436.

Show HN: Brace Lang – parse brace groups and evaluate them however you want (github.com/xaedes)

2 points

4 years ago

437.

Show HN: Convert VHDL to Verilog using GHDL (+ first evaluation) (github.com/stnolting)

2 points

youre_the_voice

4 years ago

438.

SIMD Library for Evaluating Elementary Functions, Vectorized Libm and DFT (github.com/shibatch)

2 points

4 years ago

439.

PicoMath: Fast math evaluation library (C++ header-only) (github.com/Nitrillo)

2 points

4 years ago

440.

Parse and evaluate MS Excel formula in JavaScript (github.com/LesterLyu)

2 points

4 years ago

441.

Show HN: ANECompat, evaluate CoreML model compatibility with Apple Neural Engine (github.com/fredyshox)

2 points

4 years ago

442.

Paper Walkthrough: Is Automated Topic Model Evaluation Broken (github.com/acatovic)

2 points

4 years ago

443.

Lisp Evaluator for FreeBASIC (github.com/jayrm)

2 points

4 years ago

444.

Armory Adversarial Robustness Evaluation Test Bed (github.com/twosixlabs)

2 points

4 years ago

445.

Show HN: Lambda Calculus evaluation with type-annotations in TypeScript (github.com/EvolveYourMind)

2 points

4 years ago

446.

Damov: Methodology and Benchmark Suite for Evaluating Data Movement Bottlenecks (github.com/CMU-SAFARI)

2 points

5 years ago

447.

JavaScript lexical scope and eval explored (dbrans.github.com)

2 points

15 years ago

448.

ZickStandardLisp: A Lisp Evaluator in Lisp (github.com/zick)

2 points

5 years ago

449.

Evaluate my junior project on GitHub

2 points

6 years ago

450.

Datasets and Evaluation Metrics for NLP (True Open Source GPT Alternative) (github.com/huggingface)

2 points

6 years ago