Search: github.com/eval | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

361.

Show HN: Metaprogramming in PHP without eval. Still evil? Hells yeah it is. (github.com/adlawson)

2 points

12 years ago

362.

Go-webkit2: WebKit bindings for Go (w/headless support & JavaScript evaluation) (sourcegraph.com)

2 points

13 years ago

363.

Cloning Bench: Evaluating AI Agents on Visual Website Cloning (github.com/vibrantlabsai)

2 points

2 months ago

364.

Show HN: TrustVector – Trust evaluations for AI models, agents, & MCP (github.com/guard0-ai)

2 points

4 months ago

365.

A Comprehensive Benchmark for Document Parsing and Evaluation (2025) (github.com/opendatalab)

2 points

4 months ago

366.

Show HN: 3.4575x Neuralink compression – highest that passes eval.sh (github.com/hxrdxkxvxd)

2 points

6 months ago

367.

Vera-MH: open-source eval for chatbot safety in mental health (github.com/SpringCare)

2 points

7 months ago

368.

Evaluations are crucial, but what should you eval on? (github.com/langwatch)

2 points

a year ago

369.

ModelClash: Dynamic LLM Evaluation Through AI Duels (github.com/mrconter1)

2 points

2 years ago

370.

Neovim interactive evaluation for Lisp family languages (github.com/Olical)

2 points

2 years ago

371.

A new choice for Model Evaluation: Starwhale (github.com/star-whale)

2 points

3 years ago

372.

Grape (Graph Representation LeArning, Predictions and Evaluation) (github.com/AnacletoLAB)

2 points

3 years ago

373.

AgentBench: Evaluating LLMs as Agents (github.com/THUDM)

2 points

3 years ago

374.

Code Evaluate Play Loop (Common Lisp OpenGL Library and Environment) (github.com/cbaggers)

2 points

5 years ago

375.

A Java s-expression evaluator to express logic as part of data (github.com/kannangce)

2 points

6 years ago

376.

Show HN: Evaluating Computational Creativity (github.com/CreaPar)

2 points

10 years ago

377.

Show HN: Klipse – code evaluator pluggable on a web page clojure/ruby/JavaScript (github.com/viebel)

2 points

10 years ago

378.

Power Assert in Elixir – Shows evaluation results each expression (github.com/ma2gedev)

2 points

10 years ago

379.

A simple Scheme interpreter that can run the metacircular evaluator (github.com/hmgle)

2 points

11 years ago

380.

[JS] Eval-like extendable function to solve math expressions from strings (github.com/aviaryan)

2 points

11 years ago

381.

Extensible, fast and secure Scala expression evaluation engine (github.com/ghik)

2 points

11 years ago

382.

Clojure plugin for vim(omni-complete, repl, eval...) (github.com/tpope)

2 points

13 years ago

383.

Pythonic lazy evaluation (poulejapon.github.com)

2 points

13 years ago

384.

Marketing skill for Claude with 26 evals – +20pp over baseline (github.com/inerrata)

2 points

8 days ago

385.

Fast, portable, non-Turing complete expression evaluation with gradual typing (github.com/google)

2 points

16 days ago

386.

Show HN: Nexa-Gauge – LLM eval framework, now with self-hosted model support (github.com/harnexa)

2 points

23 days ago

387.

How many of us are evaling our skills? (github.com/BintzGavin)

2 points

a month ago

388.

Show HN: Verdict – model evals on your own data, not someone else's benchmark (github.com/aevyraai)

2 points

a month ago

389.

Show HN: SkillCompass – open-source quality evaluator for your AI skills (github.com/Evol-ai)

2 points

2 months ago

390.

Stockfish removes classical evaluation functions in favor of NNUE only (2023) (github.com/official-stockfish)

2 points

2 months ago