Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
361.
Show HN: Metaprogramming in PHP without eval. Still evil? Hells yeah it is. (github.com/adlawson)
2 points
adlawson
12 years ago
1 comment
362.
Go-webkit2: WebKit bindings for Go (w/headless support & JavaScript evaluation) (sourcegraph.com)
2 points
sqs
13 years ago
1 comment
363.
Cloning Bench: Evaluating AI Agents on Visual Website Cloning (github.com/vibrantlabsai)
2 points
shahules
2 months ago
1 comment
364.
Show HN: TrustVector – Trust evaluations for AI models, agents, & MCP (github.com/guard0-ai)
2 points
hckdisc
4 months ago
1 comment
365.
A Comprehensive Benchmark for Document Parsing and Evaluation (2025) (github.com/opendatalab)
2 points
oceansky
4 months ago
1 comment
366.
Show HN: 3.4575x Neuralink compression – highest that passes eval.sh (github.com/hxrdxkxvxd)
2 points
hxrdxk-
6 months ago
1 comment
367.
Vera-MH: open-source eval for chatbot safety in mental health (github.com/SpringCare)
2 points
__lucab
7 months ago
1 comment
368.
Evaluations are crucial, but what should you eval on? (github.com/langwatch)
2 points
draismaa
a year ago
1 comment
369.
ModelClash: Dynamic LLM Evaluation Through AI Duels (github.com/mrconter1)
2 points
AIReach
2 years ago
1 comment
370.
Neovim interactive evaluation for Lisp family languages (github.com/Olical)
2 points
pretext
2 years ago
1 comment
371.
A new choice for Model Evaluation: Starwhale (github.com/star-whale)
2 points
liutianweidlut
3 years ago
1 comment
372.
Grape (Graph Representation LeArning, Predictions and Evaluation) (github.com/AnacletoLAB)
2 points
bryanrasmussen
3 years ago
1 comment
373.
AgentBench: Evaluating LLMs as Agents (github.com/THUDM)
2 points
tikkun
3 years ago
1 comment
374.
Code Evaluate Play Loop (Common Lisp OpenGL Library and Environment) (github.com/cbaggers)
2 points
podiki
5 years ago
1 comment
375.
A Java s-expression evaluator to express logic as part of data (github.com/kannangce)
2 points
kannangce
6 years ago
1 comment
376.
Show HN: Evaluating Computational Creativity (github.com/CreaPar)
2 points
rcorcs
10 years ago
1 comment
377.
Show HN: Klipse – code evaluator pluggable on a web page clojure/ruby/JavaScript (github.com/viebel)
2 points
viebel
10 years ago
1 comment
378.
Power Assert in Elixir – Shows evaluation results each expression (github.com/ma2gedev)
2 points
ma2ge
10 years ago
1 comment
379.
A simple Scheme interpreter that can run the metacircular evaluator (github.com/hmgle)
2 points
gleport
11 years ago
discuss
380.
[JS] Eval-like extendable function to solve math expressions from strings (github.com/aviaryan)
2 points
parvarez
11 years ago
discuss
381.
Extensible, fast and secure Scala expression evaluation engine (github.com/ghik)
2 points
luu
11 years ago
discuss
382.
Clojure plugin for vim(omni-complete, repl, eval...) (github.com/tpope)
2 points
irahul
13 years ago
discuss
383.
Pythonic lazy evaluation (poulejapon.github.com)
2 points
gklein
13 years ago
discuss
384.
Marketing skill for Claude with 26 evals – +20pp over baseline (github.com/inerrata)
2 points
healman
8 days ago
discuss
385.
Fast, portable, non-Turing complete expression evaluation with gradual typing (github.com/google)
2 points
tjek
16 days ago
discuss
386.
Show HN: Nexa-Gauge – LLM eval framework, now with self-hosted model support (github.com/harnexa)
2 points
Sardhendu
23 days ago
discuss
387.
How many of us are evaling our skills? (github.com/BintzGavin)
2 points
GavinBintz
a month ago
discuss
388.
Show HN: Verdict – model evals on your own data, not someone else's benchmark (github.com/aevyraai)
2 points
agunapal
a month ago
discuss
389.
Show HN: SkillCompass – open-source quality evaluator for your AI skills (github.com/Evol-ai)
2 points
yo103jg
2 months ago
discuss
390.
Stockfish removes classical evaluation functions in favor of NNUE only (2023) (github.com/official-stockfish)
2 points
knuckleheads
2 months ago
discuss
More