Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
361.
▲
Show HN: Metaprogramming in PHP without eval. Still evil? Hells yeah it is.
(github.com/adlawson)
2 points
adlawson
12 years ago
1 comment
362.
▲
Go-webkit2: WebKit bindings for Go (w/headless support & JavaScript evaluation)
(sourcegraph.com)
2 points
sqs
13 years ago
1 comment
363.
▲
Cloning Bench: Evaluating AI Agents on Visual Website Cloning
(github.com/vibrantlabsai)
2 points
shahules
2 months ago
1 comment
364.
▲
Show HN: TrustVector – Trust evaluations for AI models, agents, & MCP
(github.com/guard0-ai)
2 points
hckdisc
4 months ago
1 comment
365.
▲
A Comprehensive Benchmark for Document Parsing and Evaluation (2025)
(github.com/opendatalab)
2 points
oceansky
4 months ago
1 comment
366.
▲
Show HN: 3.4575x Neuralink compression – highest that passes eval.sh
(github.com/hxrdxkxvxd)
2 points
hxrdxk-
6 months ago
1 comment
367.
▲
Vera-MH: open-source eval for chatbot safety in mental health
(github.com/SpringCare)
2 points
__lucab
7 months ago
1 comment
368.
▲
Evaluations are crucial, but what should you eval on?
(github.com/langwatch)
2 points
draismaa
a year ago
1 comment
369.
▲
ModelClash: Dynamic LLM Evaluation Through AI Duels
(github.com/mrconter1)
2 points
AIReach
2 years ago
1 comment
370.
▲
Neovim interactive evaluation for Lisp family languages
(github.com/Olical)
2 points
pretext
2 years ago
1 comment
371.
▲
A new choice for Model Evaluation: Starwhale
(github.com/star-whale)
2 points
liutianweidlut
3 years ago
1 comment
372.
▲
Grape (Graph Representation LeArning, Predictions and Evaluation)
(github.com/AnacletoLAB)
2 points
bryanrasmussen
3 years ago
1 comment
373.
▲
AgentBench: Evaluating LLMs as Agents
(github.com/THUDM)
2 points
tikkun
3 years ago
1 comment
374.
▲
Code Evaluate Play Loop (Common Lisp OpenGL Library and Environment)
(github.com/cbaggers)
2 points
podiki
5 years ago
1 comment
375.
▲
A Java s-expression evaluator to express logic as part of data
(github.com/kannangce)
2 points
kannangce
6 years ago
1 comment
376.
▲
Show HN: Evaluating Computational Creativity
(github.com/CreaPar)
2 points
rcorcs
10 years ago
1 comment
377.
▲
Show HN: Klipse – code evaluator pluggable on a web page clojure/ruby/JavaScript
(github.com/viebel)
2 points
viebel
10 years ago
1 comment
378.
▲
Power Assert in Elixir – Shows evaluation results each expression
(github.com/ma2gedev)
2 points
ma2ge
10 years ago
1 comment
379.
▲
A simple Scheme interpreter that can run the metacircular evaluator
(github.com/hmgle)
2 points
gleport
11 years ago
discuss
380.
▲
[JS] Eval-like extendable function to solve math expressions from strings
(github.com/aviaryan)
2 points
parvarez
11 years ago
discuss
381.
▲
Extensible, fast and secure Scala expression evaluation engine
(github.com/ghik)
2 points
luu
11 years ago
discuss
382.
▲
Clojure plugin for vim(omni-complete, repl, eval...)
(github.com/tpope)
2 points
irahul
13 years ago
discuss
383.
▲
Pythonic lazy evaluation
(poulejapon.github.com)
2 points
gklein
13 years ago
discuss
384.
▲
Marketing skill for Claude with 26 evals – +20pp over baseline
(github.com/inerrata)
2 points
healman
8 days ago
discuss
385.
▲
Fast, portable, non-Turing complete expression evaluation with gradual typing
(github.com/google)
2 points
tjek
16 days ago
discuss
386.
▲
Show HN: Nexa-Gauge – LLM eval framework, now with self-hosted model support
(github.com/harnexa)
2 points
Sardhendu
23 days ago
discuss
387.
▲
How many of us are evaling our skills?
(github.com/BintzGavin)
2 points
GavinBintz
a month ago
discuss
388.
▲
Show HN: Verdict – model evals on your own data, not someone else's benchmark
(github.com/aevyraai)
2 points
agunapal
a month ago
discuss
389.
▲
Show HN: SkillCompass – open-source quality evaluator for your AI skills
(github.com/Evol-ai)
2 points
yo103jg
2 months ago
discuss
390.
▲
Stockfish removes classical evaluation functions in favor of NNUE only (2023)
(github.com/official-stockfish)
2 points
knuckleheads
2 months ago
discuss
More