Search: github.com/eval | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

181.

Benchmark GGUF models with a one line of code (github.com/NexaAI)

1 point

2 years ago

182.

Benchmark GGUF models with a ONE line of code (github.com/NexaAI)

1 point

2 years ago

183.

Evaluation of robotics data recording file formats (github.com/foxglove)

1 point

4 years ago

184.

Full LLM training and evaluation toolkit (github.com/huggingface)

249 points

2 years ago

185.

RouteLLM: A framework for serving and evaluating LLM routers (github.com/lm-sys)

244 points

2 years ago

186.

Show HN: PromptTools – open-source tools for evaluating LLMs and vector DBs (github.com/hegelai)

211 points

3 years ago

187.

Interactive GCC (igcc) is a read-eval-print loop (REPL) for C/C++ (github.com/alexandru-dinu)

170 points

3 years ago

188.

Comptime – C# meta-programming with compile-time code generation and evaluation (github.com/sebastienros)

150 points

6 months ago

189.

Code, Eval, Play, Loop – Common Lisp OpenGL Environment (github.com/cbaggers)

139 points

11 years ago

190.

Apache HTTP Server: 'RewriteCond expr' always evaluates to true (github.com/apache)

136 points

10 months ago

191.

Lave: eval in reverse (github.com/jed)

133 points

10 years ago

192.

Show HN: Faster LLM evaluation with Bayesian optimization (github.com/rentruewang)

131 points

2 years ago

193.

Show HN: Ragas – Open-source library for evaluating RAG pipelines (github.com/explodinggradients)

121 points

2 years ago

194.

LispE: Lisp Interpreter with Pattern Programming and Lazy Evaluation (github.com/naver)

119 points

4 months ago

195.

Launch HN: Confident AI (YC W25) – Open-source evaluation framework for LLM apps

117 points

a year ago

196.

A Fast Excel Formula Parser and Evaluator (github.com/LesterLyu)

106 points

4 years ago

197.

Show HN: Eole, a Lévy-optimal lambda calculus evaluator written in Rust (github.com/HerrmannM)

106 points

7 years ago

198.

Evaluation of Deep Learning Toolkits (github.com/zer0n)

94 points

10 years ago

199.

Show HN: Lazy evaluation in Python (github.com/llllllllll)

88 points

11 years ago

200.

Adk-go: code-first Go toolkit for building, evaluating, and deploying AI agents (github.com/google)

86 points

7 months ago

201.

Show HN: Opik, an open source LLM evaluation framework (github.com/comet-ml)

86 points

2 years ago

202.

PhaseLLM: Standardized Chat LLM API (Cohere, Claude, GPT) + Evaluation Framework (github.com/wgryc)

86 points

3 years ago

203.

AutoMLPipeline – Create and evaluate machine learning pipeline architectures (github.com/IBM)

80 points

6 years ago

204.

Cedar is an open source policy language and evaluation engine (github.com)

72 points

3 years ago

205.

Show HN: Paramount – Human Evals of AI Customer Support (github.com/ask-fini)

71 points

2 years ago

206.

Evaluate Markdown code blocks within Vim (github.com/gpanders)

68 points

2 years ago

207.

Show HN: LazyCode – C++14 composable, lazily evaluated map, filter, fold (github.com/SaadAttieh)

66 points

7 years ago

208.

A Case for Safe Eval (github.com/robert-j-webb)

58 points

8 years ago

209.

TensorFlow Model Analysis – A library for evaluating TensorFlow models (github.com/tensorflow)

58 points

8 years ago

210.

Show HN: A MCP server to evaluate Python code in WASM VM using RustPython (github.com/tuananh)

41 points

a year ago