Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
181.
Benchmark GGUF models with a one line of code (github.com/NexaAI)
1 point
mountainview
2 years ago
discuss
182.
Benchmark GGUF models with a ONE line of code (github.com/NexaAI)
1 point
jinqueeny
2 years ago
discuss
183.
Evaluation of robotics data recording file formats (github.com/foxglove)
1 point
ahamez
4 years ago
discuss
184.
Full LLM training and evaluation toolkit (github.com/huggingface)
249 points
testerui
2 years ago
6 comments
185.
RouteLLM: A framework for serving and evaluating LLM routers (github.com/lm-sys)
244 points
djhu9
2 years ago
36 comments
186.
Show HN: PromptTools – open-source tools for evaluating LLMs and vector DBs (github.com/hegelai)
211 points
krawfy
3 years ago
24 comments
187.
Interactive GCC (igcc) is a read-eval-print loop (REPL) for C/C++ (github.com/alexandru-dinu)
170 points
pr337h4m
3 years ago
69 comments
188.
Comptime – C# meta-programming with compile-time code generation and evaluation (github.com/sebastienros)
150 points
bj-rn
6 months ago
66 comments
189.
Code, Eval, Play, Loop – Common Lisp OpenGL Environment (github.com/cbaggers)
139 points
_zhqs
11 years ago
18 comments
190.
Apache HTTP Server: 'RewriteCond expr' always evaluates to true (github.com/apache)
136 points
Bogdanp
10 months ago
70 comments
191.
Lave: eval in reverse (github.com/jed)
133 points
danso
10 years ago
37 comments
192.
Show HN: Faster LLM evaluation with Bayesian optimization (github.com/rentruewang)
131 points
renchuw
2 years ago
43 comments
193.
Show HN: Ragas – Open-source library for evaluating RAG pipelines (github.com/explodinggradients)
121 points
shahules
2 years ago
26 comments
194.
LispE: Lisp Interpreter with Pattern Programming and Lazy Evaluation (github.com/naver)
119 points
PaulHoule
4 months ago
25 comments
195.
Launch HN: Confident AI (YC W25) – Open-source evaluation framework for LLM apps
117 points
jeffreyip
a year ago
27 comments
196.
A Fast Excel Formula Parser and Evaluator (github.com/LesterLyu)
106 points
EntICOnc
4 years ago
34 comments
197.
Show HN: Eole, a Lévy-optimal lambda calculus evaluator written in Rust (github.com/HerrmannM)
106 points
HerrmannM
7 years ago
9 comments
198.
Evaluation of Deep Learning Toolkits (github.com/zer0n)
94 points
marcelsalathe
10 years ago
10 comments
199.
Show HN: Lazy evaluation in Python (github.com/llllllllll)
88 points
joejev
11 years ago
21 comments
200.
Adk-go: code-first Go toolkit for building, evaluating, and deploying AI agents (github.com/google)
86 points
maxloh
7 months ago
24 comments
201.
Show HN: Opik, an open source LLM evaluation framework (github.com/comet-ml)
86 points
calebkaiser
2 years ago
15 comments
202.
PhaseLLM: Standardized Chat LLM API (Cohere, Claude, GPT) + Evaluation Framework (github.com/wgryc)
86 points
cl42
3 years ago
3 comments
203.
AutoMLPipeline – Create and evaluate machine learning pipeline architectures (github.com/IBM)
80 points
bwidlar
6 years ago
13 comments
204.
Cedar is an open source policy language and evaluation engine (github.com)
72 points
mooreds
3 years ago
17 comments
205.
Show HN: Paramount – Human Evals of AI Customer Support (github.com/ask-fini)
71 points
hakimk
2 years ago
44 comments
206.
Evaluate Markdown code blocks within Vim (github.com/gpanders)
68 points
pentestercrab
2 years ago
18 comments
207.
Show HN: LazyCode – C++14 composable, lazily evaluated map, filter, fold (github.com/SaadAttieh)
66 points
SaadAttieh
7 years ago
5 comments
208.
A Case for Safe Eval (github.com/robert-j-webb)
58 points
_pabj
8 years ago
42 comments
209.
TensorFlow Model Analysis – A library for evaluating TensorFlow models (github.com/tensorflow)
58 points
wjarek
8 years ago
12 comments
210.
Show HN: A MCP server to evaluate Python code in WASM VM using RustPython (github.com/tuananh)
41 points
tuananh
a year ago
13 comments
More