Heykuki News
Top
New
Best
Ask
Show
Jobs
841.
▲
LoCoMo AI Benchmark: 6.4% of answer key wrong, judge accepts 63% of fake answers
(github.com/dial481)
3 points
dial481
3 months ago
3 comments
842.
▲
Benchmarking PostgreSQL vs. CockroachDB
(github.com/philippta)
3 points
philippta
2 years ago
3 comments
843.
▲
Show HN: Time Series Benchmark TurboPFor,TurboFloat,TurboFloat LzX,TurboGorilla
(github.com/powturbo)
3 points
powturbo
3 years ago
3 comments
844.
▲
PhantomGPU: GPU performance emulator to benchmark ML models on virtual GPUs
(github.com/bugthesystem)
3 points
ziyasal
a year ago
2 comments
845.
▲
Ask HN: How to benchmark different LLM models in parallel?
3 points
dhruvagga
2 years ago
2 comments
846.
▲
AIBuildAI – An AI agent that automatically builds AI models (#1 on MLE-Bench)
(github.com/aibuildai)
3 points
ruz048
3 months ago
1 comment
847.
▲
Show HN: Benchmarking Apple Silicon unified mem for GPU-accelerated SQL analysis
(github.com/sadopc)
3 points
sadopc
4 months ago
1 comment
848.
▲
Mcpbr: Stop guessing and evaluate your MCP server against standard benchmarks
(github.com/greynewell)
3 points
captradeoff
5 months ago
1 comment
849.
▲
PrinceJS: Benchmark Corrections and Lessons from a 13-Year-Old Developer
(github.com/MatthewTheCoder1218)
3 points
lilprince1218
7 months ago
1 comment
850.
▲
Show HN: Open Operator Evals – real-world benchmarks for LLM web agents
(github.com/nottelabs)
3 points
monoid73
a year ago
1 comment
851.
▲
Show HN: AWS CIS Benchmarks Compliance with Kexa (Open Source)
(github.com/4urcloud)
3 points
4urcloud_adrien
2 years ago
1 comment
852.
▲
.NET MAUI Native vs. Self-Drawn Controls Benchmark
(github.com/jsuarezruiz)
3 points
zigzag312
4 years ago
1 comment
853.
▲
Show HN: Graphsignal – benchmark and profile machine learning anywhere
(github.com/graphsignal)
3 points
complexity2
4 years ago
1 comment
854.
▲
Zig-gamedev project: DirectX 12 Mesh Shader benchmark
(github.com/michal-z)
3 points
michal-z
4 years ago
1 comment
855.
▲
A great optimization for CPython with great benchmarking results
(github.com/python)
3 points
androdiscens
5 years ago
1 comment
856.
▲
Show HN: Open-Source Performance Metrics and Benchmarks for Machine Learning/NLP
(github.com/codeforequity-at)
3 points
ftreml
6 years ago
1 comment
857.
▲
Show HN: A simple python network benchmark tool
(github.com/ljishen)
3 points
shiwo
8 years ago
1 comment
858.
▲
Benchmarking Nginx with Go
(gist.github.com)
3 points
strzalek
11 years ago
discuss
859.
▲
A bunch of good git tutorials
(gist.github.com)
3 points
jaseemabid
13 years ago
discuss
860.
▲
Why Vector Search fails at LLM memory (and a benchmark to prove it)
(github.com/tenurehq)
3 points
decorner
4 days ago
discuss
861.
▲
Research repository for the Americas – benchmarks, models, governance
(github.com/GENIA-Americas)
3 points
emergingrulek12
10 days ago
discuss
862.
▲
We built a charting benchmark suite: ChartGPU, Plotly, ECharts, SciChart
(github.com/abtsoftware)
3 points
abtgraphics
2 months ago
discuss
863.
▲
ParseBench: Document Parsing Benchmark for AI Agents
(github.com/run-llama)
3 points
firasd
2 months ago
discuss
864.
▲
Show HN: FC-Eval – CLI to Benchmark Local or Cloud LLMs on Function Calling
(github.com/gauravvij)
3 points
gauravvij137
3 months ago
discuss
865.
▲
Show HN: LLM Sycophancy Benchmark: Opposite-Narrator Contradictions
(github.com/lechmazur)
3 points
zone411
3 months ago
discuss
866.
▲
Elixir, Kotlin, C# Outperform Python, TypeScript and Go on AutoCode Benchmark
(github.com/Tencent-Hunyuan)
3 points
bnchrch
4 months ago
discuss
867.
▲
Benchmarking Postgres for FTS with TOASTed JSONBs and GINs Against Elasticsearch
(github.com/inevolin)
3 points
thunderbong
5 months ago
discuss
868.
▲
Show HN: llmnop – Rust CLI for benchmarking LLM endpoints
(github.com/jpreagan)
3 points
jpreagan
5 months ago
discuss
869.
▲
LZAV 5.7: Enhanced Compression, Speed, C++ Compliance, and Benchmark Results
(github.com/avaneev)
3 points
birdculture
6 months ago
discuss
870.
▲
80.1 % on LoCoMo Long-Term Memory Benchmark with a pure open-source RAG pipeline
3 points
ViktorKuz
6 months ago
discuss