Heykuki News

Top New Best Ask Show Jobs
841.
LoCoMo AI Benchmark: 6.4% of answer key wrong, judge accepts 63% of fake answers (github.com/dial481)
3 points
dial481
3 months ago
3 comments
842.
Benchmarking PostgreSQL vs. CockroachDB (github.com/philippta)
3 points
philippta
2 years ago
3 comments
843.
Show HN: Time Series Benchmark TurboPFor, TurboFloat, TurboFloat LzX, TurboGorilla (github.com/powturbo)
3 points
powturbo
3 years ago
3 comments
844.
PhantomGPU: GPU performance emulator to benchmark ML models on virtual GPUs (github.com/bugthesystem)
3 points
ziyasal
a year ago
2 comments
845.
Ask HN: How to benchmark different LLM models in parallel?
3 points
dhruvagga
2 years ago
2 comments
846.
AIBuildAI – An AI agent that automatically builds AI models (#1 on MLE-Bench) (github.com/aibuildai)
3 points
ruz048
3 months ago
1 comment
847.
Show HN: Benchmarking Apple Silicon unified mem for GPU-accelerated SQL analysis (github.com/sadopc)
3 points
sadopc
4 months ago
1 comment
848.
Mcpbr: Stop guessing and evaluate your MCP server against standard benchmarks (github.com/greynewell)
3 points
captradeoff
5 months ago
1 comment
849.
PrinceJS: Benchmark Corrections and Lessons from a 13-Year-Old Developer (github.com/MatthewTheCoder1218)
3 points
lilprince1218
7 months ago
1 comment
850.
Show HN: Open Operator Evals – real-world benchmarks for LLM web agents (github.com/nottelabs)
3 points
monoid73
a year ago
1 comment
851.
Show HN: AWS CIS Benchmarks Compliance with Kexa (Open Source) (github.com/4urcloud)
3 points
4urcloud_adrien
2 years ago
1 comment
852.
.NET MAUI Native vs. Self-Drawn Controls Benchmark (github.com/jsuarezruiz)
3 points
zigzag312
4 years ago
1 comment
853.
Show HN: Graphsignal – benchmark and profile machine learning anywhere (github.com/graphsignal)
3 points
complexity2
4 years ago
1 comment
854.
Zig-gamedev project: DirectX 12 Mesh Shader benchmark (github.com/michal-z)
3 points
michal-z
4 years ago
1 comment
855.
A great optimization for CPython with great benchmarking results (github.com/python)
3 points
androdiscens
5 years ago
1 comment
856.
Show HN: Open-Source Performance Metrics and Benchmarks for Machine Learning/NLP (github.com/codeforequity-at)
3 points
ftreml
6 years ago
1 comment
857.
Show HN: A simple python network benchmark tool (github.com/ljishen)
3 points
shiwo
8 years ago
1 comment
858.
Benchmarking Nginx with Go (gist.github.com)
3 points
strzalek
11 years ago
discuss
859.
A bunch of good git tutorials (gist.github.com)
3 points
jaseemabid
13 years ago
discuss
860.
Why Vector Search fails at LLM memory (and a benchmark to prove it) (github.com/tenurehq)
3 points
decorner
4 days ago
discuss
861.
Research repository for the Americas – benchmarks, models, governance (github.com/GENIA-Americas)
3 points
emergingrulek12
10 days ago
discuss
862.
We built a charting benchmark suite: ChartGPU, Plotly, ECharts, SciChart (github.com/abtsoftware)
3 points
abtgraphics
2 months ago
discuss
863.
ParseBench: Document Parsing Benchmark for AI Agents (github.com/run-llama)
3 points
firasd
2 months ago
discuss
864.
Show HN: FC-Eval – CLI to Benchmark Local or Cloud LLMs on Function Calling (github.com/gauravvij)
3 points
gauravvij137
3 months ago
discuss
865.
Show HN: LLM Sycophancy Benchmark: Opposite-Narrator Contradictions (github.com/lechmazur)
3 points
zone411
3 months ago
discuss
866.
Elixir, Kotlin, C# Outperform Python, TypeScript and Go on AutoCode Benchmark (github.com/Tencent-Hunyuan)
3 points
bnchrch
4 months ago
discuss
867.
Benchmarking Postgres for FTS with TOASTed JSONBs and GINs Against Elasticsearch (github.com/inevolin)
3 points
thunderbong
5 months ago
discuss
868.
Show HN: llmnop – Rust CLI for benchmarking LLM endpoints (github.com/jpreagan)
3 points
jpreagan
5 months ago
discuss
869.
LZAV 5.7: Enhanced Compression, Speed, C++ Compliance, and Benchmark Results (github.com/avaneev)
3 points
birdculture
6 months ago
discuss
870.
80.1% on LoCoMo Long-Term Memory Benchmark with a pure open-source RAG pipeline
3 points
ViktorKuz
6 months ago
discuss