Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
961.
PostgreSQL vs. ClickHouse: Learnings from building my first database benchmark (github.com/514-labs)
2 points
oatsandsugar
10 months ago
discuss
962.
Show HN: New SWE-bench leaderboard compares LMs without fancy agent scaffolds (swebench.com)
2 points
lieret
10 months ago
discuss
963.
Show HN: VDBbench 1.0: open-source benchmarking for VectorDBs (github.com/zilliztech)
2 points
Fendy
a year ago
discuss
964.
MAIR: A Benchmark for Evaluating Instructed Retrieval (github.com/sunnweiwei)
2 points
fzliu
a year ago
discuss
965.
Show HN: Comprehensive Benchmark Suite for Story Visualization (github.com/ViStoryBench)
2 points
hzwer
a year ago
discuss
966.
Show HN: Benchmarks agree with the complexity analysis of the TopoSort algorithm (github.com/williamw520)
2 points
ww520
a year ago
discuss
967.
Show HN: I built an open-source benchmark that evaluates LLMs through gameplay (llmshowdown.io)
2 points
jmogi
a year ago
discuss
968.
QuickBench: A Zero-Dependency Linux Benchmark for CPU, Memory, and Storage (github.com/bearstech)
2 points
kadrek
a year ago
discuss
969.
Elimination Game Benchmark: Social Reasoning, Strategy, and Deception in LLMs (github.com/lechmazur)
2 points
amichail
a year ago
discuss
970.
Latest Benchmarks Show 10x Faster Prefix Queries vs. Etcd
2 points
absolute7
a year ago
discuss
971.
C++ Showing std:swap faster than XOR trick to swap numbers via naive benchmark (github.com/vladov3000)
2 points
signa11
2 years ago
discuss
972.
Benchmarks Comparing PyTorch and MLX on Apple Silicon GPUs (github.com/LucasSte)
2 points
tosh
2 years ago
discuss
973.
Test/benchmark of using 32-bit pointers in 64-bit code on Windows (github.com/tringi)
2 points
thunderbong
2 years ago
discuss
974.
(JS) benchmark tooling that makes your heart warm (github.com/evanwashere)
2 points
devcat
2 years ago
discuss
975.
Show HN: Retrieval engine with SOTA performance on challenging RAG benchmarks (github.com/D-Star-AI)
2 points
zmccormick7
2 years ago
discuss
976.
miniF2F: Formal to Formal Mathematics Benchmark (github.com/openai)
2 points
tosh
2 years ago
discuss
977.
Pgdsat – Postgres database security assessment tool for CIS benchmarks (github.com/HexaCluster)
2 points
avi_vallarapu
2 years ago
discuss
978.
Spam-T5: Benchmarking Large Language Models for Few-Shot Email Spam Detection (github.com/jpmorganchase)
2 points
mariuz
2 years ago
discuss
979.
GPT-4-turbo-2024-04-09 "wins" simple evals benchmark (github.com/openai)
2 points
zurfer
2 years ago
discuss
980.
Benchmarks for JDK HTTP Server Running on Java 21 with Virtual Threads (github.com/ebarlas)
2 points
simonpure
2 years ago
discuss
981.
BEIR: A Heterogeneous Benchmark for Information Retrieval (github.com/beir-cellar)
2 points
dmezzetti
2 years ago
discuss
982.
Benchmarking C/C++ hash table libraries for small keys (github.com/attractivechaos)
2 points
attractivechaos
2 years ago
discuss
983.
AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models (github.com/ruixiangcui)
2 points
accrual
3 years ago
discuss
984.
Benchmarking Tool for Vector DBs (github.com/zilliztech)
2 points
fzliu
3 years ago
discuss
985.
Blossom Bindings (Re: Backbone Events vs Ember Bindings: A Benchmark) (fohr.github.com)
2 points
blktiger
14 years ago
discuss
986.
VectorDB benchmark for both cloud and open source (github.com/zilliztech)
2 points
liliuleo93
3 years ago
discuss
987.
SciTS: A tool to benchmark Time-series on different databases (github.com/jalalmostafa)
2 points
jalalmostafa
3 years ago
discuss
988.
Cloud Vector Database Benchmark Result (github.com/zilliztech)
2 points
liliuleo93
3 years ago
discuss
989.
Deduplication Solutions Benchmark (github.com/borgbackup)
2 points
todsacerdoti
3 years ago
discuss
990.
Vector Database Performance Benchmarking (github.com/zilliztech)
2 points
fzliu
3 years ago
discuss
More