Heykuki News

751.
Google/fuzzbench: Fuzzer benchmarking as a service (github.com/google)
11 points
edward
6 years ago
discuss
752.
A benchmark to compare synchronization techniques for multicore programming (github.com/gramoli)
11 points
wsmith
10 years ago
discuss
753.
HTTP benchmarking tool written in Crystal (github.com/Sdogruyol)
11 points
sdogruyol
11 years ago
discuss
754.
Show HN: Codex context bloat? 87% avg reduction on SWE-bench Verified traces (npmjs.com)
10 points
george_ciobanu
a month ago
2 comments
755.
A mind-bending simulation of the movie Inception, in C and ASM. (github.com/karthick18)
10 points
jlangenauer
16 years ago
discuss
756.
Show HN: LLM Debate Benchmark (github.com/lechmazur)
9 points
zone411
3 months ago
3 comments
757.
Recursive grep written in Go benched against a C++ and Rust variant (github.com/bep)
9 points
bjornerik
a month ago
2 comments
758.
LLM Persuasion Benchmark: Multi-Turn Persuasion Between Models (github.com/lechmazur)
9 points
zone411
2 months ago
discuss
759.
You Do Not Need a Vector Database (For RAG): Benchmarking IR Methods
9 points
ylow
3 years ago
discuss
760.
Trivial PHP string concatenation benchmarks, proving time better spent elsewhere. (github.com/magnetikonline)
8 points
magnetikonline
12 years ago
6 comments
761.
Real-world benchmarks (gist.github.com)
8 points
geelen
13 years ago
2 comments
762.
Show HN: Bazaar – a new LLM benchmark for economic reasoning under uncertainty (github.com/lechmazur)
8 points
zone411
a year ago
1 comment
763.
OpenChat_8192 Beats ChatGPT-3.5 on Vicuna GPT-4 Benchmark
8 points
thibo_skabgia
3 years ago
1 comment
764.
Raspberry Pi httpd micro benchmark (gist.github.com)
8 points
mpg123
10 years ago
1 comment
765.
Show HN: LLM Creative Story‑Writing Benchmark V3 (github.com/lechmazur)
8 points
zone411
9 months ago
discuss
766.
Show HN: LLM Divergent Thinking Creativity Benchmark (github.com/lechmazur)
8 points
zone411
a year ago
discuss
767.
Show HN: Iron Cushion, a CouchDB benchmark and load testing tool (github.com/mgp)
8 points
shadowmatter
14 years ago
discuss
768.
Show HN: Mini-swe-agent achieves 65% on SWE-bench in 100 lines of python (github.com/SWE-agent)
7 points
lieret
10 months ago
4 comments
769.
A caffeine driven, simplistic approach to benchmarking Node.js code. (github.com/logicalparadox)
7 points
vesln
14 years ago
3 comments
770.
Multi-Agent Step Race Benchmark: LLM Collaboration and Deception Under Pressure (github.com/lechmazur)
7 points
zone411
a year ago
1 comment
771.
Show HN: LLM Deceptiveness and Gullibility Benchmark (github.com/lechmazur)
7 points
zone411
2 years ago
1 comment
772.
Engulf, A Graphical HTTP Benchmarker written in Clojure + D3.js (github.com/andrewvc)
7 points
andrewvc
14 years ago
1 comment
773.
Wrk – an HTTP benchmarking tool (github.com/wg)
7 points
jnazario
13 years ago
discuss
774.
Show HN: Get a report on your compliance to CIS Benchmarks (Azure and AWS) (github.com/4urcloud)
7 points
adrien4urcloud
2 years ago
discuss
775.
Show HN: Ben, your benchmarking assistant, written in Go (github.com/drish)
7 points
drish
8 years ago
discuss
776.
Our classifier outperforms CatBoost, XGBoost, LightGBM on 5 benchmark datasets (github.com/LinearBoost)
6 points
hamid9
2 years ago
5 comments
777.
Ask HN: Are there any reliable benchmarks for Machine Learning Model Serving?
6 points
KuriousCat
2 years ago
3 comments
778.
Benchmark GGUF model with ONE line of code (github.com/NexaAI)
6 points
alanzhuly
2 years ago
1 comment
779.
New code-focused LLM needle in the haystack benchmark (github.com/HammingHQ)
6 points
sumanyusharma
2 years ago
1 comment
780.
Response to Google's Keras-PyTorch Benchmarks (gist.github.com)
6 points
mindcrime
2 years ago
1 comment