Heykuki News
Top
New
Best
Ask
Show
Jobs
751.
▲
Google/fuzzbench: Fuzzer benchmarking as a service
(github.com/google)
11 points
edward
6 years ago
discuss
752.
▲
A benchmark to compare synchronization techniques for multicore programming
(github.com/gramoli)
11 points
wsmith
10 years ago
discuss
753.
▲
HTTP benchmarking tool written in Crystal
(github.com/Sdogruyol)
11 points
sdogruyol
11 years ago
discuss
754.
▲
Show HN: Codex context bloat? 87% avg reduction on SWE-bench Verified traces
(npmjs.com)
10 points
george_ciobanu
a month ago
2 comments
755.
▲
A mind-bending simulation of the movie Inception, in C and ASM.
(github.com/karthick18)
10 points
jlangenauer
16 years ago
discuss
756.
▲
Show HN: LLM Debate Benchmark
(github.com/lechmazur)
9 points
zone411
3 months ago
3 comments
757.
▲
Recursive grep written in Go benched against a C++ and Rust variant
(github.com/bep)
9 points
bjornerik
a month ago
2 comments
758.
▲
LLM Persuasion Benchmark: Multi-Turn Persuasion Between Models
(github.com/lechmazur)
9 points
zone411
2 months ago
discuss
759.
▲
You Do Not Need a Vector Database (For RAG): Benchmarking IR Methods
9 points
ylow
3 years ago
discuss
760.
▲
Trivial PHP string concatenation benchmarks, proving time better spent elsewhere.
(github.com/magnetikonline)
8 points
magnetikonline
12 years ago
6 comments
761.
▲
Real-world benchmarks
(gist.github.com)
8 points
geelen
13 years ago
2 comments
762.
▲
Show HN: Bazaar – a new LLM benchmark for economic reasoning under uncertainty
(github.com/lechmazur)
8 points
zone411
a year ago
1 comment
763.
▲
OpenChat_8192 Beats ChatGPT-3.5 on Vicuna GPT-4 Benchmark
8 points
thibo_skabgia
3 years ago
1 comment
764.
▲
Raspberry Pi httpd micro benchmark
(gist.github.com)
8 points
mpg123
10 years ago
1 comment
765.
▲
Show HN: LLM Creative Story‑Writing Benchmark V3
(github.com/lechmazur)
8 points
zone411
9 months ago
discuss
766.
▲
Show HN: LLM Divergent Thinking Creativity Benchmark
(github.com/lechmazur)
8 points
zone411
a year ago
discuss
767.
▲
Show HN: Iron Cushion, a CouchDB benchmark and load testing tool
(github.com/mgp)
8 points
shadowmatter
14 years ago
discuss
768.
▲
Show HN: Mini-swe-agent achieves 65% on SWE-bench in 100 lines of Python
(github.com/SWE-agent)
7 points
lieret
10 months ago
4 comments
769.
▲
A caffeine driven, simplistic approach to benchmarking Node.js code.
(github.com/logicalparadox)
7 points
vesln
14 years ago
3 comments
770.
▲
Multi-Agent Step Race Benchmark: LLM Collaboration and Deception Under Pressure
(github.com/lechmazur)
7 points
zone411
a year ago
1 comment
771.
▲
Show HN: LLM Deceptiveness and Gullibility Benchmark
(github.com/lechmazur)
7 points
zone411
2 years ago
1 comment
772.
▲
Engulf, A Graphical HTTP Benchmarker written in Clojure + D3.js
(github.com/andrewvc)
7 points
andrewvc
14 years ago
1 comment
773.
▲
Wrk – an HTTP benchmarking tool
(github.com/wg)
7 points
jnazario
13 years ago
discuss
774.
▲
Show HN: Get a report on your compliance to CIS Benchmarks (Azure and AWS)
(github.com/4urcloud)
7 points
adrien4urcloud
2 years ago
discuss
775.
▲
Show HN: Ben, your benchmarking assistant, written in Go
(github.com/drish)
7 points
drish
8 years ago
discuss
776.
▲
Our classifier outperforms CatBoost, XGBoost, LightGBM on 5 benchmark datasets
(github.com/LinearBoost)
6 points
hamid9
2 years ago
5 comments
777.
▲
Ask HN: Are there any reliable benchmarks for Machine Learning Model Serving?
6 points
KuriousCat
2 years ago
3 comments
778.
▲
Benchmark GGUF model with ONE line of code
(github.com/NexaAI)
6 points
alanzhuly
2 years ago
1 comment
779.
▲
New code-focused LLM needle in the haystack benchmark
(github.com/HammingHQ)
6 points
sumanyusharma
2 years ago
1 comment
780.
▲
Response to Google's Keras-PyTorch Benchmarks
(gist.github.com)
6 points
mindcrime
2 years ago
1 comment