Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
211.
▲
A Big Alignment Loophole of Current Froniter LLMs
(github.com/wuyoscar)
4 points
pythonsen
2 months ago
1 comment
212.
▲
Show HN: Benchmarking Zip Libraries in Node.js, Go, Rust, Python, C++, and Java
(github.com/shaheemMPM)
4 points
shaheem_mpm
2 years ago
1 comment
213.
▲
Compiler Benchmark
(github.com/nordlow)
4 points
arunc
6 years ago
1 comment
214.
▲
JavaScript Minification Benchmarks
(github.com/privatenumber)
4 points
javatuts
3 months ago
discuss
215.
▲
Show HN: GPT-5 vs. Claude 4 Sonnet on 200 Requests Benchmark
(github.com/Cubent-Dev)
4 points
anwarlaksir
9 months ago
discuss
216.
▲
Zsh-bench – Benchmark for interactive zsh
(github.com/romkatv)
4 points
dennis-tra
5 years ago
discuss
217.
▲
Benchmark of WebAssembly Runtimes
(github.com/jedisct1)
4 points
drocer88
5 years ago
discuss
218.
▲
BIG-bench: a collaborative language model benchmark from Google and OpenAI
(github.com/google)
4 points
guyga
5 years ago
discuss
219.
▲
Show HN: I benchmarked how good LLMs are at proofreading English
(github.com/reviseio)
3 points
artursapek
a month ago
2 comments
220.
▲
Open-sourcing our clinical triage benchmark for evaluating LLMs
(github.com/medaks)
3 points
klemenvod
a year ago
2 comments
221.
▲
C# benchmark beats Go and Rust
(github.com/gardc)
3 points
PKop
2 years ago
2 comments
222.
▲
Show HN: Cloud Benchmarker: See how fast your cloud instances are for real!
(github.com/Dicklesworthstone)
3 points
eigenvalue
3 years ago
1 comment
223.
▲
Optimize noisy, high resolution images into rather “tiny” files
(github.com/colbyn)
3 points
danielsokil
6 years ago
1 comment
224.
▲
Show HN: A fast, thread-safe C hashmap with lazy sorting
(github.com/RaphaelPrevost)
3 points
jaguarwan
16 days ago
discuss
225.
▲
XAIDR – first runtime benchmark for agent-to-agent attack detection
(github.com/anirudhraokotaru)
3 points
delphisec
a month ago
discuss
226.
▲
Bullshit Benchmark: how do chatbots respond to silly questions?
(github.com/petergpt)
3 points
twistorial
3 months ago
discuss
227.
▲
DBaaS Performance Benchmarks – The results will shock you!
(github.com/iamalnewkirk)
3 points
iamalnewkirk
4 months ago
discuss
228.
▲
Time Ablation Experiments on tau2-bench
(github.com/sshh12)
3 points
sshh12
5 months ago
discuss
229.
▲
Neo Scored 34.2% SOTA on OpenAI MLE-Bench
(github.com/openai)
3 points
vijgaurav
9 months ago
discuss
230.
▲
Show HN: Zbench, RAG evals using chess Elo ratings
(github.com/zeroentropy-ai)
3 points
ghita_
10 months ago
discuss
231.
▲
Browser Engine Benchmark
(github.com/techinz)
3 points
handfuloflight
a year ago
discuss
232.
▲
GPU Benchmark
(github.com/yachty66)
3 points
ushakov
a year ago
discuss
233.
▲
Show HN: Theoretical Tflops ≠ Real-World Performance – Testing GPU Flops
(github.com/mag-)
3 points
rkwasny
2 years ago
discuss
234.
▲
Bringup-Bench: benchmarks for bringing up CPUs, accelerators, compilers, OSes
(github.com/toddmaustin)
3 points
matt_d
3 years ago
discuss
235.
▲
gRPC Benchmarks
(github.com/LesnyRumcajs)
3 points
dustinmoris
5 years ago
discuss
236.
▲
Benchmarking OpenBLAS on an Apple MacBook M1
(github.com/danielchalef)
3 points
roseway4
5 years ago
discuss
237.
▲
Show HN: Super Simple Disk Benchmark v1.1.1 written in rust
(github.com/sassman)
3 points
5422m4n
6 years ago
discuss
238.
▲
Measure Amazon S3's performance from any location
(github.com/dvassallo)
3 points
DVassallo
7 years ago
discuss
239.
▲
Kube-bench: checks deployment according to security best practices
(github.com/aquasecurity)
3 points
based2
7 years ago
discuss
240.
▲
Faster R-CNN and Mask R-CNN in PyTorch 1.0
(github.com/facebookresearch)
3 points
stablemap
8 years ago
discuss
More