Heykuki News

211.
A Big Alignment Loophole of Current Frontier LLMs (github.com/wuyoscar)
4 points
pythonsen
2 months ago
1 comment
212.
Show HN: Benchmarking Zip Libraries in Node.js, Go, Rust, Python, C++, and Java (github.com/shaheemMPM)
4 points
shaheem_mpm
2 years ago
1 comment
213.
Compiler Benchmark (github.com/nordlow)
4 points
arunc
6 years ago
1 comment
214.
JavaScript Minification Benchmarks (github.com/privatenumber)
4 points
javatuts
3 months ago
discuss
215.
Show HN: GPT-5 vs. Claude 4 Sonnet on 200 Requests Benchmark (github.com/Cubent-Dev)
4 points
anwarlaksir
9 months ago
discuss
216.
Zsh-bench – Benchmark for interactive zsh (github.com/romkatv)
4 points
dennis-tra
5 years ago
discuss
217.
Benchmark of WebAssembly Runtimes (github.com/jedisct1)
4 points
drocer88
5 years ago
discuss
218.
BIG-bench: a collaborative language model benchmark from Google and OpenAI (github.com/google)
4 points
guyga
5 years ago
discuss
219.
Show HN: I benchmarked how good LLMs are at proofreading English (github.com/reviseio)
3 points
artursapek
a month ago
2 comments
220.
Open-sourcing our clinical triage benchmark for evaluating LLMs (github.com/medaks)
3 points
klemenvod
a year ago
2 comments
221.
C# benchmark beats Go and Rust (github.com/gardc)
3 points
PKop
2 years ago
2 comments
222.
Show HN: Cloud Benchmarker: See how fast your cloud instances are for real! (github.com/Dicklesworthstone)
3 points
eigenvalue
3 years ago
1 comment
223.
Optimize noisy, high resolution images into rather “tiny” files (github.com/colbyn)
3 points
danielsokil
6 years ago
1 comment
224.
Show HN: A fast, thread-safe C hashmap with lazy sorting (github.com/RaphaelPrevost)
3 points
jaguarwan
16 days ago
discuss
225.
XAIDR – first runtime benchmark for agent-to-agent attack detection (github.com/anirudhraokotaru)
3 points
delphisec
a month ago
discuss
226.
Bullshit Benchmark: how do chatbots respond to silly questions? (github.com/petergpt)
3 points
twistorial
3 months ago
discuss
227.
DBaaS Performance Benchmarks – The results will shock you! (github.com/iamalnewkirk)
3 points
iamalnewkirk
4 months ago
discuss
228.
Time Ablation Experiments on tau2-bench (github.com/sshh12)
3 points
sshh12
5 months ago
discuss
229.
Neo Scored 34.2% SOTA on OpenAI MLE-Bench (github.com/openai)
3 points
vijgaurav
9 months ago
discuss
230.
Show HN: Zbench, RAG evals using chess Elo ratings (github.com/zeroentropy-ai)
3 points
ghita_
10 months ago
discuss
231.
Browser Engine Benchmark (github.com/techinz)
3 points
handfuloflight
a year ago
discuss
232.
GPU Benchmark (github.com/yachty66)
3 points
ushakov
a year ago
discuss
233.
Show HN: Theoretical Tflops ≠ Real-World Performance – Testing GPU Flops (github.com/mag-)
3 points
rkwasny
2 years ago
discuss
234.
Bringup-Bench: benchmarks for bringing up CPUs, accelerators, compilers, OSes (github.com/toddmaustin)
3 points
matt_d
3 years ago
discuss
235.
gRPC Benchmarks (github.com/LesnyRumcajs)
3 points
dustinmoris
5 years ago
discuss
236.
Benchmarking OpenBLAS on an Apple MacBook M1 (github.com/danielchalef)
3 points
roseway4
5 years ago
discuss
237.
Show HN: Super Simple Disk Benchmark v1.1.1 written in Rust (github.com/sassman)
3 points
5422m4n
6 years ago
discuss
238.
Measure Amazon S3's performance from any location (github.com/dvassallo)
3 points
DVassallo
7 years ago
discuss
239.
Kube-bench: checks deployment according to security best practices (github.com/aquasecurity)
3 points
based2
7 years ago
discuss
240.
Faster R-CNN and Mask R-CNN in PyTorch 1.0 (github.com/facebookresearch)
3 points
stablemap
8 years ago
discuss