Search: github.com/jbenc | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

211.

A Big Alignment Loophole of Current Froniter LLMs (github.com/wuyoscar)

4 points

2 months ago

212.

Show HN: Benchmarking Zip Libraries in Node.js, Go, Rust, Python, C++, and Java (github.com/shaheemMPM)

4 points

2 years ago

213.

Compiler Benchmark (github.com/nordlow)

4 points

6 years ago

214.

JavaScript Minification Benchmarks (github.com/privatenumber)

4 points

3 months ago

215.

Show HN: GPT-5 vs. Claude 4 Sonnet on 200 Requests Benchmark (github.com/Cubent-Dev)

4 points

9 months ago

216.

Zsh-bench – Benchmark for interactive zsh (github.com/romkatv)

4 points

5 years ago

217.

Benchmark of WebAssembly Runtimes (github.com/jedisct1)

4 points

5 years ago

218.

BIG-bench: a collaborative language model benchmark from Google and OpenAI (github.com/google)

4 points

5 years ago

219.

Show HN: I benchmarked how good LLMs are at proofreading English (github.com/reviseio)

3 points

a month ago

220.

Open-sourcing our clinical triage benchmark for evaluating LLMs (github.com/medaks)

3 points

a year ago

221.

C# benchmark beats Go and Rust (github.com/gardc)

3 points

2 years ago

222.

Show HN: Cloud Benchmarker: See how fast your cloud instances are for real! (github.com/Dicklesworthstone)

3 points

3 years ago

223.

Optimize noisy, high resolution images into rather “tiny” files (github.com/colbyn)

3 points

6 years ago

224.

Show HN: A fast, thread-safe C hashmap with lazy sorting (github.com/RaphaelPrevost)

3 points

16 days ago

225.

XAIDR – first runtime benchmark for agent-to-agent attack detection (github.com/anirudhraokotaru)

3 points

a month ago

226.

Bullshit Benchmark: how do chatbots respond to silly questions? (github.com/petergpt)

3 points

3 months ago

227.

DBaaS Performance Benchmarks – The results will shock you! (github.com/iamalnewkirk)

3 points

4 months ago

228.

Time Ablation Experiments on tau2-bench (github.com/sshh12)

3 points

5 months ago

229.

Neo Scored 34.2% SOTA on OpenAI MLE-Bench (github.com/openai)

3 points

9 months ago

230.

Show HN: Zbench, RAG evals using chess Elo ratings (github.com/zeroentropy-ai)

3 points

10 months ago

231.

Browser Engine Benchmark (github.com/techinz)

3 points

a year ago

232.

GPU Benchmark (github.com/yachty66)

3 points

a year ago

233.

Show HN: Theoretical Tflops ≠ Real-World Performance – Testing GPU Flops (github.com/mag-)

3 points

2 years ago

234.

Bringup-Bench: benchmarks for bringing up CPUs, accelerators, compilers, OSes (github.com/toddmaustin)

3 points

3 years ago

235.

gRPC Benchmarks (github.com/LesnyRumcajs)

3 points

5 years ago

236.

Benchmarking OpenBLAS on an Apple MacBook M1 (github.com/danielchalef)

3 points

5 years ago

237.

Show HN: Super Simple Disk Benchmark v1.1.1 written in rust (github.com/sassman)

3 points

6 years ago

238.

Measure Amazon S3's performance from any location (github.com/dvassallo)

3 points

7 years ago

239.

Kube-bench: checks deployment according to security best practices (github.com/aquasecurity)

3 points

7 years ago

240.

Faster R-CNN and Mask R-CNN in PyTorch 1.0 (github.com/facebookresearch)

3 points

8 years ago