Search: github.com/bendc | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

751.

Google/fuzzbench: Fuzzer benchmarking as a service (github.com/google)

11 points

6 years ago

752.

A benchmark to compare synchronization techniques for multicore programming (github.com/gramoli)

11 points

10 years ago

753.

HTTP benchmarking tool written in Crystal (github.com/Sdogruyol)

11 points

11 years ago

754.

Show HN: Codex context bloat? 87% avg reduction on SWE-bench Verified traces (npmjs.com)

10 points

a month ago

755.

A mind-bending simulation of the movie Inception, in C and ASM. (github.com/karthick18)

10 points

16 years ago

756.

Show HN: LLM Debate Benchmark (github.com/lechmazur)

9 points

3 months ago

757.

Recursive grep written in Go benched against a C++ and Rust variant (github.com/bep)

9 points

a month ago

758.

LLM Persuasion Benchmark: Multi-Turn Persuasion Between Models (github.com/lechmazur)

9 points

2 months ago

759.

You Do Not Need a Vector Database (For RAG): Benchmarking IR Methods

9 points

3 years ago

760.

Trivial PHP string concatenation benchmarks, proving time better spent elsewhere. (github.com/magnetikonline)

8 points

12 years ago

761.

Real-world benchmarks (gist.github.com)

8 points

13 years ago

762.

Show HN: Bazaar – a new LLM benchmark for economic reasoning under uncertainty (github.com/lechmazur)

8 points

a year ago

763.

OpenChat_8192 Beats ChatGPT-3.5 on Vicuna GPT-4 Benchmark

8 points

3 years ago

764.

Raspberry Pi httpd micro benchmark (gist.github.com)

8 points

10 years ago

765.

Show HN: LLM Creative Story‑Writing Benchmark V3 (github.com/lechmazur)

8 points

9 months ago

766.

Show HN: LLM Divergent Thinking Creativity Benchmark (github.com/lechmazur)

8 points

a year ago

767.

Show HN: Iron Cushion, a CouchDB benchmark and load testing tool (github.com/mgp)

8 points

14 years ago

768.

Show HN: Mini-swe-agent achieves 65% on SWE-bench in 100 lines of Python (github.com/SWE-agent)

7 points

10 months ago

769.

A caffeine driven, simplistic approach to benchmarking Node.js code. (github.com/logicalparadox)

7 points

14 years ago

770.

Multi-Agent Step Race Benchmark: LLM Collaboration and Deception Under Pressure (github.com/lechmazur)

7 points

a year ago

771.

Show HN: LLM Deceptiveness and Gullibility Benchmark (github.com/lechmazur)

7 points

2 years ago

772.

Engulf, A Graphical HTTP Benchmarker written in Clojure + D3.js (github.com/andrewvc)

7 points

14 years ago

773.

Wrk – an HTTP benchmarking tool (github.com/wg)

7 points

13 years ago

774.

Show HN: Get a report on your compliance to CIS Benchmarks (Azure and AWS) (github.com/4urcloud)

7 points

2 years ago

775.

Show HN: Ben, your benchmarking assistant, written in Go (github.com/drish)

7 points

8 years ago

776.

Our classifier outperforms CatBoost, XGBoost, LightGBM on 5 benchmark datasets (github.com/LinearBoost)

6 points

2 years ago

777.

Ask HN: Are there any reliable benchmarks for Machine Learning Model Serving?

6 points

2 years ago

778.

Benchmark GGUF model with ONE line of code (github.com/NexaAI)

6 points

2 years ago

779.

New code-focused LLM needle in the haystack benchmark (github.com/HammingHQ)

6 points

2 years ago

780.

Response to Google's Keras-PyTorch Benchmarks (gist.github.com)

6 points

2 years ago