Search: github.com/b1nc | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

841.

LoCoMo AI Benchmark: 6.4% of answer key wrong, judge accepts 63% of fake answers (github.com/dial481)

3 points

3 months ago

842.

Benchmarking PostgreSQL vs. CockroachDB (github.com/philippta)

3 points

2 years ago

843.

Show HN: Time Series Benchmark TurboPFor, TurboFloat, TurboFloat LzX, TurboGorilla (github.com/powturbo)

3 points

3 years ago

844.

PhantomGPU: GPU performance emulator to benchmark ML models on virtual GPUs (github.com/bugthesystem)

3 points

a year ago

845.

Ask HN: How to benchmark different LLM models in parallel?

3 points

2 years ago

846.

AIBuildAI – An AI agent that automatically builds AI models (#1 on MLE-Bench) (github.com/aibuildai)

3 points

3 months ago

847.

Show HN: Benchmarking Apple Silicon unified mem for GPU-accelerated SQL analysis (github.com/sadopc)

3 points

4 months ago

848.

Mcpbr: Stop guessing and evaluate your MCP server against standard benchmarks (github.com/greynewell)

3 points

5 months ago

849.

PrinceJS: Benchmark Corrections and Lessons from a 13-Year-Old Developer (github.com/MatthewTheCoder1218)

3 points

7 months ago

850.

Show HN: Open Operator Evals – real-world benchmarks for LLM web agents (github.com/nottelabs)

3 points

a year ago

851.

Show HN: AWS CIS Benchmarks Compliance with Kexa (Open Source) (github.com/4urcloud)

3 points

4urcloud_adrien

2 years ago

852.

.NET MAUI Native vs. Self-Drawn Controls Benchmark (github.com/jsuarezruiz)

3 points

4 years ago

853.

Show HN: Graphsignal – benchmark and profile machine learning anywhere (github.com/graphsignal)

3 points

4 years ago

854.

Zig-gamedev project: DirectX 12 Mesh Shader benchmark (github.com/michal-z)

3 points

4 years ago

855.

A great optimization for CPython with great benchmarking results (github.com/python)

3 points

5 years ago

856.

Show HN: Open-Source Performance Metrics and Benchmarks for Machine Learning/NLP (github.com/codeforequity-at)

3 points

6 years ago

857.

Show HN: A simple Python network benchmark tool (github.com/ljishen)

3 points

8 years ago

858.

Benchmarking Nginx with Go (gist.github.com)

3 points

11 years ago

859.

A bunch of good git tutorials (gist.github.com)

3 points

13 years ago

860.

Why Vector Search fails at LLM memory (and a benchmark to prove it) (github.com/tenurehq)

3 points

4 days ago

861.

Research repository for the Americas – benchmarks, models, governance (github.com/GENIA-Americas)

3 points

emergingrulek12

10 days ago

862.

We built a charting benchmark suite: ChartGPU, Plotly, ECharts, SciChart (github.com/abtsoftware)

3 points

2 months ago

863.

ParseBench: Document Parsing Benchmark for AI Agents (github.com/run-llama)

3 points

2 months ago

864.

Show HN: FC-Eval – CLI to Benchmark Local or Cloud LLMs on Function Calling (github.com/gauravvij)

3 points

3 months ago

865.

Show HN: LLM Sycophancy Benchmark: Opposite-Narrator Contradictions (github.com/lechmazur)

3 points

3 months ago

866.

Elixir, Kotlin, C# Outperform Python, TypeScript and Go on AutoCode Benchmark (github.com/Tencent-Hunyuan)

3 points

4 months ago

867.

Benchmarking Postgres for FTS with TOASTed JSONBs and GINs Against Elasticsearch (github.com/inevolin)

3 points

5 months ago

868.

Show HN: llmnop – Rust CLI for benchmarking LLM endpoints (github.com/jpreagan)

3 points

5 months ago

869.

LZAV 5.7: Enhanced Compression, Speed, C++ Compliance, and Benchmark Results (github.com/avaneev)

3 points

6 months ago

870.

80.1% on LoCoMo Long-Term Memory Benchmark with a pure open-source RAG pipeline

3 points

6 months ago