Heykuki News

Top | New | Best | Ask | Show | Jobs
481.
Llama2.mojo - outperforms Karpathy’s llama2.c by 30% in multi-threaded inference (github.com/tairov)
2 points
swyx
3 years ago
1 comment
482.
Stable Fast: Lightweight Inference Optimization Library for Stable Diffusion (github.com/chengzeyi)
2 points
chengzeyi
3 years ago
1 comment
483.
Kyanite: NN inference library, in/for Rust, using CPU or Nvidia GPUs (github.com/KarelPeeters)
2 points
homarp
3 years ago
1 comment
484.
Show HN: Llama2.f90 – Toy LLaMA2 model inference in Fortran (github.com/rbitr)
2 points
andy99
3 years ago
1 comment
485.
CTranslate2: An efficient inference engine for Transformer models (github.com/OpenNMT)
2 points
wsxiaoys
3 years ago
1 comment
486.
Nebullvm open-source accelerator of AI inference. Feedback?
2 points
emilec___
4 years ago
1 comment
487.
Gluon: A static, type inferred and embeddable language written in Rust (github.com/gluon-lang)
2 points
fish45
5 years ago
1 comment
488.
Schema – Infer, Translate Between GraphQL, JSON, YAML, TOML, XML (github.com/Confbase)
2 points
confbase
6 years ago
1 comment
489.
Monero Binaries on getmonero.org Infected (github.com/monero-project)
2 points
rocqua
7 years ago
1 comment
490.
Show HN: We built an LLM inference engine in pure Python – no PyTorch, no Triton (github.com/Zyora-Dev)
2 points
zyoraclub
5 days ago
discuss
491.
Tuning CPU-only Qwen3-30B inference with an IBM Quantum sampling loop (github.com/Shack870)
2 points
Royce-CMR
8 days ago
discuss
492.
SIE: Unified Inference Engine for Embeddings, Reranking, and Extraction (github.com/superlinked)
2 points
modinfo
10 days ago
discuss
493.
Show HN: YieldOS-Lite – A simulator for LLM inference control-plane governance (github.com/nikitph)
2 points
loaderchips
13 days ago
discuss
494.
KinetiX: An intra-inference hardware interlock for LLMs (github.com/johndoerch-eng)
2 points
kinetix_system
15 days ago
discuss
495.
Show HN: AI/ML benchmark for local LLM inference and XGBoost training on GPU/CPU (github.com/albedan)
2 points
albedan
22 days ago
discuss
496.
Arknet – decentralized AI inference, fair launch, one binary (github.com/st-hannibal)
2 points
st-hannibal
a month ago
discuss
497.
Openpi-flash: Real-time inference engine for openpi (github.com/Hebbian-Robotics)
2 points
kstonekuan
a month ago
discuss
498.
Show HN: Stateful Inference with 99% Token Savings (github.com/umbecanessa)
2 points
wasnaga
a month ago
discuss
499.
Rcarmo/go-AI: A mildly sane inference API library for go (github.com/rcarmo)
2 points
rcarmo
2 months ago
discuss
500.
Show HN: Mimikos – Zero-config mock server that infers API behavior from OpenAPI
2 points
codeguruking
2 months ago
discuss
501.
Kubernetes operator for deploying, serving, and improving LLM inference engines (github.com/cliver-project)
2 points
LaSombra
2 months ago
discuss
502.
Living Memory Inference (github.com/alash3al)
2 points
alash3al
2 months ago
discuss
503.
Swift package AI inference engine generated from Rust crate (github.com/ondeinference)
2 points
kampak212
2 months ago
discuss
504.
Open-source ZK proofs for ML inference – verify AI decisions cryptographically (github.com/OE-GOD)
2 points
OE-GOD
2 months ago
discuss
505.
AirLLM optimizes inference memory usage (github.com/lyogavin)
2 points
nreece
3 months ago
discuss
506.
Show HN: I wrote an LLM inference engine in pure Go – 48 tok/s zero dependencies (github.com/computerex)
2 points
computerex
3 months ago
discuss
507.
Show HN: Name-classifier – infers attributes about a person from a name (github.com/douglas-larocca)
2 points
defgeneric
3 months ago
discuss
508.
C inference for Qwen3-ASR 0.6B and 1.7B transcription models (github.com/antirez)
2 points
Curiositry
3 months ago
discuss
509.
Show HN: I built a unified inference layer for Document Processing Models (github.com/adithya-s-k)
2 points
Adithya-Kolavi
3 months ago
discuss
510.
Show HN: Evolved x86 AVX-512 kernels for NF4 LLM inference (github.com/Anuar81)
2 points
Anuar81
4 months ago
discuss