Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
391.
▲
Cake: Distributed LLM and StableDiffusion inference for mobile desktop or server
(github.com/evilsocket)
3 points
ethanpil
a year ago
1 comment
392.
▲
Geniusrise – inference APIs for text, vision, audio, multi-modal AI models
(github.com)
3 points
ixaxaar
2 years ago
1 comment
393.
▲
Show HN: Collider – the platform for local LLM debug and inference at warp speed
(github.com/gotzmann)
3 points
Ambix
3 years ago
1 comment
394.
▲
Takeoff Inference Server Is Now Open Source
(github.com/titanml)
3 points
mezark
3 years ago
1 comment
395.
▲
Show HN: GPT-2 inference on the CPU using C/C++
(github.com/ggerganov)
3 points
ggerganov
4 years ago
1 comment
396.
▲
Hummingbird,compiles trained ML models into tensor computation for inference
(github.com/microsoft)
3 points
tourist_on_road
6 years ago
1 comment
397.
▲
HE-Transformer: Deep Learning Inference with Homomorphic Encryption
(github.com/NervanaSystems)
3 points
ArtWomb
7 years ago
1 comment
398.
▲
Inferring polygon vertices with a VGG-16 model
(github.com/AidanRocke)
3 points
aidanrocke
8 years ago
1 comment
399.
▲
Show HN: Piqc – GPU waste scanner for LLM inference clusters
(github.com/paralleliq)
3 points
paralleliq
4 days ago
discuss
400.
▲
Show HN: Llmff v1.0 FFmpeg for Inference
(github.com/syndicalt)
3 points
syndicalt
6 days ago
discuss
401.
▲
Atlas TQ1_0 – Pure C++ Ternary (1.58-Bit) Inference Engine for CPU
(github.com/xxxn3m3s1sxxx)
3 points
xxxn3m3s1sxxx
19 days ago
discuss
402.
▲
Atlas – Pure Rust Inference Engine
(github.com/Avarok-Cybersecurity)
3 points
danborn26
a month ago
discuss
403.
▲
Show HN: Valkyr LM Inference with Realtime Guarantees
(github.com/Foundation42)
3 points
quatonion
a month ago
discuss
404.
▲
Show HN: Coelanox – auditable inference runtime in Rust (BERT runs today)
(coelanox.com)
3 points
Shark1n4Suit
2 months ago
discuss
405.
▲
LLM inference load balancer optimized for AMD Radeon VII GPUs
(github.com/janit)
3 points
velmu
2 months ago
discuss
406.
▲
RvLLM: High-performance LLM inference in Rust
(github.com/m0at)
3 points
mji
2 months ago
discuss
407.
▲
Rust-native hybrid training and inference engine for Apple Neural Engine and GPU
(github.com/ncdrone)
3 points
ngaut
3 months ago
discuss
408.
▲
Show HN: Doppler.js – WebGPU inference, faster/simpler than transformer.js
3 points
clocksmith
3 months ago
discuss
409.
▲
OMLX – LLM Inference Server for Apple Silicon (Ollama for MLX)
(github.com/jundot)
3 points
fintechie
4 months ago
discuss
410.
▲
Free LLM API Resources – A List of Free LLM Inference APIs
(github.com/cheahjs)
3 points
willmarquis
4 months ago
discuss
411.
▲
A tiny LM that does inference at compile time
(github.com/erodola)
3 points
signa11
5 months ago
discuss
412.
▲
Snow HN: ~950 line inference engine, on par with vLLM
(github.com/naklecha)
3 points
naklecha
5 months ago
discuss
413.
▲
Bare-Metal Llama 2 Inference in C++20 (No Frameworks, ARM Neon)
(github.com/farukalpay)
3 points
ornurla
5 months ago
discuss
414.
▲
OpenVINO – open-source toolkit for optimizing and deploying AI inference
(github.com/openvinotoolkit)
3 points
peter_d_sherman
5 months ago
discuss
415.
▲
Show HN: Distributed Storage System to 8x LLM Inference, GPU Training Efficiency
(github.com/blackbird-io)
3 points
hackerpanda123
8 months ago
discuss
416.
▲
LLM-optimizer: Benchmark and optimize LLM inference across frameworks with ease
(github.com/bentoml)
3 points
djhu9
9 months ago
discuss
417.
▲
LLM Inference in pure Java with a GPU acceleration enabled
(github.com/beehive-lab)
3 points
mikepapadim
a year ago
discuss
418.
▲
Show HN: I made TypeScript's type inference more strict (and smarter)
(github.com/kakasoo)
3 points
kakasoo
a year ago
discuss
419.
▲
The Path to Open-Sourcing the DeepSeek Inference Engine
(github.com/deepseek-ai)
3 points
vitorgrs
a year ago
discuss
420.
▲
Deepseek CPP for CPU only inference
(github.com/andrewkchan)
3 points
amrrs
a year ago
discuss
More