Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
391.
Cake: Distributed LLM and StableDiffusion inference for mobile desktop or server (github.com/evilsocket)
3 points
ethanpil
a year ago
1 comment
392.
Geniusrise – inference APIs for text, vision, audio, multi-modal AI models (github.com)
3 points
ixaxaar
2 years ago
1 comment
393.
Show HN: Collider – the platform for local LLM debug and inference at warp speed (github.com/gotzmann)
3 points
Ambix
3 years ago
1 comment
394.
Takeoff Inference Server Is Now Open Source (github.com/titanml)
3 points
mezark
3 years ago
1 comment
395.
Show HN: GPT-2 inference on the CPU using C/C++ (github.com/ggerganov)
3 points
ggerganov
4 years ago
1 comment
396.
Hummingbird,compiles trained ML models into tensor computation for inference (github.com/microsoft)
3 points
tourist_on_road
6 years ago
1 comment
397.
HE-Transformer: Deep Learning Inference with Homomorphic Encryption (github.com/NervanaSystems)
3 points
ArtWomb
7 years ago
1 comment
398.
Inferring polygon vertices with a VGG-16 model (github.com/AidanRocke)
3 points
aidanrocke
8 years ago
1 comment
399.
Show HN: Piqc – GPU waste scanner for LLM inference clusters (github.com/paralleliq)
3 points
paralleliq
4 days ago
discuss
400.
Show HN: Llmff v1.0 FFmpeg for Inference (github.com/syndicalt)
3 points
syndicalt
6 days ago
discuss
401.
Atlas TQ1_0 – Pure C++ Ternary (1.58-Bit) Inference Engine for CPU (github.com/xxxn3m3s1sxxx)
3 points
xxxn3m3s1sxxx
19 days ago
discuss
402.
Atlas – Pure Rust Inference Engine (github.com/Avarok-Cybersecurity)
3 points
danborn26
a month ago
discuss
403.
Show HN: Valkyr LM Inference with Realtime Guarantees (github.com/Foundation42)
3 points
quatonion
a month ago
discuss
404.
Show HN: Coelanox – auditable inference runtime in Rust (BERT runs today) (coelanox.com)
3 points
Shark1n4Suit
2 months ago
discuss
405.
LLM inference load balancer optimized for AMD Radeon VII GPUs (github.com/janit)
3 points
velmu
2 months ago
discuss
406.
RvLLM: High-performance LLM inference in Rust (github.com/m0at)
3 points
mji
2 months ago
discuss
407.
Rust-native hybrid training and inference engine for Apple Neural Engine and GPU (github.com/ncdrone)
3 points
ngaut
3 months ago
discuss
408.
Show HN: Doppler.js – WebGPU inference, faster/simpler than transformer.js
3 points
clocksmith
3 months ago
discuss
409.
OMLX – LLM Inference Server for Apple Silicon (Ollama for MLX) (github.com/jundot)
3 points
fintechie
4 months ago
discuss
410.
Free LLM API Resources – A List of Free LLM Inference APIs (github.com/cheahjs)
3 points
willmarquis
4 months ago
discuss
411.
A tiny LM that does inference at compile time (github.com/erodola)
3 points
signa11
5 months ago
discuss
412.
Snow HN: ~950 line inference engine, on par with vLLM (github.com/naklecha)
3 points
naklecha
5 months ago
discuss
413.
Bare-Metal Llama 2 Inference in C++20 (No Frameworks, ARM Neon) (github.com/farukalpay)
3 points
ornurla
5 months ago
discuss
414.
OpenVINO – open-source toolkit for optimizing and deploying AI inference (github.com/openvinotoolkit)
3 points
peter_d_sherman
5 months ago
discuss
415.
Show HN: Distributed Storage System to 8x LLM Inference, GPU Training Efficiency (github.com/blackbird-io)
3 points
hackerpanda123
8 months ago
discuss
416.
LLM-optimizer: Benchmark and optimize LLM inference across frameworks with ease (github.com/bentoml)
3 points
djhu9
9 months ago
discuss
417.
LLM Inference in pure Java with a GPU acceleration enabled (github.com/beehive-lab)
3 points
mikepapadim
a year ago
discuss
418.
Show HN: I made TypeScript's type inference more strict (and smarter) (github.com/kakasoo)
3 points
kakasoo
a year ago
discuss
419.
The Path to Open-Sourcing the DeepSeek Inference Engine (github.com/deepseek-ai)
3 points
vitorgrs
a year ago
discuss
420.
Deepseek CPP for CPU only inference (github.com/andrewkchan)
3 points
amrrs
a year ago
discuss
More