Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
421.
▲
AntiSlop Sampler for LLM Inference
(github.com/sam-paech)
3 points
rahimnathwani
a year ago
discuss
422.
▲
Jet.jl: static type checker with type inference for Julia
(github.com/aviatesk)
3 points
fanf2
2 years ago
discuss
423.
▲
Show HN: Bayesian Neural Networks and Uncertainty for Inferring Unseen Classes
(github.com/MNoorFawi)
3 points
mnoorfawi
2 years ago
discuss
424.
▲
Real-Time Streaming Apps with Nvidia Open Source Triton Inference
(github.com/nickaggarwal)
3 points
agcat
2 years ago
discuss
425.
▲
Distributed LLM Inference with Llama.cpp
(github.com/ggerganov)
3 points
tosh
2 years ago
discuss
426.
▲
Practical Llama 3 inference implemented in a single Java file
(github.com/mukel)
3 points
simonpure
2 years ago
discuss
427.
▲
Gemma.cpp: lightweight, standalone C++ inference engine for Gemma models
(github.com/google)
3 points
ot
2 years ago
discuss
428.
▲
Llama.cpp supports distributed inference across machines on a local network
(github.com/ggerganov)
3 points
behnamoh
2 years ago
discuss
429.
▲
RCE in Nvidia Triton Inference Server
(github.com/protectai)
3 points
byt3bl33d3r
2 years ago
discuss
430.
▲
Show HN: Inference-only implementation of Mamba optimized for CPU
(github.com/flawedmatrix)
3 points
flawedmatrix
2 years ago
discuss
431.
▲
Show HN: NOS – A fast, and ergonomic PyTorch inference server
(github.com/autonomi-ai)
3 points
EarlyOom
2 years ago
discuss
432.
▲
Training and inference code for audio generation models
(github.com/Stability-AI)
3 points
treesciencebot
3 years ago
discuss
433.
▲
Vllm: High-throughput and memory-efficient inference and serving engine for LLMs
(github.com/vllm-project)
3 points
tosh
3 years ago
discuss
434.
▲
Small inference runtime for deep neural networks
(github.com/maekawatoshiki)
3 points
uint256_t
3 years ago
discuss
435.
▲
Inference at the edge: Efficient transformer model inference on-device
(github.com/ggerganov)
3 points
lioeters
3 years ago
discuss
436.
▲
WebGPU ONNX inference runtime written in Rust
(github.com/webonnx)
3 points
f_devd
3 years ago
discuss
437.
▲
Show HN: Python Monitoring for AI: LLMs, OpenAI, Inference, GPUs
(github.com/graphsignal)
3 points
npgraph
3 years ago
discuss
438.
▲
Show HN: Nix-init – Generate Nix packages from URLs with dependency inference
(github.com/nix-community)
3 points
figsoda
3 years ago
discuss
439.
▲
Fast type inference library for Common Lisp
(github.com/marcoheisig)
3 points
medo-bear
4 years ago
discuss
440.
▲
Using OpenAI Codex's “DaVinci-Edit” Model for Gradual Type Inference
(github.com/GammaTauAI)
3 points
elleven
4 years ago
discuss
441.
▲
Show HN: Spartan Schema - Ultra-minimal JSON schemas with Typescript inference
(github.com/ar-nelson)
3 points
ar-nelson
4 years ago
discuss
442.
▲
Type inference for the database access layer in PHP
3 points
markusstaab
4 years ago
discuss
443.
▲
The exhaustive Pattern Matching library for TypeScript with smart type inference
(github.com/gvergnaud)
3 points
itstaken
4 years ago
discuss
444.
▲
Nebullvm, an open-source library to accelerate AI inference by 5-20x
(github.com/nebuly-ai)
3 points
emilec___
4 years ago
discuss
445.
▲
Epispot: A Python Library for Modeling Infectious Diseases
(github.com/epispot)
3 points
cbracketdash
5 years ago
discuss
446.
▲
Cortex: Serverless Inference for MLOps Teams
(github.com/cortexlabs)
3 points
ChefboyOG
5 years ago
discuss
447.
▲
Show HN: We just launched MegaAI. It's a 4k30fps, 4W, 4TOPS inference powerhouse
3 points
seventytwo
6 years ago
discuss
448.
▲
Corona Model with a Varying Infection Rate
(github.com/meisserecon)
3 points
gniv
6 years ago
discuss
449.
▲
Corona Model with a Varying Infection Rate
(github.com/meisserecon)
3 points
JacobDotVI
6 years ago
discuss
450.
▲
Covid-19 Infection Fatality Rates
(github.com/clauswilke)
3 points
flocial
6 years ago
discuss
More