Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
301.
▲
Sahi: A Vision library for sliced inference on large images/small objects
(github.com/obss)
8 points
yagizdegirmenci
5 years ago
discuss
302.
▲
Show HN: Docker Model Runner Integrates vLLM for High-Throughput Inference
(github.com/docker)
7 points
ericcurtin
7 months ago
1 comment
303.
▲
Show HN: Jlama – A fast Java inference engine for GPT and Llama models
(github.com/tjake)
7 points
tjake
3 years ago
1 comment
304.
▲
Show HN: Llamero – A GUI app to easily download, install and infer LLaMA models
(github.com/mpociot)
7 points
mpociot
3 years ago
1 comment
305.
▲
Alpa: Auto-parallelizing large model training and inference (by UC Berkeley)
(github.com/alpa-projects)
7 points
zhisbug
4 years ago
1 comment
306.
▲
Show HN: Secure XGBoost training and inference on encrypted data
(github.com/mc2-project)
7 points
chesterl
6 years ago
1 comment
307.
▲
Show HN: Composable middleware for LLM inference Optimization Passes
(github.com/liquidos-ai)
7 points
human_hack3r
3 months ago
discuss
308.
▲
Distributed LLama3 Inference
(github.com/evilsocket)
7 points
345765476586
2 years ago
discuss
309.
▲
Stable Diffusion Inference on iOS
(github.com/madebyollin)
7 points
pizza
4 years ago
discuss
310.
▲
Cligen: A Native API-Inferred Command-Line Interface Generator for Nim
(github.com/c-blake)
6 points
TheWiggles
10 months ago
3 comments
311.
▲
RxInferServer – Remote Bayesian Inference from Python via Julia
6 points
bvdmitri
a year ago
3 comments
312.
▲
Show HN: Larq – Binarized Neural Network Inference with MLIR and TFLite
(github.com/larq)
6 points
khelwegen
6 years ago
1 comment
313.
▲
OpenUMA – bring Apple-style unified memory to x86 AI inference (Rust, Linux)
(github.com/hamtun24)
6 points
hamtun24
2 months ago
discuss
314.
▲
Llama.cpp: Deterministic Inference Mode (CUDA): RMSNorm, MatMul, Attention
(github.com/ggml-org)
6 points
diwank
9 months ago
discuss
315.
▲
Llama Inference in 150 Lines
(gist.github.com)
6 points
kevmo314
2 years ago
discuss
316.
▲
Show HN: Launch StableStudio local inference in one commmand
(github.com/brycedrennan)
6 points
bryced
3 years ago
discuss
317.
▲
Rust+OpenCL+AVX2 implementation of LLaMA inference code
(github.com/Noeda)
6 points
myers
3 years ago
discuss
318.
▲
ncnn: High-performance neural network inference framework optimized for mobile
(github.com/Tencent)
6 points
davikr
3 years ago
discuss
319.
▲
Wase – WebAssembly made easy. Strongly typed infered low-level language for WASM
(github.com/area9innovation)
6 points
asgeralstrup
4 years ago
discuss
320.
▲
Statistical Inference Considered Harmful
(github.com/frankmcsherry)
6 points
rargulati
10 years ago
discuss
321.
▲
Ask HN: What is the best tool to infer data type of tabular data?
5 points
mahalel
5 years ago
7 comments
322.
▲
Show HN: Zod – TypeScript-first validation library with static type inference
(github.com/vriad)
5 points
vriad
6 years ago
3 comments
323.
▲
Show HN: GPT-J inference on the CPU using C/C++
(github.com/ggerganov)
5 points
ggerganov
4 years ago
2 comments
324.
▲
I implemented CLIP inference in plain C/C++
(github.com/monatis)
5 points
monatis
3 years ago
1 comment
325.
▲
GeosPy: Geolocation Inference Made Easy
(github.com/tylfin)
5 points
tylfin
10 years ago
1 comment
326.
▲
AI Agent that at inference time updates it's harness and model weights
(github.com/hexo-ai)
5 points
martianvoid
6 days ago
discuss
327.
▲
Show HN: Smile-Serve – Inference Server for ML, ONNX, and LLM
(github.com/haifengl)
5 points
haifeng
a month ago
discuss
328.
▲
vLLM introduces memory optimizations for long-context inference
(github.com/vllm-project)
5 points
addisud
2 months ago
discuss
329.
▲
Zinc – LLM inference engine written in Zig, running 35B models on $550 AMD GPUs
(github.com/zolotukhin)
5 points
mvdwoord
2 months ago
discuss
330.
▲
Show HN: Llmtop – Htop for LLM Inference Clusters (vLLM, SGLang, Ollama, llama)
(github.com/InfraWhisperer)
5 points
rpotluri
3 months ago
discuss
More