Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
301.
Sahi: A Vision library for sliced inference on large images/small objects (github.com/obss)
8 points
yagizdegirmenci
5 years ago
discuss
302.
Show HN: Docker Model Runner Integrates vLLM for High-Throughput Inference (github.com/docker)
7 points
ericcurtin
7 months ago
1 comment
303.
Show HN: Jlama – A fast Java inference engine for GPT and Llama models (github.com/tjake)
7 points
tjake
3 years ago
1 comment
304.
Show HN: Llamero – A GUI app to easily download, install and infer LLaMA models (github.com/mpociot)
7 points
mpociot
3 years ago
1 comment
305.
Alpa: Auto-parallelizing large model training and inference (by UC Berkeley) (github.com/alpa-projects)
7 points
zhisbug
4 years ago
1 comment
306.
Show HN: Secure XGBoost training and inference on encrypted data (github.com/mc2-project)
7 points
chesterl
6 years ago
1 comment
307.
Show HN: Composable middleware for LLM inference Optimization Passes (github.com/liquidos-ai)
7 points
human_hack3r
3 months ago
discuss
308.
Distributed LLama3 Inference (github.com/evilsocket)
7 points
345765476586
2 years ago
discuss
309.
Stable Diffusion Inference on iOS (github.com/madebyollin)
7 points
pizza
4 years ago
discuss
310.
Cligen: A Native API-Inferred Command-Line Interface Generator for Nim (github.com/c-blake)
6 points
TheWiggles
10 months ago
3 comments
311.
RxInferServer – Remote Bayesian Inference from Python via Julia
6 points
bvdmitri
a year ago
3 comments
312.
Show HN: Larq – Binarized Neural Network Inference with MLIR and TFLite (github.com/larq)
6 points
khelwegen
6 years ago
1 comment
313.
OpenUMA – bring Apple-style unified memory to x86 AI inference (Rust, Linux) (github.com/hamtun24)
6 points
hamtun24
2 months ago
discuss
314.
Llama.cpp: Deterministic Inference Mode (CUDA): RMSNorm, MatMul, Attention (github.com/ggml-org)
6 points
diwank
9 months ago
discuss
315.
Llama Inference in 150 Lines (gist.github.com)
6 points
kevmo314
2 years ago
discuss
316.
Show HN: Launch StableStudio local inference in one commmand (github.com/brycedrennan)
6 points
bryced
3 years ago
discuss
317.
Rust+OpenCL+AVX2 implementation of LLaMA inference code (github.com/Noeda)
6 points
myers
3 years ago
discuss
318.
ncnn: High-performance neural network inference framework optimized for mobile (github.com/Tencent)
6 points
davikr
3 years ago
discuss
319.
Wase – WebAssembly made easy. Strongly typed infered low-level language for WASM (github.com/area9innovation)
6 points
asgeralstrup
4 years ago
discuss
320.
Statistical Inference Considered Harmful (github.com/frankmcsherry)
6 points
rargulati
10 years ago
discuss
321.
Ask HN: What is the best tool to infer data type of tabular data?
5 points
mahalel
5 years ago
7 comments
322.
Show HN: Zod – TypeScript-first validation library with static type inference (github.com/vriad)
5 points
vriad
6 years ago
3 comments
323.
Show HN: GPT-J inference on the CPU using C/C++ (github.com/ggerganov)
5 points
ggerganov
4 years ago
2 comments
324.
I implemented CLIP inference in plain C/C++ (github.com/monatis)
5 points
monatis
3 years ago
1 comment
325.
GeosPy: Geolocation Inference Made Easy (github.com/tylfin)
5 points
tylfin
10 years ago
1 comment
326.
AI Agent that at inference time updates it's harness and model weights (github.com/hexo-ai)
5 points
martianvoid
6 days ago
discuss
327.
Show HN: Smile-Serve – Inference Server for ML, ONNX, and LLM (github.com/haifengl)
5 points
haifeng
a month ago
discuss
328.
vLLM introduces memory optimizations for long-context inference (github.com/vllm-project)
5 points
addisud
2 months ago
discuss
329.
Zinc – LLM inference engine written in Zig, running 35B models on $550 AMD GPUs (github.com/zolotukhin)
5 points
mvdwoord
2 months ago
discuss
330.
Show HN: Llmtop – Htop for LLM Inference Clusters (vLLM, SGLang, Ollama, llama) (github.com/InfraWhisperer)
5 points
rpotluri
3 months ago
discuss
More