Search: github.com/tnfe | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

301.

Sahi: A Vision library for sliced inference on large images/small objects (github.com/obss)

8 points

yagizdegirmenci

5 years ago

302.

Show HN: Docker Model Runner Integrates vLLM for High-Throughput Inference (github.com/docker)

7 points

7 months ago

303.

Show HN: Jlama – A fast Java inference engine for GPT and Llama models (github.com/tjake)

7 points

3 years ago

304.

Show HN: Llamero – A GUI app to easily download, install and infer LLaMA models (github.com/mpociot)

7 points

3 years ago

305.

Alpa: Auto-parallelizing large model training and inference (by UC Berkeley) (github.com/alpa-projects)

7 points

4 years ago

306.

Show HN: Secure XGBoost training and inference on encrypted data (github.com/mc2-project)

7 points

6 years ago

307.

Show HN: Composable middleware for LLM inference Optimization Passes (github.com/liquidos-ai)

7 points

3 months ago

308.

Distributed LLama3 Inference (github.com/evilsocket)

7 points

2 years ago

309.

Stable Diffusion Inference on iOS (github.com/madebyollin)

7 points

4 years ago

310.

Cligen: A Native API-Inferred Command-Line Interface Generator for Nim (github.com/c-blake)

6 points

10 months ago

311.

RxInferServer – Remote Bayesian Inference from Python via Julia

6 points

a year ago

312.

Show HN: Larq – Binarized Neural Network Inference with MLIR and TFLite (github.com/larq)

6 points

6 years ago

313.

OpenUMA – bring Apple-style unified memory to x86 AI inference (Rust, Linux) (github.com/hamtun24)

6 points

2 months ago

314.

Llama.cpp: Deterministic Inference Mode (CUDA): RMSNorm, MatMul, Attention (github.com/ggml-org)

6 points

9 months ago

315.

Llama Inference in 150 Lines (gist.github.com)

6 points

2 years ago

316.

Show HN: Launch StableStudio local inference in one commmand (github.com/brycedrennan)

6 points

3 years ago

317.

Rust+OpenCL+AVX2 implementation of LLaMA inference code (github.com/Noeda)

6 points

3 years ago

318.

ncnn: High-performance neural network inference framework optimized for mobile (github.com/Tencent)

6 points

3 years ago

319.

Wase – WebAssembly made easy. Strongly typed infered low-level language for WASM (github.com/area9innovation)

6 points

4 years ago

320.

Statistical Inference Considered Harmful (github.com/frankmcsherry)

6 points

10 years ago

321.

Ask HN: What is the best tool to infer data type of tabular data?

5 points

5 years ago

322.

Show HN: Zod – TypeScript-first validation library with static type inference (github.com/vriad)

5 points

6 years ago

323.

Show HN: GPT-J inference on the CPU using C/C++ (github.com/ggerganov)

5 points

4 years ago

324.

I implemented CLIP inference in plain C/C++ (github.com/monatis)

5 points

3 years ago

325.

GeosPy: Geolocation Inference Made Easy (github.com/tylfin)

5 points

10 years ago

326.

AI Agent that at inference time updates it's harness and model weights (github.com/hexo-ai)

5 points

6 days ago

327.

Show HN: Smile-Serve – Inference Server for ML, ONNX, and LLM (github.com/haifengl)

5 points

a month ago

328.

vLLM introduces memory optimizations for long-context inference (github.com/vllm-project)

5 points

2 months ago

329.

Zinc – LLM inference engine written in Zig, running 35B models on $550 AMD GPUs (github.com/zolotukhin)

5 points

2 months ago

330.

Show HN: Llmtop – Htop for LLM Inference Clusters (vLLM, SGLang, Ollama, llama) (github.com/InfraWhisperer)

5 points

3 months ago