Search: github.com/tnfe | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

391.

Cake: Distributed LLM and StableDiffusion inference for mobile desktop or server (github.com/evilsocket)

3 points

a year ago

392.

Geniusrise – inference APIs for text, vision, audio, multi-modal AI models (github.com)

3 points

2 years ago

393.

Show HN: Collider – the platform for local LLM debug and inference at warp speed (github.com/gotzmann)

3 points

3 years ago

394.

Takeoff Inference Server Is Now Open Source (github.com/titanml)

3 points

3 years ago

395.

Show HN: GPT-2 inference on the CPU using C/C++ (github.com/ggerganov)

3 points

4 years ago

396.

Hummingbird,compiles trained ML models into tensor computation for inference (github.com/microsoft)

3 points

tourist_on_road

6 years ago

397.

HE-Transformer: Deep Learning Inference with Homomorphic Encryption (github.com/NervanaSystems)

3 points

7 years ago

398.

Inferring polygon vertices with a VGG-16 model (github.com/AidanRocke)

3 points

8 years ago

399.

Show HN: Piqc – GPU waste scanner for LLM inference clusters (github.com/paralleliq)

3 points

4 days ago

400.

Show HN: Llmff v1.0 FFmpeg for Inference (github.com/syndicalt)

3 points

6 days ago

401.

Atlas TQ1_0 – Pure C++ Ternary (1.58-Bit) Inference Engine for CPU (github.com/xxxn3m3s1sxxx)

3 points

19 days ago

402.

Atlas – Pure Rust Inference Engine (github.com/Avarok-Cybersecurity)

3 points

a month ago

403.

Show HN: Valkyr LM Inference with Realtime Guarantees (github.com/Foundation42)

3 points

a month ago

404.

Show HN: Coelanox – auditable inference runtime in Rust (BERT runs today) (coelanox.com)

3 points

2 months ago

405.

LLM inference load balancer optimized for AMD Radeon VII GPUs (github.com/janit)

3 points

2 months ago

406.

RvLLM: High-performance LLM inference in Rust (github.com/m0at)

3 points

2 months ago

407.

Rust-native hybrid training and inference engine for Apple Neural Engine and GPU (github.com/ncdrone)

3 points

3 months ago

408.

Show HN: Doppler.js – WebGPU inference, faster/simpler than transformer.js

3 points

3 months ago

409.

OMLX – LLM Inference Server for Apple Silicon (Ollama for MLX) (github.com/jundot)

3 points

4 months ago

410.

Free LLM API Resources – A List of Free LLM Inference APIs (github.com/cheahjs)

3 points

4 months ago

411.

A tiny LM that does inference at compile time (github.com/erodola)

3 points

5 months ago

412.

Snow HN: ~950 line inference engine, on par with vLLM (github.com/naklecha)

3 points

5 months ago

413.

Bare-Metal Llama 2 Inference in C++20 (No Frameworks, ARM Neon) (github.com/farukalpay)

3 points

5 months ago

414.

OpenVINO – open-source toolkit for optimizing and deploying AI inference (github.com/openvinotoolkit)

3 points

peter_d_sherman

5 months ago

415.

Show HN: Distributed Storage System to 8x LLM Inference, GPU Training Efficiency (github.com/blackbird-io)

3 points

8 months ago

416.

LLM-optimizer: Benchmark and optimize LLM inference across frameworks with ease (github.com/bentoml)

3 points

9 months ago

417.

LLM Inference in pure Java with a GPU acceleration enabled (github.com/beehive-lab)

3 points

a year ago

418.

Show HN: I made TypeScript's type inference more strict (and smarter) (github.com/kakasoo)

3 points

a year ago

419.

The Path to Open-Sourcing the DeepSeek Inference Engine (github.com/deepseek-ai)

3 points

a year ago

420.

Deepseek CPP for CPU only inference (github.com/andrewkchan)

3 points

a year ago