Heykuki News
Top
New
Best
Ask
Show
Jobs
541.
llm.f90: LLM Inference in Fortran
(github.com/rbitr)
2 points
tosh
2 years ago
542.
SGLang: Fast and Expressive LLM Inference with RadixAttention for 5x Throughput
(github.com/skypilot-org)
2 points
covi
2 years ago
543.
Inference of Mamba models in pure C
(github.com/kroggen)
2 points
kroggen
2 years ago
544.
Mamba LLM Inference on CPU
(github.com/rbitr)
2 points
andy99
2 years ago
545.
Official PR Reveals the Inference Code for Mixtral 8x7B
(github.com/vllm-project)
2 points
georgehill
2 years ago
546.
Stable-fast for SD inference: Faster than AITemplate, On par with TensorRT
(github.com/chengzeyi)
2 points
chengzeyi
3 years ago
547.
DeepSpeed-FastGen: High-Throughput for LLMs via MII and DeepSpeed-Inference
(github.com/microsoft)
2 points
CharlesW
3 years ago
548.
Show HN: Llama2 inference in one file of pure OCaml
(github.com/jackpeck)
2 points
0c
3 years ago
549.
Tairov/llama2.mojo: Inference Llama 2 in one file of pure
(github.com/tairov)
2 points
freediver
3 years ago
550.
Llama2 Inference in pure Mojo
(github.com/tairov)
2 points
atairov
3 years ago
551.
OpenTau – Using Large Language Models for Gradual Type Inference
(github.com/GammaTauAI)
2 points
bcjordan
3 years ago
552.
OpenAI Function Calling Helper Library – Infer Python Function JSON Schema
(github.com/jakecyr)
2 points
jakecyr
3 years ago
553.
LMFlow – Toolkit for Finetuning and Inference of Large Foundation Models
(github.com/OptimalScale)
2 points
T-A
3 years ago
554.
Show HN: Python Monitoring for LLMs, OpenAI, Inference, GPUs
(github.com/graphsignal)
2 points
npgraph
3 years ago
555.
Inference at the Edge
(github.com/ggerganov)
2 points
georgehill
3 years ago
556.
VoltaML – convert DL models in high performance inference runtimes
(github.com/VoltaML)
2 points
yolo123
3 years ago
557.
Posteriordb: Database with posteriors of interest for Bayesian inference
(github.com/stan-dev)
2 points
Tomte
4 years ago
558.
Machine Learning and Causal Inference Taught by Brigham Frandsen
(github.com/Mixtape-Sessions)
2 points
simonpure
4 years ago
559.
Stable Diffusion inference on iOS / macOS using MPSGraph
(github.com/madebyollin)
2 points
ollin
4 years ago
560.
Hugging Face Transformers – Big Model Inference, Bloom, GPT Neo-X, LongT5 etc.
(github.com/huggingface)
2 points
jedwhite
4 years ago
561.
Neural network inference the Unix way
(github.com/cloudkj)
2 points
behnamoh
4 years ago
562.
Static analysis and type inference for SQL strings in PHP
(github.com/staabm)
2 points
treve
4 years ago
563.
Turbo_transformers: Fast Transformer Inference on CPU and GPU
(github.com/Tencent)
2 points
yzh
6 years ago
564.
Torchlayers: Shape Inference for PyTorch
(github.com/szymonmaszke)
2 points
jonbaer
6 years ago
565.
Best Practices for JEP 286, Java Local Variable Type Inference (LVTI)
(github.com/PacktPublishing)
2 points
AnghelLeonard
6 years ago
566.
Xilinx Vitis AI – New Development Stack for AI Inference on Xilinx FPGA and ACAP
(github.com/Xilinx)
2 points
KenanSulayman
7 years ago
567.
Marko.js fastest on Node, React-compatible Inferno.js speed king in browser
(github.com/marko-js)
2 points
jhsware
7 years ago
568.
Ncnn: A high-performance neural network inference framework optimized for mobile
(github.com/Tencent)
2 points
homarp
7 years ago
569.
Secure TensorFlow Inference with Multi-Party Computation
(github.com/mpc-msri)
2 points
mayank0403
7 years ago
570.
Tract: Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference
(github.com/snipsco)
2 points
Datenstrom
7 years ago