Heykuki News

TopNewBestAskShowJobs
541.
llm.f90: LLM Inference in Fortran (github.com/rbitr)
2 points
tosh
2 years ago
discuss
542.
SGLang: Fast and Expressive LLM Inference with RadixAttention for 5x Throughput (github.com/skypilot-org)
2 points
covi
2 years ago
discuss
543.
Inference of Mamba models in pure C (github.com/kroggen)
2 points
kroggen
2 years ago
discuss
544.
Mamba LLM Inference on CPU (github.com/rbitr)
2 points
andy99
2 years ago
discuss
545.
Official PR Reveals the Inference Code for Mixtral 8x7B (github.com/vllm-project)
2 points
georgehill
2 years ago
discuss
546.
Stable-fast for SD inference: Faster than AITemplate, On par with TensorRT (github.com/chengzeyi)
2 points
chengzeyi
3 years ago
discuss
547.
DeepSpeed-FastGen: High-Throughput for LLMs via MII and DeepSpeed-Inference (github.com/microsoft)
2 points
CharlesW
3 years ago
discuss
548.
Show HN: Llama2 inference in one file of pure OCaml (github.com/jackpeck)
2 points
0c
3 years ago
discuss
549.
Tairov/llama2.mojo: Inference Llama 2 in one file of pure (github.com/tairov)
2 points
freediver
3 years ago
discuss
550.
Llama2 Inference in pure Mojo (github.com/tairov)
2 points
atairov
3 years ago
discuss
551.
OpenTau – Using Large Language Models for Gradual Type Inference (github.com/GammaTauAI)
2 points
bcjordan
3 years ago
discuss
552.
OpenAI Function Calling Helper Library – Infer Python Function JSON Schema (github.com/jakecyr)
2 points
jakecyr
3 years ago
discuss
553.
LMFlow – Toolkit for Finetuning and Inference of Large Foundation Models (github.com/OptimalScale)
2 points
T-A
3 years ago
discuss
554.
Show HN: Python Monitoring for LLMs, OpenAI, Inference, GPUs (github.com/graphsignal)
2 points
npgraph
3 years ago
discuss
555.
Inference at the Edge (github.com/ggerganov)
2 points
georgehill
3 years ago
discuss
556.
VoltaML – convert DL models in high performance inference runtimes (github.com/VoltaML)
2 points
yolo123
3 years ago
discuss
557.
Posteriordb: Database with posteriors of interest for Bayesian inference (github.com/stan-dev)
2 points
Tomte
4 years ago
discuss
558.
Machine Learning and Causal Inference Taught by Brigham Frandsen (github.com/Mixtape-Sessions)
2 points
simonpure
4 years ago
discuss
559.
Stable Diffusion inference on iOS / macOS using MPSGraph (github.com/madebyollin)
2 points
ollin
4 years ago
discuss
560.
Hugging Face Transformers – Big Model Inference, Bloom, GPT Neo-X, LongT5 etc. (github.com/huggingface)
2 points
jedwhite
4 years ago
discuss
561.
Neural network inference the Unix way (github.com/cloudkj)
2 points
behnamoh
4 years ago
discuss
562.
Static analysis and type inference for SQL strings in PHP (github.com/staabm)
2 points
treve
4 years ago
discuss
563.
Turbo_transformers: Fast Transformer Inference on CPU and GPU (github.com/Tencent)
2 points
yzh
6 years ago
discuss
564.
Torchlayers: Shape Inference for PyTorch (github.com/szymonmaszke)
2 points
jonbaer
6 years ago
discuss
565.
Best Practices for JEP 286, Java Local Variable Type Inference (LVTI) (github.com/PacktPublishing)
2 points
AnghelLeonard
6 years ago
discuss
566.
Xilinx Vitis AI – New Development Stack for AI Inference on Xilinx FPGA and ACAP (github.com/Xilinx)
2 points
KenanSulayman
7 years ago
discuss
567.
Marko.js fastest on Node, React-compatible Inferno.js speed king in browser (github.com/marko-js)
2 points
jhsware
7 years ago
discuss
568.
Ncnn: A high-performance neural network inference framework optimized for mobile (github.com/Tencent)
2 points
homarp
7 years ago
discuss
569.
Secure TensorFlow Inference with Multi-Party Computation (github.com/mpc-msri)
2 points
mayank0403
7 years ago
discuss
570.
Tract: Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference (github.com/snipsco)
2 points
Datenstrom
7 years ago
discuss