Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
31.
▲
Show HN: Nano-web – a low latency one binary webserver designed for serving SPAs
(github.com/radiosilence)
157 points
antihero
2 years ago
115 comments
32.
▲
Nvidia Dynamo: A Datacenter Scale Distributed Inference Serving Framework
(github.com/ai-dynamo)
150 points
ashvardanian
a year ago
39 comments
33.
▲
Show HN: Blast – Fast, multi-threaded serving engine for web browsing AI agents
(github.com/stanford-mast)
145 points
calebhwin
a year ago
66 comments
34.
▲
Punica: Serving multiple LoRA finetuned LLM as one
(github.com/punica-ai)
135 points
abcdabcd987
3 years ago
26 comments
35.
▲
S-LoRA: Serving Concurrent LoRA Adapters
(github.com/S-LoRA)
73 points
Labo333
2 years ago
20 comments
36.
▲
Kvcached: Virtualized, elastic KV cache for LLM serving on shared GPUs
(notion.so)
69 points
Jrxing
7 months ago
13 comments
37.
▲
Show HN: Cortex – Open-source alternative to SageMaker for model serving
(github.com/cortexlabs)
65 points
calebkaiser
6 years ago
19 comments
38.
▲
My first simple 97-loc Sinatra/Freebase/Heroku single-serving app
(github.com/bkudria)
23 points
bkudria
17 years ago
8 comments
39.
▲
Clipper: A prediction serving system for TensorFlow, PyTorch, PySpark and others
(github.com/ucbrise)
13 points
kot-behemoth
7 years ago
5 comments
40.
▲
Serving both sync and async/comet HTTP with RingoJS
(hns.github.com)
13 points
hannesw
16 years ago
2 comments
41.
▲
Mooncake: A KVCache-Centric Disaggregated Architecture for LLM Serving
(github.com/kvcache-ai)
13 points
zinccat
2 years ago
discuss
42.
▲
Yahoo Cloud Serving Benchmark
(wiki.github.com)
12 points
helwr
16 years ago
discuss
43.
▲
BentoML: A platform for serving and deploying machine learning models
(github.com/bentoml)
10 points
kevlar1818
7 years ago
5 comments
44.
▲
Simple Agent API: A Minimal Setup for Serving Agents with FastAPI and Postgres
(github.com/agno-agi)
10 points
bediashpreet
a year ago
discuss
45.
▲
Chronon: A data platform for serving for AI/ML applications
(github.com/airbnb)
8 points
tanelpoder
9 months ago
2 comments
46.
▲
TorchServe: Model Serving Library for PyTorch
(github.com/pytorch)
8 points
gtrevize
6 years ago
1 comment
47.
▲
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving
(github.com/kvcache-ai)
8 points
sarkory
a year ago
discuss
48.
▲
A full-stack Python model serving library
(github.com/Lightning-AI)
8 points
theaniketmaurya
2 years ago
discuss
49.
▲
BentoML: An open-source platform for ML model serving
(github.com/bentoml)
8 points
paranoyang
6 years ago
discuss
50.
▲
Show HN: Serving Django and Twisted using HAproxy
(gist.github.com)
7 points
sspross
13 years ago
4 comments
51.
▲
Ask HN: Are there any reliable benchmarks for Machine Learning Model Serving?
6 points
KuriousCat
2 years ago
3 comments
52.
▲
Show HN: PyPI server for serving Python packages out of GitHub
(github.com/brettlangdon)
6 points
brettlangdon
9 years ago
1 comment
53.
▲
Show HN: Deadsimple – A static site server serving Markdown as Bootstrap HTML
(github.com/ncthis)
5 points
lunarcave
12 years ago
9 comments
54.
▲
Turning PostgreSQL into a queue serving 10,000 jobs per second
(gist.github.com)
5 points
chanks
13 years ago
discuss
55.
▲
Show HN: ServeIt – simple API serving for Python ML models
(github.com/rtlee9)
5 points
ryantl
8 years ago
discuss
56.
▲
Show HN: A CLI tool that speeds carthage build by serving pre-built frameworks
5 points
n4rc071x
10 years ago
discuss
57.
▲
Show HN: One-stop ML model managing, converting, profiling and serving platform
(github.com/cap-ntu)
4 points
huangyz0918
6 years ago
3 comments
58.
▲
PowerInfer: High-Speed Large Language Model Serving on Consumer-Grade GPUs
(github.com/SJTU-IPADS)
4 points
limoce
2 years ago
1 comment
59.
▲
Show HN: Mosec makes the machine learning model serving flexible and efficient
(github.com/mosecorg)
4 points
urcyanide
5 years ago
1 comment
60.
▲
Show HN: InferMesh – Open-source, GPU-aware inference mesh for large AI serving
(github.com/redbco)
4 points
tommihip
9 months ago
discuss
More