Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
31.
Show HN: Nano-web – a low latency one binary webserver designed for serving SPAs (github.com/radiosilence)
157 points
antihero
2 years ago
115 comments
32.
Nvidia Dynamo: A Datacenter Scale Distributed Inference Serving Framework (github.com/ai-dynamo)
150 points
ashvardanian
a year ago
39 comments
33.
Show HN: Blast – Fast, multi-threaded serving engine for web browsing AI agents (github.com/stanford-mast)
145 points
calebhwin
a year ago
66 comments
34.
Punica: Serving multiple LoRA finetuned LLM as one (github.com/punica-ai)
135 points
abcdabcd987
3 years ago
26 comments
35.
S-LoRA: Serving Concurrent LoRA Adapters (github.com/S-LoRA)
73 points
Labo333
2 years ago
20 comments
36.
Kvcached: Virtualized, elastic KV cache for LLM serving on shared GPUs (notion.so)
69 points
Jrxing
7 months ago
13 comments
37.
Show HN: Cortex – Open-source alternative to SageMaker for model serving (github.com/cortexlabs)
65 points
calebkaiser
6 years ago
19 comments
38.
My first simple 97-loc Sinatra/Freebase/Heroku single-serving app (github.com/bkudria)
23 points
bkudria
17 years ago
8 comments
39.
Clipper: A prediction serving system for TensorFlow, PyTorch, PySpark and others (github.com/ucbrise)
13 points
kot-behemoth
7 years ago
5 comments
40.
Serving both sync and async/comet HTTP with RingoJS (hns.github.com)
13 points
hannesw
16 years ago
2 comments
41.
Mooncake: A KVCache-Centric Disaggregated Architecture for LLM Serving (github.com/kvcache-ai)
13 points
zinccat
2 years ago
discuss
42.
Yahoo Cloud Serving Benchmark (wiki.github.com)
12 points
helwr
16 years ago
discuss
43.
BentoML: A platform for serving and deploying machine learning models (github.com/bentoml)
10 points
kevlar1818
7 years ago
5 comments
44.
Simple Agent API: A Minimal Setup for Serving Agents with FastAPI and Postgres (github.com/agno-agi)
10 points
bediashpreet
a year ago
discuss
45.
Chronon: A data platform for serving for AI/ML applications (github.com/airbnb)
8 points
tanelpoder
9 months ago
2 comments
46.
TorchServe: Model Serving Library for PyTorch (github.com/pytorch)
8 points
gtrevize
6 years ago
1 comment
47.
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving (github.com/kvcache-ai)
8 points
sarkory
a year ago
discuss
48.
A full-stack Python model serving library (github.com/Lightning-AI)
8 points
theaniketmaurya
2 years ago
discuss
49.
BentoML: An open-source platform for ML model serving (github.com/bentoml)
8 points
paranoyang
6 years ago
discuss
50.
Show HN: Serving Django and Twisted using HAproxy (gist.github.com)
7 points
sspross
13 years ago
4 comments
51.
Ask HN: Are there any reliable benchmarks for Machine Learning Model Serving?
6 points
KuriousCat
2 years ago
3 comments
52.
Show HN: PyPI server for serving Python packages out of GitHub (github.com/brettlangdon)
6 points
brettlangdon
9 years ago
1 comment
53.
Show HN: Deadsimple – A static site server serving Markdown as Bootstrap HTML (github.com/ncthis)
5 points
lunarcave
12 years ago
9 comments
54.
Turning PostgreSQL into a queue serving 10,000 jobs per second (gist.github.com)
5 points
chanks
13 years ago
discuss
55.
Show HN: ServeIt – simple API serving for Python ML models (github.com/rtlee9)
5 points
ryantl
8 years ago
discuss
56.
Show HN: A CLI tool that speeds carthage build by serving pre-built frameworks
5 points
n4rc071x
10 years ago
discuss
57.
Show HN: One-stop ML model managing, converting, profiling and serving platform (github.com/cap-ntu)
4 points
huangyz0918
6 years ago
3 comments
58.
PowerInfer: High-Speed Large Language Model Serving on Consumer-Grade GPUs (github.com/SJTU-IPADS)
4 points
limoce
2 years ago
1 comment
59.
Show HN: Mosec makes the machine learning model serving flexible and efficient (github.com/mosecorg)
4 points
urcyanide
5 years ago
1 comment
60.
Show HN: InferMesh – Open-source, GPU-aware inference mesh for large AI serving (github.com/redbco)
4 points
tommihip
9 months ago
discuss
More