Search: github.com/servian | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

31.

Show HN: Nano-web – a low latency one binary webserver designed for serving SPAs (github.com/radiosilence)

157 points

2 years ago

32.

Nvidia Dynamo: A Datacenter Scale Distributed Inference Serving Framework (github.com/ai-dynamo)

150 points

a year ago

33.

Show HN: Blast – Fast, multi-threaded serving engine for web browsing AI agents (github.com/stanford-mast)

145 points

a year ago

34.

Punica: Serving multiple LoRA finetuned LLM as one (github.com/punica-ai)

135 points

3 years ago

35.

S-LoRA: Serving Concurrent LoRA Adapters (github.com/S-LoRA)

73 points

2 years ago

36.

Kvcached: Virtualized, elastic KV cache for LLM serving on shared GPUs (notion.so)

69 points

7 months ago

37.

Show HN: Cortex – Open-source alternative to SageMaker for model serving (github.com/cortexlabs)

65 points

6 years ago

38.

My first simple 97-loc Sinatra/Freebase/Heroku single-serving app (github.com/bkudria)

23 points

17 years ago

39.

Clipper: A prediction serving system for TensorFlow, PyTorch, PySpark and others (github.com/ucbrise)

13 points

7 years ago

40.

Serving both sync and async/comet HTTP with RingoJS (hns.github.com)

13 points

16 years ago

41.

Mooncake: A KVCache-Centric Disaggregated Architecture for LLM Serving (github.com/kvcache-ai)

13 points

2 years ago

42.

Yahoo Cloud Serving Benchmark (wiki.github.com)

12 points

16 years ago

43.

BentoML: A platform for serving and deploying machine learning models (github.com/bentoml)

10 points

7 years ago

44.

Simple Agent API: A Minimal Setup for Serving Agents with FastAPI and Postgres (github.com/agno-agi)

10 points

a year ago

45.

Chronon: A data platform for serving for AI/ML applications (github.com/airbnb)

8 points

9 months ago

46.

TorchServe: Model Serving Library for PyTorch (github.com/pytorch)

8 points

6 years ago

47.

Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving (github.com/kvcache-ai)

8 points

a year ago

48.

A full-stack Python model serving library (github.com/Lightning-AI)

8 points

theaniketmaurya

2 years ago

49.

BentoML: An open-source platform for ML model serving (github.com/bentoml)

8 points

6 years ago

50.

Show HN: Serving Django and Twisted using HAproxy (gist.github.com)

7 points

13 years ago

51.

Ask HN: Are there any reliable benchmarks for Machine Learning Model Serving?

6 points

2 years ago

52.

Show HN: PyPI server for serving Python packages out of GitHub (github.com/brettlangdon)

6 points

9 years ago

53.

Show HN: Deadsimple – A static site server serving Markdown as Bootstrap HTML (github.com/ncthis)

5 points

12 years ago

54.

Turning PostgreSQL into a queue serving 10,000 jobs per second (gist.github.com)

5 points

13 years ago

55.

Show HN: ServeIt – simple API serving for Python ML models (github.com/rtlee9)

5 points

8 years ago

56.

Show HN: A CLI tool that speeds carthage build by serving pre-built frameworks

5 points

10 years ago

57.

Show HN: One-stop ML model managing, converting, profiling and serving platform (github.com/cap-ntu)

4 points

6 years ago

58.

PowerInfer: High-Speed Large Language Model Serving on Consumer-Grade GPUs (github.com/SJTU-IPADS)

4 points

2 years ago

59.

Show HN: Mosec makes the machine learning model serving flexible and efficient (github.com/mosecorg)

4 points

5 years ago

60.

Show HN: InferMesh – Open-source, GPU-aware inference mesh for large AI serving (github.com/redbco)

4 points

9 months ago