Heykuki News
1.
Ask HN: Are you training and running custom LLMs and how are you doing it?
15 points
kordlessagain
3 years ago
2 comments
2.
vLLM introduces memory optimizations for long-context inference
(github.com/vllm-project)
5 points
addisud
2 months ago
discuss
3.
vLLM IR: A Functional Intermediate Representation for vLLM
(github.com/vllm-project)
4 points
matt_d
2 months ago
discuss
4.
Vllm: High-throughput and memory-efficient inference and serving engine for LLMs
(github.com/vllm-project)
3 points
tosh
3 years ago
discuss
5.
Vllm
(github.com/vllm-project)
3 points
kordlessagain
3 years ago
discuss
6.
VLLM-Omni: A framework for efficient model inference with Omni-modality models
(github.com/vllm-project)
2 points
zyh888
6 months ago
1 comment
7.
vLLM (high-throughput LLM serving engine)
(github.com/vllm-project)
2 points
roody_wurlitzer
3 months ago
discuss
8.
Easy, fast, and cheap LLM serving for everyone
(github.com/vllm-project)
2 points
vincent_s
2 years ago
discuss
9.
Official PR Reveals the Inference Code for Mixtral 8x7B
(github.com/vllm-project)
2 points
georgehill
2 years ago
discuss
10.
VLLM
(github.com/vllm-project)
2 points
sherlockxu
3 years ago
discuss
11.
LLM compressor: compress models for efficient deployment
(github.com/vllm-project)
1 point
hajduksplit
2 years ago
1 comment
12.
vLLM multi-turn conversations design
(github.com/vllm-project)
1 point
CCs
4 months ago
discuss
13.
Cost-efficient and pluggable Infrastructure components for GenAI inference
(github.com/vllm-project)
1 point
rrampage
a year ago
discuss
14.
Cost-efficient and pluggable Infrastructure components for GenAI inference
(github.com/vllm-project)
1 point
delduca
a year ago
discuss
15.
VLLM Sacrifices Accuracy for Speed
(github.com/vllm-project)
1 point
behnamoh
2 years ago
discuss
16.
vllm
(github.com/vllm-project)
1 point
tosh
2 years ago
discuss
17.
Mixtral Expert Parallelism
(github.com/vllm-project)
1 point
tosh
2 years ago
discuss
18.
I made a GitHub repo for (beginner) Python devs using LangChain for LLM projects
(github.com/lypsoty112)
1 point
MaartenBoon
2 years ago
1 comment
19.
Feedback on an open source Ruby – LLM project
(github.com/pcarolan)
7 points
pcarolan
6 months ago
1 comment
20.
Memex: Rust powered “memory” (doc store and semantic search) for LLM projects
(github.com/spyglass-search)
2 points
homarp
3 years ago
discuss
21.
Show HN: Νοῦς – A Customizable LLM Project
(github.com/Albertlungu)
1 point
albertlungu
6 months ago
discuss
22.
Show HN: Laminar – Open-Source DataDog + PostHog for LLM Apps, Built in Rust
(github.com/lmnr-ai)
203 points
skull8888888
2 years ago
45 comments
23.
Show HN: OpenLIT – Open-Source LLM Observability with OpenTelemetry
(github.com/openlit)
62 points
aman_041
2 years ago
22 comments
24.
Show HN: OxyJen – Java framework to orchestrate LLMs in a graph-style execution
2 points
bdivyansh11
3 months ago
discuss
25.
Show HN: Tokuin – CLI load tester and token estimator for LLM APIs
(github.com/nooscraft)
2 points
oshadha89
7 months ago
discuss
26.
OpenLIT – Open-Source LLM Observability with OpenTelemetry
2 points
aman_041
2 years ago
discuss
27.
Show HN: Hnsqlite: hnswlib and SQLite integrated for text embedding search
(github.com/jiggy-ai)
2 points
wskish
3 years ago
discuss
28.
Show HN: LLM AuthZ Audit – find auth gaps and prompt injection in LLM apps
(github.com/aiauthz)
1 point
iamspathan
4 months ago
discuss
29.
Show HN: Ask AI Paul Graham
(pocket-pg-851564657364.us-east1.run.app)
1 point
zh2408
a year ago
discuss
30.
Show HN: OpenLIT – Open-Source LLM Observability with OpenTelemetry
(github.com/openlit)
1 point
patcher99
2 years ago
discuss