Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
1.
▲
Show HN: Monitor macOS stats and alert upon reaching configurable thresholds
(github.com/spekulant)
2 points
tomaszsobota
a year ago
discuss
2.
▲
Show HN: Speculative Decoding from Scratch in PyTorch (2.8x CPU Speedup)
(github.com/kunal51107)
4 points
kunal51107
6 months ago
1 comment
3.
▲
CPU security bugs caused by speculative execution
(github.com/marcan)
3 points
delroth
8 years ago
discuss
4.
▲
Speculative decoding of llama2 models in pure C
(github.com/mscheong01)
2 points
mscheong
2 years ago
discuss
5.
▲
Exploring speculation side-channel attacks in Java
(github.com/steveloughran)
1 point
based2
8 years ago
discuss
6.
▲
Unofficial FAQ on CPU Speculative Execution Bugs
(github.com/marcan)
1 point
transpute
8 years ago
discuss
7.
▲
Ruby port of clojure.spec
(github.com/english)
1 point
tosh
9 years ago
discuss
8.
▲
Omnibox Prerendering coming in Chromium 100
(github.com/WICG)
2 points
frankjr
4 years ago
discuss
9.
▲
Qwen3.6-35B-A3B speculative decoding is net-negative on RTX 3090
(github.com/thc1006)
5 points
thc1006
a month ago
2 comments
10.
▲
An implementation of the Speculative Paxos protocol
(github.com/UWSysLab)
57 points
drkp
10 years ago
15 comments
11.
▲
Speculative: PoC for speeding-up inference via speculative sampling by ggerganov
(github.com/ggerganov)
16 points
kristianp
3 years ago
1 comment
12.
▲
X86-64 Speculative Execution Harness
(gist.github.com)
4 points
EwanToo
8 years ago
4 comments
13.
▲
Llama.cpp speculative sampling: 2x faster inference for large models
(github.com/ggerganov)
4 points
bobivl
3 years ago
1 comment
14.
▲
My speculations writing a coding platform in 8 weeks as a highschooler
4 points
tr1ll10nb1ll
6 years ago
1 comment
15.
▲
Accelerating LLM Serving with Speculative Inference and Token Tree Verification
(github.com/flexflow)
3 points
zhihaojia
3 years ago
1 comment
16.
▲
Cascadeflow: Cut AI API costs 40-85% with speculative model cascading
(github.com/lemony-ai)
3 points
saschabuehrle
7 months ago
discuss
17.
▲
Eagle-3 Speculative Decoding for LLM Inference (5.6x speedup)
(github.com/SafeAILab)
2 points
summarity
a year ago
discuss
18.
▲
MLX: Speculative Decoding
(github.com/ml-explore)
2 points
tosh
2 years ago
discuss
19.
▲
BranchFS is a FUSE-based filesystem enables speculative branching for AI agents
(github.com/multikernel)
1 point
wang_cong
4 months ago
1 comment
20.
▲
Speculations on Test-Time Scaling (OpenAI O1 Model)
(github.com/srush)
1 point
d4rkp4ttern
2 years ago
1 comment
21.
▲
Speculative Speculative Decoding: Really, Really Fast LLM Inference
(github.com/tanishqkumar)
1 point
fizzbuzz07
3 months ago
discuss
22.
▲
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
(github.com/facebookresearch)
1 point
zerojames
2 years ago
discuss
23.
▲
Speculate HN: How does OpenAI's GPT Builder work?
1 point
arthurcolle
2 years ago
discuss
24.
▲
Show HN: Prefetch links on hover and prerender via Speculation Rules API
(github.com/midzer)
1 point
midzer
3 years ago
discuss
25.
▲
Mitigating Speculative Attacks in Crypto
(github.com/HACS-workshop)
1 point
twoodfin
8 years ago
discuss
26.
▲
Show HN: Handwriter.ttf – Handwriting Synthesis with Harfbuzz WASM
(github.com/hsfzxjy)
191 points
hsfzxjy
2 years ago
53 comments
27.
▲
Ask HN: What Happened to GitHub's Atom?
59 points
jonny383
7 years ago
46 comments
28.
▲
Show HN: Run 500B+ Parameter LLMs Locally on a Mac Mini
(github.com/opengraviton)
17 points
fatihturker
3 months ago
10 comments
29.
▲
Show HN: OpenGraviton – Run 500B+ parameter models on a consumer Mac Mini
(opengraviton.github.io)
13 points
fatihturker
3 months ago
5 comments
30.
▲
Share my pain point: I want dead easy version control.
12 points
impendia
14 years ago
13 comments
More