Heykuki News
Top
New
Best
Ask
Show
Jobs
31.
▲
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
(hazyresearch.stanford.edu)
3 points
panabee
3 years ago
discuss
32.
▲
HyenaDNA: Learning from DNA with 1M token context
(hazyresearch.stanford.edu)
3 points
beefman
3 years ago
discuss
33.
▲
AI's Linux Moment: An Open-Source AI Model Love Note
(hazyresearch.stanford.edu)
3 points
tim_sw
3 years ago
discuss
34.
▲
Minions: The rise of small, on-device LMs
(hazyresearch.stanford.edu)
2 points
kiyanwang
a year ago
1 comment
35.
▲
Zoology 1: Measuring and Improving Recall in Efficient Language Models
(hazyresearch.stanford.edu)
2 points
convexstrictly
2 years ago
1 comment
36.
▲
Pixelated Butterfly
(hazyresearch.stanford.edu)
2 points
sdenton4
3 years ago
1 comment
37.
▲
Stuffing MLPs Full of Facts: A Generative Approach to Factual Recall
(hazyresearch.stanford.edu)
2 points
hessdalenlight
6 months ago
discuss
38.
▲
ThunderMittens for Your ThunderKittens
(hazyresearch.stanford.edu)
2 points
mpweiher
7 months ago
discuss
39.
▲
An Unserious Person's Take on Axiomatic Knowledge in the Era of Foundation Models
(hazyresearch.stanford.edu)
2 points
LionTurtle13
2 years ago
discuss
40.
▲
An Unserious Take on Axiomatic Knowledge in the Era of Foundation Models
(hazyresearch.stanford.edu)
2 points
jxmorris12
2 years ago
discuss
41.
▲
Linearizing LLMs with LoLCATs
(hazyresearch.stanford.edu)
2 points
jasondavies
2 years ago
discuss
42.
▲
Efficient language models as arithmetic circuits
(hazyresearch.stanford.edu)
2 points
colinprince
2 years ago
discuss
43.
▲
Combining Continuous-Time, Recurrent, and Convolutional Models
(hazyresearch.stanford.edu)
2 points
georgehill
2 years ago
discuss
44.
▲
HyenaDNA: Learning from DNA with 1M token context
(hazyresearch.stanford.edu)
2 points
thunderbong
3 years ago
discuss
45.
▲
Hyena Hierarchy: Towards Larger Convolutional Language Models
(hazyresearch.stanford.edu)
2 points
quantisan
3 years ago
discuss
46.
▲
From Deep to Long Learning?
(hazyresearch.stanford.edu)
2 points
sebg
3 years ago
discuss
47.
▲
Based: An Educational and Effective Sequence Mixer
(hazyresearch.stanford.edu)
1 point
pama
2 years ago
1 comment
48.
▲
ThunderKittens 2.0: even faster kernels for your GPUs
(hazyresearch.stanford.edu)
1 point
ecesena
3 months ago
discuss
49.
▲
Loads and Loads of Fluffy Kittens
(hazyresearch.stanford.edu)
1 point
todsacerdoti
7 months ago
discuss
50.
▲
Intelligence per Watt: A Study of Local Intelligence Efficiency
(hazyresearch.stanford.edu)
1 point
simonpure
7 months ago
discuss
51.
▲
Cartridges: Store long contexts in tiny caches with LLM self-study
(hazyresearch.stanford.edu)
1 point
dvrp
a year ago
discuss
52.
▲
Mind the Trust Gap: Fast, Private Local-to-Cloud LLM Chat
(hazyresearch.stanford.edu)
1 point
tmoertel
a year ago
discuss
53.
▲
Correcting and Improving LLM Predictions Without Labels
(hazyresearch.stanford.edu)
1 point
nihit-desai
3 years ago
discuss
54.
▲
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
(hazyresearch.stanford.edu)
1 point
todsacerdoti
3 years ago
discuss
55.
▲
H3: Language Modeling with State Space Models and (Almost) No Attention
(hazyresearch.stanford.edu)
1 point
anewhnaccount2
3 years ago
discuss
56.
▲
Hyena Hierarchy: Towards Larger Convolutional Language Models
(hazyresearch.stanford.edu)
1 point
pmoriarty
3 years ago
discuss
57.
▲
Chris Re: Is AI Rare or Everywhere?
(hazyresearch.stanford.edu)
1 point
tim_sw
3 years ago
discuss
58.
▲
Hyena Hierarchy: Towards Larger Convolutional Language Models
(hazyresearch.stanford.edu)
1 point
chriskanan
3 years ago
discuss
59.
▲
Can Longer Sequences Help Take the Next Leap in AI?
(hazyresearch.stanford.edu)
1 point
bilsbie
4 years ago
discuss
60.
▲
HiPPO: Recurrent Memory with Optimal Polynomial Projections
(hazyresearch.stanford.edu)
1 point
0mp
5 years ago
discuss