Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
31.
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores (hazyresearch.stanford.edu)
3 points
panabee
3 years ago
discuss
32.
HyenaDNA: Learning from DNA with 1M token context (hazyresearch.stanford.edu)
3 points
beefman
3 years ago
discuss
33.
AI's Linux Moment: An Open-Source AI Model Love Note (hazyresearch.stanford.edu)
3 points
tim_sw
3 years ago
discuss
34.
Minions: The rise of small, on-device LMs (hazyresearch.stanford.edu)
2 points
kiyanwang
a year ago
1 comment
35.
Zoology 1: Measuring and Improving Recall in Efficient Language Models (hazyresearch.stanford.edu)
2 points
convexstrictly
2 years ago
1 comment
36.
Pixelated Butterfly (hazyresearch.stanford.edu)
2 points
sdenton4
3 years ago
1 comment
37.
Stuffing MLPs Full of Facts: A Generative Approach to Factual Recall (hazyresearch.stanford.edu)
2 points
hessdalenlight
6 months ago
discuss
38.
ThunderMittens for Your ThunderKittens (hazyresearch.stanford.edu)
2 points
mpweiher
7 months ago
discuss
39.
An Unserious Persons Take on Axiomatic Knowledge in the Era of Foundation Models (hazyresearch.stanford.edu)
2 points
LionTurtle13
2 years ago
discuss
40.
An Unserious Take on Axiomatic Knowledge in the Era of Foundation Models (hazyresearch.stanford.edu)
2 points
jxmorris12
2 years ago
discuss
41.
Linearizing LLMs with LoLCATs (hazyresearch.stanford.edu)
2 points
jasondavies
2 years ago
discuss
42.
Efficient language models as arithmetic circuits (hazyresearch.stanford.edu)
2 points
colinprince
2 years ago
discuss
43.
Combining Continuous-Time, Recurrent, and Convolutional Models (hazyresearch.stanford.edu)
2 points
georgehill
2 years ago
discuss
44.
HyenaDNA: Learning from DNA with 1M token context (hazyresearch.stanford.edu)
2 points
thunderbong
3 years ago
discuss
45.
Hyena Hierarchy: Towards Larger Convolutional Language Models (hazyresearch.stanford.edu)
2 points
quantisan
3 years ago
discuss
46.
From Deep to Long Learning? (hazyresearch.stanford.edu)
2 points
sebg
3 years ago
discuss
47.
Based: An Educational and Effective Sequence Mixer (hazyresearch.stanford.edu)
1 point
pama
2 years ago
1 comment
48.
ThunderKittens 2.0: even faster kernels for your GPUs (hazyresearch.stanford.edu)
1 point
ecesena
3 months ago
discuss
49.
Loads and Loads of Fluffy Kittens (hazyresearch.stanford.edu)
1 point
todsacerdoti
7 months ago
discuss
50.
Intelligence per Watt: A Study of Local Intelligence Efficiency (hazyresearch.stanford.edu)
1 point
simonpure
7 months ago
discuss
51.
Cartridges: Store long contexts in tiny caches with LLM self-study (hazyresearch.stanford.edu)
1 point
dvrp
a year ago
discuss
52.
Mind the Trust Gap: Fast, Private Local-to-Cloud LLM Chat (hazyresearch.stanford.edu)
1 point
tmoertel
a year ago
discuss
53.
Correcting and Improving LLM Predictions Without Labels (hazyresearch.stanford.edu)
1 point
nihit-desai
3 years ago
discuss
54.
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning (hazyresearch.stanford.edu)
1 point
todsacerdoti
3 years ago
discuss
55.
H3: Language Modeling with State Space Models and (Almost) No Attention (hazyresearch.stanford.edu)
1 point
anewhnaccount2
3 years ago
discuss
56.
Hyena Hierarchy: Towards Larger Convolutional Language Models (hazyresearch.stanford.edu)
1 point
pmoriarty
3 years ago
discuss
57.
Chris Re: Is AI Rare or Everywhere? (hazyresearch.stanford.edu)
1 point
tim_sw
3 years ago
discuss
58.
Hyena Hierarchy: Towards Larger Convolutional Language Models (hazyresearch.stanford.edu)
1 point
chriskanan
3 years ago
discuss
59.
Can Longer Sequences Help Take the Next Leap in AI? · Hazy Research (hazyresearch.stanford.edu)
1 point
bilsbie
4 years ago
discuss
60.
HiPPO: Recurrent Memory with Optimal Polynomial Projections (hazyresearch.stanford.edu)
1 point
0mp
5 years ago
discuss
More