Search: hazyresearch.stanford.edu | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

31.

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores (hazyresearch.stanford.edu)

3 points

3 years ago

32.

HyenaDNA: Learning from DNA with 1M token context (hazyresearch.stanford.edu)

3 points

3 years ago

33.

AI's Linux Moment: An Open-Source AI Model Love Note (hazyresearch.stanford.edu)

3 points

3 years ago

34.

Minions: The rise of small, on-device LMs (hazyresearch.stanford.edu)

2 points

a year ago

35.

Zoology 1: Measuring and Improving Recall in Efficient Language Models (hazyresearch.stanford.edu)

2 points

2 years ago

36.

Pixelated Butterfly (hazyresearch.stanford.edu)

2 points

3 years ago

37.

Stuffing MLPs Full of Facts: A Generative Approach to Factual Recall (hazyresearch.stanford.edu)

2 points

6 months ago

38.

ThunderMittens for Your ThunderKittens (hazyresearch.stanford.edu)

2 points

7 months ago

39.

An Unserious Person's Take on Axiomatic Knowledge in the Era of Foundation Models (hazyresearch.stanford.edu)

2 points

2 years ago

40.

An Unserious Take on Axiomatic Knowledge in the Era of Foundation Models (hazyresearch.stanford.edu)

2 points

2 years ago

41.

Linearizing LLMs with LoLCATs (hazyresearch.stanford.edu)

2 points

2 years ago

42.

Efficient language models as arithmetic circuits (hazyresearch.stanford.edu)

2 points

2 years ago

43.

Combining Continuous-Time, Recurrent, and Convolutional Models (hazyresearch.stanford.edu)

2 points

2 years ago

44.

HyenaDNA: Learning from DNA with 1M token context (hazyresearch.stanford.edu)

2 points

3 years ago

45.

Hyena Hierarchy: Towards Larger Convolutional Language Models (hazyresearch.stanford.edu)

2 points

3 years ago

46.

From Deep to Long Learning? (hazyresearch.stanford.edu)

2 points

3 years ago

47.

Based: An Educational and Effective Sequence Mixer (hazyresearch.stanford.edu)

1 point

2 years ago

48.

ThunderKittens 2.0: even faster kernels for your GPUs (hazyresearch.stanford.edu)

1 point

3 months ago

49.

Loads and Loads of Fluffy Kittens (hazyresearch.stanford.edu)

1 point

7 months ago

50.

Intelligence per Watt: A Study of Local Intelligence Efficiency (hazyresearch.stanford.edu)

1 point

7 months ago

51.

Cartridges: Store long contexts in tiny caches with LLM self-study (hazyresearch.stanford.edu)

1 point

a year ago

52.

Mind the Trust Gap: Fast, Private Local-to-Cloud LLM Chat (hazyresearch.stanford.edu)

1 point

a year ago

53.

Correcting and Improving LLM Predictions Without Labels (hazyresearch.stanford.edu)

1 point

3 years ago

54.

FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning (hazyresearch.stanford.edu)

1 point

3 years ago

55.

H3: Language Modeling with State Space Models and (Almost) No Attention (hazyresearch.stanford.edu)

1 point

3 years ago

56.

Hyena Hierarchy: Towards Larger Convolutional Language Models (hazyresearch.stanford.edu)

1 point

3 years ago

57.

Chris Ré: Is AI Rare or Everywhere? (hazyresearch.stanford.edu)

1 point

3 years ago

58.

Hyena Hierarchy: Towards Larger Convolutional Language Models (hazyresearch.stanford.edu)

1 point

3 years ago

59.

Can Longer Sequences Help Take the Next Leap in AI? · Hazy Research (hazyresearch.stanford.edu)

1 point

4 years ago

60.

HiPPO: Recurrent Memory with Optimal Polynomial Projections (hazyresearch.stanford.edu)

1 point

5 years ago