Search: gilesthomas.com | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

31.

Writing an LLM from scratch, part 12 – multi-head attention (gilesthomas.com)

3 points

a year ago

32.

Getting MathML to render properly in Chrome-based browsers (gilesthomas.com)

3 points

a year ago

33.

How Do LLMs Work? (gilesthomas.com)

2 points

9 months ago

34.

Jax Back Ends and Devices (gilesthomas.com)

2 points

a day ago

35.

Using Safetensors with Flax (gilesthomas.com)

2 points

2 days ago

36.

First Looking into Jax (gilesthomas.com)

2 points

5 days ago

37.

10Gb Ethernet: what I had to (re)learn (gilesthomas.com)

2 points

a month ago

38.

LLM from scratch, part 32k – Interventions: gradient accumulation (gilesthomas.com)

2 points

2 months ago

39.

LLM from scratch, part 32j – trying to train a better model in the cloud (gilesthomas.com)

2 points

2 months ago

40.

Writing an LLM from scratch, part 32i – Interventions: what is in the noise? (gilesthomas.com)

2 points

2 months ago

41.

Writing an LLM from scratch, part 32h – Interventions: full fat float32 (gilesthomas.com)

2 points

2 months ago

42.

Automating starting Lambda Labs instances (gilesthomas.com)

2 points

2 months ago

43.

Writing an LLM from scratch, part 32g – Interventions: weight tying (gilesthomas.com)

2 points

2 months ago

44.

Writing an LLM from scratch, part 32g – Interventions: weight tying (gilesthomas.com)

2 points

2 months ago

45.

Writing an LLM from scratch, part 32B – Interventions: gradient clipping (gilesthomas.com)

2 points

4 months ago

46.

Writing an LLM from scratch, part 31 – the models are now on Hugging Face (gilesthomas.com)

2 points

5 months ago

47.

LLM from scratch, part 29 – using DDP to train a base model in the cloud (gilesthomas.com)

2 points

5 months ago

48.

Why smart instruction-following makes prompt injection easier (gilesthomas.com)

2 points

7 months ago

49.

Writing an LLM from scratch, part 25 – instruction fine-tuning (gilesthomas.com)

2 points

7 months ago

50.

Revisiting Karpathy's 'Unreasonable Effectiveness of Recurrent Neural Networks' (gilesthomas.com)

2 points

8 months ago

51.

What AI chatbots are doing under the hood (gilesthomas.com)

2 points

9 months ago

52.

LLM from scratch, part 18 – residuals, shortcut connections, and the Talmud (gilesthomas.com)

2 points

10 months ago

53.

Writing an LLM from scratch, part 11 – batches (gilesthomas.com)

2 points

a year ago

54.

LLM Quantisation Weirdness (gilesthomas.com)

2 points

2 years ago

55.

Fun with Google Books Ngram Viewer and the long S (gilesthomas.com)

2 points

15 years ago

56.

How to bet on the bubble? (with list of 2010/11 YC startup hosting providers) (gilesthomas.com)

1 point

15 years ago

57.

How many python programmers are there in the World today? (gilesthomas.com)

1 point

lifeisstillgood

12 years ago

58.

SNI-based Reverse Proxying for SSL connections (gilesthomas.com)

1 point

13 years ago

59.

10Gb Ethernet: what I had to (re)learn (gilesthomas.com)

1 point

a month ago

60.

Do reasoning LLMs need their own Philosophical Language? (gilesthomas.com)

1 point

a year ago