Heykuki News
Constant 14ms attention: 512→524K tokens (24.5x faster than FlashAttention)
github.com/RegularJoe-CEO
1 point by luxiedge 4 months ago | 1 comment