Heykuki News
Top
New
Best
Ask
Show
Jobs
601.
Google’s Neural Machine Translation System
(arxiv.org)
236 points
fitzwatermellow
10 years ago
97 comments
602.
LoRA vs. Full Fine-Tuning: An Illusion of Equivalence
(arxiv.org)
236 points
timbilt
2 years ago
53 comments
603.
How AI impacts skill formation
(arxiv.org)
236 points
northfield27
4 months ago
5 comments
604.
Multiplying Matrices Without Multiplying
(arxiv.org)
235 points
moinnadeem
5 years ago
122 comments
605.
Knowledge from small number of debates outperforms wisdom of large crowds (2017)
(arxiv.org)
235 points
Dowwie
7 years ago
116 comments
606.
Regularized Newton Method with Global $O(1/k^2)$ Convergence
(arxiv.org)
235 points
ColinWright
5 years ago
103 comments
607.
Scaling Transformers to 1B Tokens
(arxiv.org)
234 points
mottiden
3 years ago
68 comments
608.
Thermodynamic Linear Algebra
(arxiv.org)
234 points
aifer4
3 years ago
55 comments
609.
YOLOv4: Optimal Speed and Accuracy of Object Detection
(arxiv.org)
234 points
groar
6 years ago
54 comments
610.
TinyLoRA – Learning to Reason in 13 Parameters
(arxiv.org)
234 points
sorenjan
2 months ago
45 comments
611.
Matrix multiplication using only addition
(arxiv.org)
233 points
daniel-cussen
3 years ago
108 comments
612.
Self-Compressing Neural Networks
(arxiv.org)
233 points
bilsbie
2 years ago
57 comments
613.
New attention mechanisms that outperform standard multi-head attention
(arxiv.org)
233 points
snats
2 years ago
49 comments
614.
Evaluating AGENTS.md: are they helpful for coding agents?
(arxiv.org)
232 points
mustaphah
4 months ago
161 comments
615.
Catala: A Programming Language for the Law
(arxiv.org)
232 points
todsacerdoti
5 years ago
126 comments
616.
Language models are injective and hence invertible
(arxiv.org)
231 points
mazsa
7 months ago
148 comments
617.
xLSTMTime: Long-Term Time Series Forecasting with xLSTM
(arxiv.org)
231 points
beefman
2 years ago
53 comments
618.
Training Language Models to Self-Correct via Reinforcement Learning
(arxiv.org)
230 points
weirdcat
2 years ago
92 comments
619.
Beyond sensor data: Foundation models of behavioral data from wearables
(arxiv.org)
230 points
brandonb
10 months ago
54 comments
620.
Matrix-vector multiplication implemented in off-the-shelf DRAM for Low-Bit LLMs
(arxiv.org)
230 points
cpldcpu
a year ago
53 comments
621.
Maximum Flow and Minimum-Cost Flow in Almost-Linear Time
(arxiv.org)
230 points
tarxzvf
4 years ago
40 comments
622.
The FedEx Problem
(arxiv.org)
229 points
subnaught
11 years ago
98 comments
623.
Learning how to think with Meta Chain-of-Thought
(arxiv.org)
229 points
drcwpl
a year ago
75 comments
624.
Enhancement of human color vision by breaking the binocular redundancy
(arxiv.org)
229 points
mxfh
9 years ago
56 comments
625.
A qualitative analysis of pig-butchering scams
(arxiv.org)
227 points
stmw
9 months ago
147 comments
626.
arXiv moving from Cornell servers to Google Cloud
(info.arxiv.org)
225 points
ColinWright
a year ago
162 comments
627.
MemGPT: Towards LLMs as Operating Systems
(arxiv.org)
225 points
belter
3 years ago
106 comments
628.
Integer percentages as fingerprints of electoral falsification
(arxiv.org)
225 points
merraksh
10 years ago
70 comments
629.
TopoNets: High performing vision and language models with brain-like topography
(arxiv.org)
225 points
mayukhdeb
a year ago
68 comments
630.
Horcrux: A Password Manager for Paranoids
(arxiv.org)
224 points
lainon
9 years ago
165 comments