Heykuki News

601.
Google’s Neural Machine Translation System (arxiv.org)
236 points
fitzwatermellow
10 years ago
97 comments
602.
LoRA vs. Full Fine-Tuning: An Illusion of Equivalence (arxiv.org)
236 points
timbilt
2 years ago
53 comments
603.
How AI impacts skill formation (arxiv.org)
236 points
northfield27
4 months ago
5 comments
604.
Multiplying Matrices Without Multiplying (arxiv.org)
235 points
moinnadeem
5 years ago
122 comments
605.
Knowledge from small number of debates outperforms wisdom of large crowds (2017) (arxiv.org)
235 points
Dowwie
7 years ago
116 comments
606.
Regularized Newton Method with Global $O(1/k^2)$ Convergence (arxiv.org)
235 points
ColinWright
5 years ago
103 comments
607.
Scaling Transformers to 1B Tokens (arxiv.org)
234 points
mottiden
3 years ago
68 comments
608.
Thermodynamic Linear Algebra (arxiv.org)
234 points
aifer4
3 years ago
55 comments
609.
YOLOv4: Optimal Speed and Accuracy of Object Detection (arxiv.org)
234 points
groar
6 years ago
54 comments
610.
TinyLoRA – Learning to Reason in 13 Parameters (arxiv.org)
234 points
sorenjan
2 months ago
45 comments
611.
Matrix multiplication using only addition (arxiv.org)
233 points
daniel-cussen
3 years ago
108 comments
612.
Self-Compressing Neural Networks (arxiv.org)
233 points
bilsbie
2 years ago
57 comments
613.
New attention mechanisms that outperform standard multi-head attention (arxiv.org)
233 points
snats
2 years ago
49 comments
614.
Evaluating AGENTS.md: are they helpful for coding agents? (arxiv.org)
232 points
mustaphah
4 months ago
161 comments
615.
Catala: A Programming Language for the Law (arxiv.org)
232 points
todsacerdoti
5 years ago
126 comments
616.
Language models are injective and hence invertible (arxiv.org)
231 points
mazsa
7 months ago
148 comments
617.
XLSTMTime: Long-Term Time Series Forecasting with xLSTM (arxiv.org)
231 points
beefman
2 years ago
53 comments
618.
Training Language Models to Self-Correct via Reinforcement Learning (arxiv.org)
230 points
weirdcat
2 years ago
92 comments
619.
Beyond sensor data: Foundation models of behavioral data from wearables (arxiv.org)
230 points
brandonb
10 months ago
54 comments
620.
Matrix-vector multiplication implemented in off-the-shelf DRAM for Low-Bit LLMs (arxiv.org)
230 points
cpldcpu
a year ago
53 comments
621.
Maximum Flow and Minimum-Cost Flow in Almost-Linear Time (arxiv.org)
230 points
tarxzvf
4 years ago
40 comments
622.
The FedEx Problem (arxiv.org)
229 points
subnaught
11 years ago
98 comments
623.
Learning how to think with Meta Chain-of-Thought (arxiv.org)
229 points
drcwpl
a year ago
75 comments
624.
Enhancement of human color vision by breaking the binocular redundancy (arxiv.org)
229 points
mxfh
9 years ago
56 comments
625.
A qualitative analysis of pig-butchering scams (arxiv.org)
227 points
stmw
9 months ago
147 comments
626.
arXiv moving from Cornell servers to Google Cloud (info.arxiv.org)
225 points
ColinWright
a year ago
162 comments
627.
MemGPT: Towards LLMs as Operating Systems (arxiv.org)
225 points
belter
3 years ago
106 comments
628.
Integer percentages as fingerprints of electoral falsification (arxiv.org)
225 points
merraksh
10 years ago
70 comments
629.
TopoNets: High performing vision and language models with brain-like topography (arxiv.org)
225 points
mayukhdeb
a year ago
68 comments
630.
Horcrux: A Password Manager for Paranoids (arxiv.org)
224 points
lainon
9 years ago
165 comments