Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
661.
Regularization is all you need: simple neural nets can excel on tabular data (arxiv.org)
216 points
tracyhenry
5 years ago
93 comments
662.
The Bloom Clock (arxiv.org)
216 points
g0xA52A2A
7 years ago
43 comments
663.
Julia: A fresh approach to numerical computing (arxiv.org)
215 points
sveme
12 years ago
138 comments
664.
A Dyson sphere around a black hole (arxiv.org)
214 points
SkyMarshal
5 years ago
231 comments
665.
Reasoning in Large Language Models: A Geometric Perspective (arxiv.org)
214 points
belter
2 years ago
171 comments
666.
Markets are efficient if and only if P=NP (2010) (arxiv.org)
214 points
DarkContinent
6 years ago
169 comments
667.
Fastai: A Layered API for Deep Learning (arxiv.org)
214 points
pama
6 years ago
42 comments
668.
Harmony Explained: Progress Towards a Scientific Theory of Music (arxiv.org)
213 points
colund
10 years ago
148 comments
669.
Shifts in U.S. Social Media Use, 2020–2024: Decline, Fragmentation, Polarization (2025) (arxiv.org)
212 points
vinnyglennon
4 months ago
209 comments
670.
A Study of 4chan’s Politically Incorrect Forum and its Effect on the Web (arxiv.org)
212 points
ivarious
10 years ago
147 comments
671.
A sleep-like consolidation mechanism for LLMs (arxiv.org)
212 points
juxtapose
15 days ago
140 comments
672.
Why do tree-based models still outperform deep learning on tabular data? (2022) (arxiv.org)
212 points
tosh
2 years ago
111 comments
673.
Large Language Models for Compiler Optimization (arxiv.org)
212 points
famouswaffles
3 years ago
111 comments
674.
Single Headed Attention RNN (arxiv.org)
212 points
spatters
7 years ago
37 comments
675.
Facebook Use of Sensitive Data for Advertising in Europe [pdf] (arxiv.org)
212 points
HugoDaniel
8 years ago
30 comments
676.
The Ultraviolet Myth (arxiv.org)
211 points
Luc
2 years ago
88 comments
677.
MLIR: A Compiler Infrastructure for the End of Moore's Law (arxiv.org)
211 points
xiaodai
6 years ago
68 comments
678.
SparseGPT: Language Models Can Be Accurately Pruned in One-Shot (arxiv.org)
211 points
tosh
3 years ago
62 comments
679.
Searching the Internet for evidence of time travelers (arxiv.org)
210 points
ColinWright
12 years ago
147 comments
680.
Is Cosine-Similarity of Embeddings Really About Similarity? (arxiv.org)
210 points
Jimmc414
2 years ago
115 comments
681.
Failures of Deep Learning (arxiv.org)
210 points
stochastician
9 years ago
44 comments
682.
A neural network solves and generates mathematics problems by program synthesis (arxiv.org)
209 points
geox
4 years ago
86 comments
683.
Emergent Gravity and the Dark Universe (arxiv.org)
209 points
mrreelmo
10 years ago
80 comments
684.
Diffusion Models Beat GANs on Image Synthesis (arxiv.org)
209 points
lnyan
5 years ago
50 comments
685.
Refusal in language models is mediated by a single direction (arxiv.org)
209 points
Tomte
2 years ago
44 comments
686.
An Introduction to Probabilistic Programming (arxiv.org)
209 points
homarp
5 years ago
41 comments
687.
Proof of Work Without All the Work (arxiv.org)
208 points
federicoponzi
9 years ago
115 comments
688.
Complexity Theory, Game Theory, and Economics (arxiv.org)
208 points
lainon
8 years ago
28 comments
689.
Diffusion Training from Scratch on a Micro-Budget (arxiv.org)
208 points
fzliu
2 years ago
27 comments
690.
Transformers Can Do Arithmetic with the Right Embeddings (arxiv.org)
207 points
byt3h3ad
2 years ago
211 comments
More