Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
481.
▲
AbsenceBench: Language models can't tell what's missing
(arxiv.org)
324 points
JnBrymn
a year ago
84 comments
482.
▲
Huawei releases an open weight model trained on Huawei Ascend GPUs
(arxiv.org)
321 points
buyucu
a year ago
333 comments
483.
▲
Explainable Deep Learning: A Field Guide for the Uninitiated
(arxiv.org)
321 points
BERTHart
6 years ago
25 comments
484.
▲
Statistical Analysis shows Echos process voice to serve ads
(arxiv.org)
318 points
BeniBoy
4 years ago
118 comments
485.
▲
StarCoder and StarCoderBase: 15.5B parameter models with 8K context length
(arxiv.org)
317 points
belter
3 years ago
162 comments
486.
▲
Why do tree-based models still outperform deep learning on tabular data?
(arxiv.org)
315 points
isolli
4 years ago
139 comments
487.
▲
QLoRA: Efficient Finetuning of Quantized LLMs
(arxiv.org)
315 points
Garcia98
3 years ago
107 comments
488.
▲
QUIC is not quick enough over fast internet
(arxiv.org)
313 points
carlos-menezes
2 years ago
280 comments
489.
▲
Beyond A*: Better Planning with Transformers
(arxiv.org)
313 points
jonbaer
2 years ago
120 comments
490.
▲
SATAn: Air-Gap Exfiltration Attack via Radio Signals from SATA Cables
(arxiv.org)
312 points
PaulHoule
4 years ago
122 comments
491.
▲
Orca 2: Teaching Small Language Models How to Reason
(arxiv.org)
310 points
fgfm
3 years ago
80 comments
492.
▲
What if an SQL statement returned a database?
(arxiv.org)
309 points
matt_d
2 years ago
159 comments
493.
▲
Hallucination is inevitable: An innate limitation of large language models
(arxiv.org)
308 points
louthy
2 years ago
474 comments
494.
▲
Fitting an elephant with four non-zero parameters
(arxiv.org)
307 points
belter
2 years ago
147 comments
495.
▲
Adversarial policies beat superhuman Go AIs (2023)
(arxiv.org)
306 points
amichail
a year ago
139 comments
496.
▲
“I’ll Finish It This Week” and Other Lies
(arxiv.org)
306 points
lnwlebjel
5 years ago
115 comments
497.
▲
Infinite Photorealistic Worlds Using Procedural Generation
(arxiv.org)
306 points
cpeterso
3 years ago
76 comments
498.
▲
A definition of AGI
(arxiv.org)
305 points
pegasus
7 months ago
514 comments
499.
▲
Chameleon: Meta’s New Multi-Modal LLM
(arxiv.org)
304 points
gabrielbirnbaum
2 years ago
40 comments
500.
▲
LLMs should not replace therapists
(arxiv.org)
303 points
layer8
a year ago
417 comments
501.
▲
Jewish problems
(arxiv.org)
303 points
cal2
15 years ago
187 comments
502.
▲
Better and Faster Large Language Models via Multi-Token Prediction
(arxiv.org)
302 points
jasondavies
2 years ago
128 comments
503.
▲
Automated Unit Test Improvement Using Large Language Models at Meta
(arxiv.org)
301 points
mfiguiere
2 years ago
188 comments
504.
▲
Exponentially faster language modelling
(arxiv.org)
301 points
born-jre
3 years ago
137 comments
505.
▲
Zip Trees
(arxiv.org)
301 points
federicoponzi
8 years ago
36 comments
506.
▲
Best Practices for Applying Deep Learning to Novel Applications
(arxiv.org)
300 points
mindcrime
9 years ago
17 comments
507.
▲
PowerHammer: Exfiltrating Data from Air-Gapped Computers Through Power Lines
(arxiv.org)
298 points
wglb
8 years ago
85 comments
508.
▲
Deterministic Fully-Static Whole-Binary Translation Without Heuristics
(arxiv.org)
298 points
matt_d
a month ago
65 comments
509.
▲
Information Theory: A Tutorial Introduction
(arxiv.org)
297 points
teleforce
5 years ago
26 comments
510.
▲
Why do random forests work? They are self-regularizing adaptive smoothers
(arxiv.org)
295 points
sebg
2 years ago
41 comments
More