Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
481.
AbsenceBench: Language models can't tell what's missing (arxiv.org)
324 points
JnBrymn
a year ago
84 comments
482.
Huawei releases an open weight model trained on Huawei Ascend GPUs (arxiv.org)
321 points
buyucu
a year ago
333 comments
483.
Explainable Deep Learning: A Field Guide for the Uninitiated (arxiv.org)
321 points
BERTHart
6 years ago
25 comments
484.
Statistical Analysis shows Echos process voice to serve ads (arxiv.org)
318 points
BeniBoy
4 years ago
118 comments
485.
StarCoder and StarCoderBase: 15.5B parameter models with 8K context length (arxiv.org)
317 points
belter
3 years ago
162 comments
486.
Why do tree-based models still outperform deep learning on tabular data? (arxiv.org)
315 points
isolli
4 years ago
139 comments
487.
QLoRA: Efficient Finetuning of Quantized LLMs (arxiv.org)
315 points
Garcia98
3 years ago
107 comments
488.
QUIC is not quick enough over fast internet (arxiv.org)
313 points
carlos-menezes
2 years ago
280 comments
489.
Beyond A*: Better Planning with Transformers (arxiv.org)
313 points
jonbaer
2 years ago
120 comments
490.
SATAn: Air-Gap Exfiltration Attack via Radio Signals from SATA Cables (arxiv.org)
312 points
PaulHoule
4 years ago
122 comments
491.
Orca 2: Teaching Small Language Models How to Reason (arxiv.org)
310 points
fgfm
3 years ago
80 comments
492.
What if an SQL statement returned a database? (arxiv.org)
309 points
matt_d
2 years ago
159 comments
493.
Hallucination is inevitable: An innate limitation of large language models (arxiv.org)
308 points
louthy
2 years ago
474 comments
494.
Fitting an elephant with four non-zero parameters (arxiv.org)
307 points
belter
2 years ago
147 comments
495.
Adversarial policies beat superhuman Go AIs (2023) (arxiv.org)
306 points
amichail
a year ago
139 comments
496.
“I’ll Finish It This Week” and Other Lies (arxiv.org)
306 points
lnwlebjel
5 years ago
115 comments
497.
Infinite Photorealistic Worlds Using Procedural Generation (arxiv.org)
306 points
cpeterso
3 years ago
76 comments
498.
A definition of AGI (arxiv.org)
305 points
pegasus
7 months ago
514 comments
499.
Chameleon: Meta’s New Multi-Modal LLM (arxiv.org)
304 points
gabrielbirnbaum
2 years ago
40 comments
500.
LLMs should not replace therapists (arxiv.org)
303 points
layer8
a year ago
417 comments
501.
Jewish problems (arxiv.org)
303 points
cal2
15 years ago
187 comments
502.
Better and Faster Large Language Models via Multi-Token Prediction (arxiv.org)
302 points
jasondavies
2 years ago
128 comments
503.
Automated Unit Test Improvement Using Large Language Models at Meta (arxiv.org)
301 points
mfiguiere
2 years ago
188 comments
504.
Exponentially faster language modelling (arxiv.org)
301 points
born-jre
3 years ago
137 comments
505.
Zip Trees (arxiv.org)
301 points
federicoponzi
8 years ago
36 comments
506.
Best Practices for Applying Deep Learning to Novel Applications (arxiv.org)
300 points
mindcrime
9 years ago
17 comments
507.
PowerHammer: Exfiltrating Data from Air-Gapped Computers Through Power Lines (arxiv.org)
298 points
wglb
8 years ago
85 comments
508.
Deterministic Fully-Static Whole-Binary Translation Without Heuristics (arxiv.org)
298 points
matt_d
a month ago
65 comments
509.
Information Theory: A Tutorial Introduction (arxiv.org)
297 points
teleforce
5 years ago
26 comments
510.
Why do random forests work? They are self-regularizing adaptive smoothers (arxiv.org)
295 points
sebg
2 years ago
41 comments
More