Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
481.
MegaTrain: Full Precision Training of 100B+ Parameter LLMs on a Single GPU (arxiv.org)
326 points
chrsw
2 months ago
57 comments
482.
Impact of a Night of Sleep Deprivation on Novice Developers’ Performance (2018) (arxiv.org)
324 points
gyre007
7 years ago
206 comments
483.
AbsenceBench: Language models can't tell what's missing (arxiv.org)
324 points
JnBrymn
a year ago
84 comments
484.
Huawei releases an open weight model trained on Huawei Ascend GPUs (arxiv.org)
321 points
buyucu
a year ago
333 comments
485.
Explainable Deep Learning: A Field Guide for the Uninitiated (arxiv.org)
321 points
BERTHart
6 years ago
25 comments
486.
Statistical Analysis shows Echos process voice to serve ads (arxiv.org)
318 points
BeniBoy
4 years ago
118 comments
487.
StarCoder and StarCoderBase: 15.5B parameter models with 8K context length (arxiv.org)
317 points
belter
3 years ago
162 comments
488.
Why do tree-based models still outperform deep learning on tabular data? (arxiv.org)
315 points
isolli
4 years ago
139 comments
489.
QLoRA: Efficient Finetuning of Quantized LLMs (arxiv.org)
315 points
Garcia98
3 years ago
107 comments
490.
QUIC is not quick enough over fast internet (arxiv.org)
313 points
carlos-menezes
2 years ago
280 comments
491.
Beyond A*: Better Planning with Transformers (arxiv.org)
313 points
jonbaer
2 years ago
120 comments
492.
SATAn: Air-Gap Exfiltration Attack via Radio Signals from SATA Cables (arxiv.org)
312 points
PaulHoule
4 years ago
122 comments
493.
Orca 2: Teaching Small Language Models How to Reason (arxiv.org)
310 points
fgfm
3 years ago
80 comments
494.
What if an SQL statement returned a database? (arxiv.org)
309 points
matt_d
2 years ago
159 comments
495.
Hallucination is inevitable: An innate limitation of large language models (arxiv.org)
308 points
louthy
2 years ago
474 comments
496.
Fitting an elephant with four non-zero parameters (arxiv.org)
307 points
belter
2 years ago
147 comments
497.
Adversarial policies beat superhuman Go AIs (2023) (arxiv.org)
306 points
amichail
a year ago
139 comments
498.
“I’ll Finish It This Week” and Other Lies (arxiv.org)
306 points
lnwlebjel
5 years ago
115 comments
499.
Infinite Photorealistic Worlds Using Procedural Generation (arxiv.org)
306 points
cpeterso
3 years ago
76 comments
500.
A definition of AGI (arxiv.org)
305 points
pegasus
7 months ago
514 comments
501.
Chameleon: Meta’s New Multi-Modal LLM (arxiv.org)
304 points
gabrielbirnbaum
2 years ago
40 comments
502.
LLMs should not replace therapists (arxiv.org)
303 points
layer8
a year ago
417 comments
503.
Jewish problems (arxiv.org)
303 points
cal2
15 years ago
187 comments
504.
Better and Faster Large Language Models via Multi-Token Prediction (arxiv.org)
302 points
jasondavies
2 years ago
128 comments
505.
Automated Unit Test Improvement Using Large Language Models at Meta (arxiv.org)
301 points
mfiguiere
2 years ago
188 comments
506.
Exponentially faster language modelling (arxiv.org)
301 points
born-jre
3 years ago
137 comments
507.
Zip Trees (arxiv.org)
301 points
federicoponzi
8 years ago
36 comments
508.
Best Practices for Applying Deep Learning to Novel Applications (arxiv.org)
300 points
mindcrime
9 years ago
17 comments
509.
PowerHammer: Exfiltrating Data from Air-Gapped Computers Through Power Lines (arxiv.org)
298 points
wglb
8 years ago
85 comments
510.
Deterministic Fully-Static Whole-Binary Translation Without Heuristics (arxiv.org)
298 points
matt_d
a month ago
65 comments
More