Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Login
Top
New
Best
Ask
Show
Jobs
781.
▲
LoRA+: Efficient Low Rank Adaptation of Large Models
(arxiv.org)
181 points
veryluckyxyz
2 years ago
47 comments
782.
▲
Sparks of Artificial General Intelligence: Early Experiments with GPT-4
(arxiv.org)
180 points
thinxer
3 years ago
236 comments
783.
▲
Can a transformer represent a Kalman filter?
(arxiv.org)
180 points
bluish29
2 years ago
62 comments
784.
▲
Making Democracy Work: Fixing and Simplifying Egalitarian Paxos
(arxiv.org)
180 points
otrack
7 months ago
56 comments
785.
▲
GPU-Friendly Stroke Expansion
(arxiv.org)
180 points
raphlinus
2 years ago
39 comments
786.
▲
Privacy Loss in Apple's Implementation of Differential Privacy on MacOS 10.12
(arxiv.org)
180 points
sohkamyung
9 years ago
39 comments
787.
▲
TikTag: Breaking ARM's memory tagging extension with speculative execution
(arxiv.org)
180 points
skilled
2 years ago
26 comments
788.
▲
I'm afraid I can't do that: Prompt refusal in generative language models
(arxiv.org)
179 points
belter
3 years ago
166 comments
789.
▲
Some remarks on possible superconductivity of composition Pb9CuP6O25
(arxiv.org)
179 points
rsfern
3 years ago
92 comments
790.
▲
On the design of text editors (2020)
(arxiv.org)
179 points
signa11
2 years ago
86 comments
791.
▲
MM1: Methods, Analysis and Insights from Multimodal LLM Pre-training
(arxiv.org)
179 points
lord_sudo
2 years ago
60 comments
792.
▲
Memorizing Transformers
(arxiv.org)
179 points
silencedogood3
4 years ago
32 comments
793.
▲
Grothendieck’s use of equality
(arxiv.org)
178 points
golol
2 years ago
129 comments
794.
▲
Freaky Leaky SMS: Extracting user locations by analyzing SMS timings
(arxiv.org)
178 points
belter
3 years ago
58 comments
795.
▲
Analyzing Modern Nvidia GPU Cores
(arxiv.org)
178 points
mfiguiere
a year ago
37 comments
796.
▲
A Cookbook of Self-Supervised Learning
(arxiv.org)
178 points
ZunarJ5
3 years ago
22 comments
797.
▲
LoRA Learns Less and Forgets Less
(arxiv.org)
177 points
wolecki
2 years ago
60 comments
798.
▲
LLaVA-O1: Let Vision Language Models Reason Step-by-Step
(arxiv.org)
177 points
lnyan
2 years ago
32 comments
799.
▲
A study on robustness and reliability of large language model code generation
(arxiv.org)
176 points
floridsleeves
3 years ago
215 comments
800.
▲
Every model learned by gradient descent is approximately a kernel machine (2020)
(arxiv.org)
176 points
Anon84
2 years ago
136 comments
801.
▲
The Controlled Natural Language of Randall Munroe's Thing Explainer [pdf]
(arxiv.org)
176 points
tkuhn
10 years ago
91 comments
802.
▲
LLMs can teach themselves to better predict the future
(arxiv.org)
176 points
bturtel
a year ago
86 comments
803.
▲
DensePose from WiFi
(arxiv.org)
176 points
deverton
3 years ago
44 comments
804.
▲
Guide to Machine Learning with Geometric, Topological, and Algebraic Structures
(arxiv.org)
176 points
johmathe
2 years ago
27 comments
805.
▲
PaLI-3 Vision Language Models
(arxiv.org)
176 points
maccaw
3 years ago
23 comments
806.
▲
Post-human mathematics
(arxiv.org)
175 points
subnaught
11 years ago
101 comments
807.
▲
BloombergGPT: A Large Language Model for Finance
(arxiv.org)
175 points
SerCe
3 years ago
47 comments
808.
▲
Critical Behavior from Deep Dynamics: A Hidden Dimension in Natural Language
(arxiv.org)
175 points
hacker42
10 years ago
44 comments
809.
▲
Creativity has left the chat: The price of debiasing language models
(arxiv.org)
174 points
hardmaru
2 years ago
225 comments
810.
▲
How does Docker affect energy consumption?
(arxiv.org)
174 points
rbanffy
9 years ago
86 comments
More