Search: aqrxiv.org | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

631.

TopoNets: High performing vision and language models with brain-like topography (arxiv.org)

225 points

a year ago

632.

Horcrux: A Password Manager for Paranoids (arxiv.org)

224 points

9 years ago

633.

Do transformers need three projections? Systematic study of QKV variants (arxiv.org)

224 points

6 days ago

634.

Neural Network Diffusion (arxiv.org)

223 points

2 years ago

635.

Sending a Spacecraft to the Interstellar Asteroid (arxiv.org)

223 points

9 years ago

636.

Oxide: A Formal Semantics for Rust (arxiv.org)

223 points

7 years ago

637.

Solving a million-step LLM task with zero errors (arxiv.org)

222 points

7 months ago

638.

The Principles of Deep Learning Theory (arxiv.org)

221 points

4 years ago

639.

Dissecting Ponzi schemes on Ethereum: identification, analysis, and impact (arxiv.org)

221 points

9 years ago

640.

Tutorial on diffusion models for imaging and vision (arxiv.org)

221 points

2 years ago

641.

Toolformer: Language Models Can Teach Themselves to Use Tools (arxiv.org)

220 points

3 years ago

642.

Mathematics of Deep Learning [pdf] (arxiv.org)

220 points

8 years ago

643.

Wikidata, with 12B facts, can ground LLMs to improve their factuality (arxiv.org)

219 points

3 years ago

644.

Foundations of Large Language Models (arxiv.org)

219 points

a year ago

645.

Self-Normalizing Neural Networks (arxiv.org)

219 points

9 years ago

646.

How real are real numbers? (2004) (arxiv.org)

218 points

9 years ago

647.

Reasoning models reason well, until they don't (arxiv.org)

218 points

7 months ago

648.

Accidentally quadratic: When Python is faster than C++ (arxiv.org)

218 points

5 years ago

649.

TinyStories: How Small Can Language Models Be and Still Speak Coherent English? (2023) (arxiv.org)

218 points

a year ago

650.

Mathematical methods and human thought in the age of AI (arxiv.org)

218 points

2 months ago

651.

Why Does It Take So Long to Connect to a WiFi Access Point? (arxiv.org)

218 points

9 years ago

652.

Stealing Part of a Production Language Model (arxiv.org)

218 points

2 years ago

653.

Professional software developers don't vibe, they control (arxiv.org)

217 points

5 months ago

654.

Next-Paradigm Programming Languages: What Will They Look Like? (arxiv.org)

217 points

7 years ago

655.

Comparing humans, GPT-4, and GPT-4V on abstraction and reasoning tasks (arxiv.org)

217 points

3 years ago

656.

How to fit any dataset with a single parameter (arxiv.org)

217 points

5 years ago

657.

Gravitational Machines (arxiv.org)

217 points

3 years ago

658.

Mexican Computers: A Brief Technical and Historical Overview (arxiv.org)

217 points

2 years ago

659.

Is artificial consciousness achievable? Lessons from the human brain (arxiv.org)

216 points

2 years ago

660.

Regularization is all you need: simple neural nets can excel on tabular data (arxiv.org)

216 points

5 years ago