Search: arxiv.org | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

601.

Google’s Neural Machine Translation System (arxiv.org)

236 points

fitzwatermellow

10 years ago

602.

LoRA vs. Full Fine-Tuning: An Illusion of Equivalence (arxiv.org)

236 points

2 years ago

603.

How AI impacts skill formation (arxiv.org)

236 points

4 months ago

604.

Multiplying Matrices Without Multiplying (arxiv.org)

235 points

5 years ago

605.

Knowledge from small number of debates outperforms wisdom of large crowds (2017) (arxiv.org)

235 points

7 years ago

606.

Regularized Newton Method with Global $O(1/k^2)$ Convergence (arxiv.org)

235 points

5 years ago

607.

Scaling Transformers to 1B Tokens (arxiv.org)

234 points

3 years ago

608.

Thermodynamic Linear Algebra (arxiv.org)

234 points

3 years ago

609.

YOLOv4: Optimal Speed and Accuracy of Object Detection (arxiv.org)

234 points

6 years ago

610.

TinyLoRA – Learning to Reason in 13 Parameters (arxiv.org)

234 points

2 months ago

611.

Matrix multiplication using only addition (arxiv.org)

233 points

3 years ago

612.

Self-Compressing Neural Networks (arxiv.org)

233 points

2 years ago

613.

New attention mechanisms that outperform standard multi-head attention (arxiv.org)

233 points

2 years ago

614.

Evaluating AGENTS.md: are they helpful for coding agents? (arxiv.org)

232 points

4 months ago

615.

Catala: A Programming Language for the Law (arxiv.org)

232 points

5 years ago

616.

Language models are injective and hence invertible (arxiv.org)

231 points

7 months ago

617.

xLSTMTime: Long-Term Time Series Forecasting with xLSTM (arxiv.org)

231 points

2 years ago

618.

Training Language Models to Self-Correct via Reinforcement Learning (arxiv.org)

230 points

2 years ago

619.

Beyond sensor data: Foundation models of behavioral data from wearables (arxiv.org)

230 points

10 months ago

620.

Matrix-vector multiplication implemented in off-the-shelf DRAM for Low-Bit LLMs (arxiv.org)

230 points

a year ago

621.

Maximum Flow and Minimum-Cost Flow in Almost-Linear Time (arxiv.org)

230 points

4 years ago

622.

The FedEx Problem (arxiv.org)

229 points

11 years ago

623.

Learning how to think with Meta Chain-of-Thought (arxiv.org)

229 points

a year ago

624.

Enhancement of human color vision by breaking the binocular redundancy (arxiv.org)

229 points

9 years ago

625.

A qualitative analysis of pig-butchering scams (arxiv.org)

227 points

9 months ago

626.

arXiv moving from Cornell servers to Google Cloud (info.arxiv.org)

225 points

a year ago

627.

MemGPT: Towards LLMs as Operating Systems (arxiv.org)

225 points

3 years ago

628.

Integer percentages as fingerprints of electoral falsification (arxiv.org)

225 points

10 years ago

629.

TopoNets: High performing vision and language models with brain-like topography (arxiv.org)

225 points

a year ago

630.

Horcrux: A Password Manager for Paranoids (arxiv.org)

224 points

9 years ago