genxy

Born on February 06, 2026•364 Karma

11 hours ago

Even for the web, Rust is a great language out of LLMs. It was quite surprising given the early performance of Python that Rust does so well. It really speaks to the high dimensional generality of transformer translation models.

genxy•

11 hours ago

•on: Private equity bought America's essential services

I don't know why the University of Washington isn't on the list of public institutions, it should be number 6 with 9.4B in assets.

https://www.uwinco.uw.edu/

genxy•

1 day ago

•on: Xiaomi MiMo Token Plan is Now Globally Available

This is because the users are training the product. They need training data, so they sell inference at the price of power.

gruez•

1 day ago

>API Services . If you use the API services, we will collect your IP address and the content (text, audio, video, picture) you submit to analyze the relevant instructions based on the model you select and to generate the returned content. Xiaomi will not use the content you provide for model training or any other purposes.

https://privacy.mi.com/XiaomiMiMoPlatform/en_GB/

koteelok•

1 day ago

Chinese corporation would never lie

colechristensen•

1 day ago

And what legal recourse do you have if they don't follow those rules?

windexh8er•

1 day ago

You have no recourse in the US, either. Trust no one is the only path given all of the training data is stolen in the first place.

It will come to light that one or many of the Frontier providers held the data, changed ToS and trained later minimally. But I think they just don't care and will train regardless. None of them abide by any level of ethics that would actually prevent them from leveraging an opportunity.

Tiberium•

1 day ago

ChatGPT (the setting is shared with Codex) and Claude (shared with Claude Code) also have sharing enabled by default, so why aren't they cheaper?

Springtime•

1 day ago

There's evidence various third-party models (including Deepseek) used distilling in training, based on models from those leading services. So they have more flexibility with pricing.

malnourish•

23 hours ago

Is that fundamentally any different than what e.g., Meta and OpenAI have done?

Besides, hasn't SCotUS ruled that raw LLM output isn't subject to copyright? So these companies would be breaking a ToS at worst.

behnamoh•

1 day ago

So? And Anthropic/OpenAI literally stole copyrighted content to train their models.

Springtime•

23 hours ago

The point was that distilling based on others' models for training means they're not spending the same amount on R&D and/or training, giving them headroom in other ways (responding to the parent's point). It wasn't a comment reflecting on copyright/fair use.

behnamoh•

23 hours ago

In the same fashion, Anthropic/OpenAI also reduced their training cost by not purchasing the license to copyrighted work and stealing it instead.

koteelok•

1 day ago

They are? They give away thousands of dollars via subs.

camelmel•

1 day ago

Is this training data even valuable? Usually AI data annotators get paid to write LLM responses, but here all they'd be getting is a bunch of user queries.

VerTiGo_Etrex•

1 day ago

1. Feed the same queries into Claude 2. Train on the Claude responses 3. ??? 4. Profit

This has been the strategy for months now

genxy•

1 day ago

•on: A sleep-like consolidation mechanism for LLMs

It is a descriptive analogy, get over yourself.

IAmGraydon•

1 day ago

An intelligent reply from an obviously intelligent guy!

A more appropriate title would have been something like "Offline Recurrent Memory Consolidation for Long-Context Language Models". This is supposed to be a research paper, not a story book. The title should give context to other researchers, and not be clearly engineered for clicks. If you don't think so, that's your prerogative, but you're objectively wrong.

genxy•

1 day ago

You write the paper, you write the title. So much anger over a title, you are graydon, make this about yourself.

genxy•

1 day ago

•on: A sleep-like consolidation mechanism for LLMs

Please re-read up to the end of page 2 and then re-ask this question.

genxy•

1 day ago

•on: Dropbox CEO Drew Houston to step down

iCloud is just rebadged GCS.

bhouston•

1 day ago

GCS and AWS are not competitors with iCloud, DropBox, Box, Drive, OneDrive since they are just raw APIs and storage and not a user facing product.

It is similar to saying that most websites are just cloud-hosted SQL rebranded.

dzonga•

1 day ago

iCloud actually runs on FoundationDB - probably one of most underrated DB engines out there.

you can build object storage on FoundationDB + other awesome bespoke stuff.

genxy•

1 day ago

It might use FoundationDB, but it is certainly storing those bytes on GCS.

genxy•

1 day ago

•on: Dropbox CEO Drew Houston to step down

How much are you willing to pay for this service? Ballpark. And what is your ratio of data at rest vs data you want shared? Are you ok with your permanent copy being local?

genxy•

1 day ago

•on: Dropbox CEO Drew Houston to step down

When you get stuck in a task like this, you realize that civilization will collapse with a whimper.

genxy•

2 days ago

•on: Bytecode VMs in surprising places (2024)

The network is the computer

genxy•

2 days ago

•on: Bytecode VMs in surprising places (2024)

Linux running in a shader https://blog.pimaker.at/texts/rvc1/

mastermage•

1 day ago

thats crazy