this post was submitted on 06 Feb 2025
24 points (76.1% liked)

Futurology

3090 readers
35 users here now

founded 2 years ago
MODERATORS
all 8 comments
sorted by: hot top controversial new old
[–] pennomi@lemmy.world 11 points 5 months ago (1 children)

Highly misleading. They finetuned an existing model using a different existing model in a process called distillation.

The article is effectively saying “our model only cost $50 to make, plus whatever tens or hundreds of millions of dollars the models we stole from cost.”

[–] MrPoopyButthole@lemmy.dbzer0.com 4 points 5 months ago

That's rookie numbers I trained one in 1min with $1!

[–] ianhclark510@lemmy.blahaj.zone 3 points 5 months ago

How long before congress bans this one too

[–] cyd@lemmy.world 2 points 5 months ago* (last edited 5 months ago)

The underlying research story is interesting, but the way it's written up actively makes it worse.

The researchers based s1 on Qwen2.5, an open-source model from Alibaba Cloud.

Watch me create a racing car for less than $50. Step 1: start with a Mercedes F1 racer...

[–] PhilipTheBucket@ponder.cat 2 points 5 months ago

Trained it to do very basic arithmetic tasks, not to rival OpenAI.