FFmpeg 8 can subtitle your videos on the fly with Whisper (www.theregister.com)

submitted 15 hours ago by along_the_road@beehaw.org to c/technology@beehaw.org

LukeZaz@beehaw.org -1 points 8 hours ago* (last edited 8 hours ago)

> The changelog lists 30 significant changes, of which the top new feature is integrating Whisper. This means whisper.cpp, which is Georgi Gerganov's entirely local and offline version of OpenAI's Whisper automatic speech recognition model. The bottom line is that FFmpeg can now automatically subtitle videos for you.

Yeah hey, can anyone chime in if this is at all based off LLMs? Because my problems with the incorrect plagiarism machine don't end just because it's now the offline incorrect plagiarism machine. Making OpenAI's garbage hockey open source doesn't make it okay. Or should I just start calling this shit FOSSwashing?

I dug around for a bit and couldn't find much of anything, but judging by a look at the GitHub pages for both versions of Whisper, it looks closely related. If that's the case, fuck right off. I don't want AI in FFmpeg, either.

kayohtie@pawb.social 9 points 7 hours ago* (last edited 7 hours ago)

It's not AI, it's neural network models, the same way voice recognition in devices has worked for over a decade. Even Dragon has been using language model vectors for a very long time; it just required voice training instead of relying on a premade research or open-source data set.

I hate generative AI and its slop too, but getting angry about neural network models in general is not only absurd, it also plays exactly into what corporations want -- conflation of the underlying basic technology concepts with the capitalistic vampirism of art.

EDIT: to add, "research" here can be closed source -- many of the voice models used with these tend to be internally sourced, or at least the earlier ones were.

drosophila@lemmy.blahaj.zone 2 points 2 hours ago

> It’s not AI, it’s neural network models

These used to be called AI before people decided that only LLMs and diffusion models were AI, both of which are types of neural networks.