Interesting. That would be a nice extension, I think most small admins are using the filesystem (I know I am lol).
No. Unfortunately it only works with storages on object storages like S3 buckets, not with filesystem storages. Meaning it access the files remotely one at a time from the bucket, downloading them over the internet (I assume, I didn't make this).
But the more important thing is that, as it states in the readme, no files get saved to your disk, they only stay in your RAM while they are being processed and everything is deleted right after. This is relevant because even having had CSAM on your disk at some point can put you in trouble in some countries, with this tool it never happens.
Which btw is the same reason why mounting the pict-rs folder to your local computer is probably not a good idea.
Yeah I know. It's supposed to be ran from your computer, not the VPS.
Same, it's the reason why I can't stand working with python.
Thank you for doing this, btw. Once you have something working on your hands you could consider spreading the word, maybe to db0 himself. I sure would love a convenient way to run that script, and many other admins probably would too.
Introducing the WCAAS model - Wood Chipper As A Service - coming to a white van near you.
On a serious note, unfortunately those people use VPNs and tend to be very careful with their IPs. A technological solution won't do, chipping is the only cure.
I see. I considered the dependency problem but only thought of using a venv to fix that, however you are right, the python version is also often the cause of compatibility issues.
Sorry I haven't ran this myself yet nor have any experience with that kind of issues. But may I ask why you were concerned with running it inside of a container? Seems rather unnecessary to me.
That is also very true. I think better tooling for that might come with the next pict-rs version, which will move the storage to a database (right now it's in an internal ky-value storage). Hopefully that will make it easier to identify orphaned images.
I’m an instance administrator, what the fuck do I do?
There's one more option. The awesome @db0@lemmy.dbzer0.com has made this tool to detect and automatically remove CSAM content from a pict-rs object storage.
Is the developement discussion for lemmy here on lemmy or on git?
I'd say the bulk of the discussion happens in github issues. There's also a matrix room dedicated to the development of Lemmy but it hasn't received much traffic lately.
Literally 1984.
P.S. "site manager" does sound fancy, I like it.
Actually that has been proposed over the admin matrix chat and I think it could be a smart move. They suggested going as far as banning all VPN IP ranges. But I guess right now everyone is really waiting for the devs to say/do something, before the admins start fixing the problem themselves.