PSA: You can upload images to a Lemmy instance without anyone knowing

bmygsbvur · edit-2 2 years ago

PSA: You can upload images to a Lemmy instance without anyone knowing

pistolero@freespeechextremist.com · 2 years ago

@ceo_of_monoeye_dating @Nerd02 @bmygsbvur @db0 The last time the topic came up, the only publicly available API for this was owned by the feds. I don’t know if this tool downloads a model (I also don’t know how such a model could be legal to possess) or if it consults an API (which would be a privacy concern). In either case, you’d have to be very careful about false positives.

@ryona.agency · 2 years ago

@p @ceo_of_monoeye_dating @Nerd02 @bmygsbvur @db0 Yeah, it’s using local CLIP model, something I’ve suggested both to gr*f and jakparty.soy admin. The problem is that it requires a lot of clock cycles, preferably on GPU, so it isn’t something people with $5 VPSes can afford. Not fully sure about effectiveness, either, malicious actors can keep scrambling the image so that it passes the filter yet is still recognizable by human brain.

CMD@bae.st · 2 years ago

@mint @p @Nerd02 @bmygsbvur @db0 This is the type of response I was looking for - and why I’d asked pete. If the big problem’s clock cycles, then maybe there’s something that can be done - after all, the model’s way beefier than what’s needed to solve this particular problem, it does much more.

pistolero@freespeechextremist.com · 2 years ago

@mint @Nerd02 @bmygsbvur @ceo_of_monoeye_dating @db0

> it’s using local CLIP model,

How does this not end up getting used to produce computer-generated CP?

> isn’t something people with $5 VPSes can afford.

Yeah, but when you’re at the $5 VPS stage, you’re usually going to be hosting a couple dozen people at most.

> malicious actors can keep scrambling the image so that it passes the filter yet is still recognizable by human brain.

Yeah. Not foolproof.

CMD@bae.st · 2 years ago

@p @Nerd02 @bmygsbvur @db0 @mint >How does this not end up getting used to produce computer-generated CP?

It was. That’s the problem they wrote this script to try to solve.

pistolero@freespeechextremist.com · 2 years ago

@ceo_of_monoeye_dating @Nerd02 @bmygsbvur @db0 @mint Yeah, presumably it is better at detecting stuff that it produces itself, but my understanding is that this kind of model is legally questionable to possess because of that.

CMD@bae.st · 2 years ago

@p @Nerd02 @bmygsbvur @db0 @mint They’ve had the model on github for months. If they were gonna get bonked, they’d’ve gotten bonked by now.

laurel@freespeechextremist.com · 2 years ago

@ceo_of_monoeye_dating @p @Nerd02 @bmygsbvur @db0 @mint

It’s not their model, it’s an implementation of the openAI paper from some academics hosted here https://github.com/pharmapsychotic/clip-interrogator/

To be specific they use one of the ViT-L/14 models.
This type of labeling models have been around for a long time. They used to be called text-from-image or some other similar verbose description.

If the current generative models can produce porn then they can also produce CSAM, there’s no need to go through another layer.
The issue with models trained on actual illegal material is that then they could be reverse engineered to output the very same material that they have been trained with, in addition to very realistic generated ones. It’s similar to how LLMs can be used to extract potentially private information they’ve been trained with.

laurel@freespeechextremist.com · 2 years ago

@ceo_of_monoeye_dating @Nerd02 @bmygsbvur @db0 @mint @p

*some academics hosted here https://github.com/mlfoundations/open_clip
The above link was just the wrapper.

Soy_Magnus@freespeechextremist.com · 2 years ago

@laurel @ceo_of_monoeye_dating @Nerd02 @bmygsbvur @db0 @mint @p HI LAUREL
bearhug.gif

pistolero@freespeechextremist.com · 2 years ago

@ceo_of_monoeye_dating @Nerd02 @bmygsbvur @db0 @mint Yeah, but youtube-dl was on Github for years and then suddenly declared an evil piracy tool and scrubbed and banned. The odds that you get bonked are also higher than the odds that Github gets bonked; “I got it from Github” doesn’t constitute much of a defense.

In either case, I don’t have much investment in the legality of that model because I don’t plan to acquire it. Just it was my understanding that possessing a model that was trained on some source material and that can be used to produce material resembling the source material is considered the same, legally, as possessing the source material. I’m not an expert on that and I don’t think there have even been any cases yet.

CMD@bae.st · 2 years ago

@p @Nerd02 @bmygsbvur @db0 @mint The problem with the models is the fact that training data can be reverse engineered from the model. If the model’s not trained on any CP, there’s not likely to be any problem.

pistolero@freespeechextremist.com · 2 years ago

@ceo_of_monoeye_dating @Nerd02 @bmygsbvur @db0 @mint Ah, okay, so this one wasn’t trained on that material?

@ryona.agency · 2 years ago

@p @ceo_of_monoeye_dating @Nerd02 @bmygsbvur @db0 Yes, but it should be able to count two concepts together even if there were no overlap between the two in training data.

laurel@freespeechextremist.com · 2 years ago

@p @ceo_of_monoeye_dating @Nerd02 @bmygsbvur @db0

> I don’t know if this tool downloads a model
It’s just a model that provides text descriptions for the images fed to it. The tool does some keyword searches on the output to detect illegal material.

pistolero@freespeechextremist.com · 2 years ago

@laurel @Nerd02 @bmygsbvur @ceo_of_monoeye_dating @db0 Then it’s definitely going to be unreliable.

laurel@freespeechextremist.com · 2 years ago

@p @Nerd02 @bmygsbvur @ceo_of_monoeye_dating @db0

Compared to what the feds use yeah, but it is a way to leverage legal training material to detect illegal one.
Think of it like this, you have a model that detects pornographic content and another one that detects age of people depicted. You run the image through both and if the result is over some threshold you flag the image.

In this case they use an off the shelf general model that outputs a text description and they just use the raw keyword weights without the sentence generating phase.

CMD@bae.st · 2 years ago

@laurel @p @Nerd02 @bmygsbvur @db0 If nothing else, the fact that this model exists and is not getting rekt by fedbois is a sign that the problem *can* be solved. I’m bookmarking this package - the next time everyone starts bitching about CP spam, I’m going to throw it on the table.

pistolero@freespeechextremist.com · 2 years ago

@ceo_of_monoeye_dating @laurel @Nerd02 @bmygsbvur @db0

> If nothing else, the fact that this model exists and is not getting rekt by fedbois is a sign that

This is not a sign of anything. “The cops didn’t seem to care yesterday” doesn’t indicate anything about today.

> the next time everyone starts bitching about CP spam, I’m going to throw it on the table.

“Why don’t you use a ridiculous amount of bandwidth downloading literally every image and then a ridiculous amount of computer juice processing all of it and then deal with the false positives?”

I don’t even use the thumbnailer because it is too heavy. sjw regularly posts 12MB JPEGs. It’s so heavyweight that you could DoS it just by posting a lot of very large images, and you could defeat it pretty easily. Even something like hashing the images is too much for most instances.

CMD@bae.st · 2 years ago

@p @laurel @Nerd02 @bmygsbvur @db0 >“Why don’t you use a ridiculous amount of bandwidth downloading literally every image and then a ridiculous amount of computer juice processing all of it and then deal with the false positives?”

Right, this is actually the key problem - the model is pretty beefy, and doing this for every instance that ain’t your own is a sure way to get completely wrecked.

Regardless, this is better than what we believed before - the tools not only can be built, but they exist and are apparently being used (albeit on a smaller scale - the tool posted above *only* checks images on your own instance, and even then only those that are orphaned.)

pistolero@freespeechextremist.com · 2 years ago

@ceo_of_monoeye_dating @Nerd02 @bmygsbvur @db0 @laurel

> Regardless, this is better than what we believed before - the tools not only can be built,

If it works. I mean, image classifiers aren’t new. There’s no way to verify whether this (or any) tool does the job for which it is intended, though, so it’s not only expensive but it’s unknown how useful it is.

CMD@bae.st · 2 years ago

@p @laurel @Nerd02 @bmygsbvur @db0 There’s no way to make something like this reliable. The only people holding onto a dataset like this are cops and pedos.

Cops don’t release models like this because of Dwork’s result, and pedos aren’t exactly invested in stopping other pedos from fapping to CP.

CMD@bae.st · 2 years ago

@p @Nerd02 @bmygsbvur @db0 It’s the code in the horde-safety package, which I’ve linked here: https://github.com/Haidra-Org/horde-safety/blob/main/horde_safety/csam_checker.py

At a first glance, it looks like it takes an image, runs it through a model to return keywords that would’ve been used to generate such an image, then checks them against a pair of lists containing “underage” words and “pornographic” words. In a deep sense, it detects if an image “has children” and “is porn” without ever having trained on a combination of the two.

The model’s more beefy than what’s needed to solve this problem minimally, but it does appear to solve the problem.