Child abuse images removed from AI image-generator training source, researchers say

girlfreddy · 10 months ago

Child abuse images removed from AI image-generator training source, researchers say

Flying Squid@lemmy.world · 10 months ago

I’m glad they removed them, but it’s kind of closing the barn doors after the horses have bolted at this point.

Iapar@feddit.org · edit-2 2 months ago

deleted by creator

istanbullu@lemmy.ml · 10 months ago

These datasets have billions of images in them (The Laion database have 5 billion images!). There is no way a human can go through them to check for bad content.

Iapar@feddit.org · edit-2 2 months ago

deleted by creator

istanbullu@lemmy.ml · 10 months ago

The dataset sizes needed for machine learning rule out any kind of human verification. It’s just not possible to manually check billions of images.

Iapar@feddit.org · edit-2 2 months ago

deleted by creator

istanbullu@lemmy.ml · 10 months ago

How would you check 5 billion images?

Iapar@feddit.org · edit-2 2 months ago

deleted by creator

istanbullu@lemmy.ml · 10 months ago

That won’t work. Models of this kind need billions of images or they are trash.

vrek@programming.dev · 10 months ago

Great they removed them… Did they report the images to the authorities?

RecallMadness@lemmy.nz · 10 months ago

If 2000 out of 5,000,000,000 images can be found, why couldn’t they be found before the dataset was published.

girlfreddy · 10 months ago

That’s a question to be pondered for the ages.

/s

Child abuse images removed from AI image-generator training source, researchers say

Child abuse images removed from AI image-generator training source, researchers say

Just a moment...