After dabbling in the world of LLM poisoning, I realised that I simply do not have the skill set (or brain power) to effectively poison LLM web scrapers.
I am trying to work with what I know/understand. I have fail2ban installed on my static web server. Is it possible to get a large list of IP addresses known to scrape websites and add it to the ban list?
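For what it's worth, feeding an existing blocklist into fail2ban is mechanically simple: `fail2ban-client set <jail> banip <ip>` adds a ban to a running jail. A minimal sketch, assuming a jail named `scrapers` already exists and the list sits in `blocklist.txt` (both names are placeholders; here the script builds a tiny demo list and runs in dry-run mode so it doesn't need fail2ban present):

```shell
# Demo blocklist (in practice you'd download a community-maintained one;
# addresses below are documentation-range examples, not real scrapers)
printf '203.0.113.7\n# comment line\n198.51.100.9\n' > blocklist.txt

JAIL="scrapers"   # hypothetical jail name
DRY_RUN=1         # unset this to actually call fail2ban-client

while read -r ip; do
  # skip blank lines and comments
  case "$ip" in ''|'#'*) continue ;; esac
  if [ -n "$DRY_RUN" ]; then
    echo "would ban $ip in jail $JAIL"
  else
    fail2ban-client set "$JAIL" banip "$ip"
  fi
done < blocklist.txt
```

The caveat is that a static list goes stale fast, since scrapers rotate through cloud and residential IP ranges, which is exactly why the answer below leans on dynamic rules instead.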
Fail2ban is not really meant as a static security policy.
It's a dynamic tool: it ties patterns in your logs to time-boxed firewall rules.
You could, for instance, auto-ban for 1h any source that requests robots.txt on your web server. I've heard some AI data scrapers actually read robots.txt to discover content to harvest, rather than respecting the restrictions it asks for.
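That robots.txt trap can be expressed as a small filter plus jail. A sketch, assuming an nginx access log in the common/combined format; the filter name `robots-trap` and the paths are my own placeholders, and note this bans well-behaved crawlers too, since they also fetch robots.txt:

```
# /etc/fail2ban/filter.d/robots-trap.conf  (hypothetical filter)
[Definition]
# <HOST> is fail2ban's token for the source IP;
# match any GET/HEAD request for robots.txt
failregex = ^<HOST> .* "(GET|HEAD) /robots\.txt

# /etc/fail2ban/jail.d/robots-trap.local
[robots-trap]
enabled  = true
port     = http,https
filter   = robots-trap
logpath  = /var/log/nginx/access.log
maxretry = 1
bantime  = 1h
```

With `maxretry = 1` a single hit triggers the ban; `bantime = 1h` keeps it time-boxed as described above.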