mastodon.gamedev.place is one of the many independent Mastodon servers you can use to participate in the fediverse.
Mastodon server focused on game development and related topics.

Server stats:

5.1K
active users

#aicrawlers

0 posts0 participants0 posts today

AI Crawlers stealing your content? Time to fight back! 💪

LLMs and AI bots are scraping the web, stealing up your data, hogging bandwidth, and even crashing servers under aggressive loads.

Don’t let them freeload! The CrowdSec AI Crawlers Blocklist stops unwanted harvesting before it hurts your site’s performance or privacy.

Regain control over your digital assets: crowdsec.net/blog/protect-agai

So according to the request statistics, since the last rotation of the access log file for the #MacPorts trac this morning, there were:

20.8k requests from IE 3
20.9k requests from IE 4
21.3k requests from IE 5
43 requests from IE 6 and
23 requests from IE 7

These requests came from these Windows versions (roughly 4k per version): CE, 95, 98 (9.5k), NT 4, 2000, XP, NT 5.01(?!), Server 2003, Vista, 7, and 8.0.

I'm sure none of those are AI crawler bots.

🤖 Calling all FOSS communities!

Worried about AI crawlers scraping your content or overwhelming your servers? We’ve got your back. 💪

To support open source communities, we’re offering free access to our Platinum AI Crawlers Blocklist. 🎉

🔗 Learn how to get started: crowdsec.net/blog/protecting-f

www.crowdsec.netProtecting FOSS Communities from AI Crawlers with CrowdSecAnnouncing free access to the CrowdSec AI Crawlers Blocklist for all open source projects, to help FOSS communities reduce unwanted traffic from AI bots.

After struggling all night with LLM crawlers, here's a little something I wrote:

A #Fail2Ban filter to block #LLMCrawlers before they start damaging infrastructure. It works by matching HTTP user-agents.

Hopefully this can be of use for other people as well.

codeberg.org/camelia/llm-crawl

Codeberg.orgllm-crawlers-fail2ban-filterThis repository contains a Fail2Ban filter for use with nginx. Its purpose is to block LLM crawlers before they start damaging infrastructure.