CloudFlare Launches AI Bot Protection for Free Users to Automatically Identify and Block Content Scraping Bots
CloudFlare, humorously referred to by netizens as the "Cyber Bodhisattva," has recently rolled out AI bot protection settings for all its free users. This feature was previously available but required setting up rules, which in turn necessitated subscribing to CloudFlare Pro or other paid plans.
The newly introduced AI Scrapers and Crawlers protection is a one-click setup available to all users, regardless of their subscription status. With just a single click to enable this option, CloudFlare claims it will block robots and crawlers from scraping website content for training artificial intelligence models, preventing certain AI companies from unauthorized content scraping.
The underlying mechanism of how it works remains somewhat unclear, but it's likely that CloudFlare has identified and categorized common AI bots, such as OpenAI's GPTBot, and blocks them based on their signatures.
CloudFlare provides security and distribution services to millions of websites, making this feature particularly meaningful. This is especially true for websites, such as news media, that are protected by copyright.
However, the challenge lies in the fact that bot identification can prevent honest bots that declare themselves and respect the robots.txt protocol, like OpenAI's, but it struggles against those who do not adhere to the protocol nor disclose their bot identities, voraciously scraping websites for content.
To counteract such scenarios, measures might include banning user agents (UAs) associated with high-frequency scraping, enabling CloudFlare's CAPTCHA challenges, and other functionalities to prevent various bots from circumventing CloudFlare's AI bot protection.
How to Enable This Feature: Go to CloudFlare, navigate to the dashboard, select the website in question, go to Security, and then to Automated Bots to enable AI Scrapers and Crawlers protection.