Discover Innovative Gadgets

Web scraping by artificial intelligence is now prevented by Cloudflare by default settings.

Web crawlers powered by artificial intelligence are now routinely blocked by Cloudflare unless explicit authorization is obtained from site administrators.

, and Administrator

2025 September 29 . 8:45 PM

2 min read

Web scraping via AI is now thwarted by default by Cloudflare's protective measures.

Web scraping by artificial intelligence is now prevented by Cloudflare by default settings.

In the rapidly evolving landscape of artificial intelligence (AI), a significant change is underway as more GenAI vendors grapple with the reality of paying a fair price for high-quality training data while maintaining profitability.

This shift is reflected in the new policy introduced by Cloudflare, a prominent web infrastructure and security company. Under this policy, companies with newly registered domains using Cloudflare's services worldwide are required to explicitly allow AI web crawlers, such as those from OpenAI, to access content. Previously, access was generally granted by default.

The updated policy also introduces a "Pay Per Crawl" program for select publishers. This allows them to set pricing terms for AI scrapers, offering a potential new revenue stream for content creators. Existing domains are not automatically blocked, but the policy underscores the need for a more structured approach to web scraping.

The legality of web scraping has long been a murky area, with loosely enforced rules such as the robots.txt file serving as the primary guide. However, developments in this field highlight the gap between fast-moving technologies and slower regulatory systems. In May 2025, Irish and German regulators declined to block Meta from using Facebook and Instagram data, signalling a potential shift in attitudes towards data usage.

The competition from China may also play a role in this evolution. With many Western GenAI companies facing economic uncertainty, some may choose to exit the business. This could lead to a power shift in the AI industry.

However, it's important to note that in some jurisdictions, a deliberate bypass of anti-bot protection and massive data scraping may constitute a criminal offense. Breach of contract claims, not copyright, could pose the most serious legal threat to GenAI companies.

Cloudflare CEO, Matthew Prince, has emphasised the need for publishers to have control and a new economic model that benefits everyone. As the web scraping landscape continues to evolve, it's clear that a more structured, fair, and legal approach is necessary to ensure a sustainable future for both AI companies and content creators.

Latest

It is an airport, the picture is inside an airport, there are many people waiting for the flights,...

Cloud Computing Revolution

Braunschweig-Wolfsburg Airport Unveils €4M New Terminal in Just 12 Months

The new terminal's €4 million price tag and 12-month construction time are impressive. But it's the modern facilities and improved passenger experience that will make the biggest impact.

, and Administrator

2025 October 9

In the image there are few people, the first two men were wearing Microsoft id cards.

Safeguard Your Gadgets

Optus Data Breach Affects 7.7 Million: IDs Exposed for 1.2M

Optus' data breach impacts millions. 1.2M customers' current ID numbers exposed. Act now to protect your identity.

, and Administrator

2025 October 9

In this image I can see number of buildings, number of trees, clouds, the sky, number of vehicles...

Finance

Namibia Expands Electronic Visa Scheme to 126 Countries

Namibia's new visa scheme welcomes 36 more countries. Enjoy easier entry and lower fees as you explore its stunning landscapes.

, and Administrator

2025 October 9

Web scraping by artificial intelligence is now prevented by Cloudflare by default settings.

Web scraping by artificial intelligence is now prevented by Cloudflare by default settings.

Read also:

Related

Latest