AI Crawlers Drive 50% Surge in Wikimedia Commons Bandwidth
The Wikimedia Foundation has reported a 50% increase in bandwidth consumption on Wikimedia Commons since January 2024. According to a recent blog post by the organization, the surge is driven not by human users but by automated AI crawlers scraping data to train AI models.
Wikimedia Commons, a repository of freely accessible multimedia files, sees two-thirds of its most resource-intensive traffic come from bots, even though bots account for only 35% of overall pageviews. The disparity arises because bots often request rarely visited content that must be served from the core data center, which is far more costly than serving popular, cached content.
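To see why a minority of pageviews can account for a majority of costly traffic, consider a rough back-of-the-envelope model. The pageview shares below come from the article; the cache-hit rates are purely hypothetical assumptions chosen to illustrate the mechanism, not Wikimedia figures.

```python
# Illustrative model of why bot traffic dominates expensive traffic.
# Pageview shares are from the article; the cache-hit rates are
# hypothetical assumptions used only to show the mechanism.

bot_share = 0.35        # bots: 35% of overall pageviews (from the article)
human_share = 0.65      # humans: the remaining 65%

bot_hit_rate = 0.30     # assumption: bots crawl long-tail pages, so few cache hits
human_hit_rate = 0.90   # assumption: humans mostly view popular, well-cached pages

# Requests that miss the cache fall through to the core data center,
# which is the costly serving path the article describes.
bot_misses = bot_share * (1 - bot_hit_rate)        # 0.245
human_misses = human_share * (1 - human_hit_rate)  # 0.065

bot_fraction_of_expensive = bot_misses / (bot_misses + human_misses)
print(f"Bots' share of core-data-center traffic: {bot_fraction_of_expensive:.0%}")
```

Under these assumed hit rates, bots would generate roughly 79% of the cache-miss traffic reaching the core data center, which shows how a 35% pageview share can easily translate into two-thirds or more of the most expensive traffic.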
The Wikimedia Foundation's site reliability team is actively working to block these crawlers to prevent disruption for regular users. The unprecedented scraper traffic is also driving up the organization's cloud costs, highlighting a growing challenge for open-source infrastructure.
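The article does not detail how Wikimedia blocks these crawlers, but a common first line of defense is filtering requests by User-Agent. The following is a minimal, generic sketch of that approach as a WSGI application; the crawler tokens listed are examples of publicly documented AI crawler user agents, and none of this reflects Wikimedia's actual implementation.

```python
# Minimal sketch of User-Agent-based crawler filtering.
# A generic illustration only, not Wikimedia's actual setup.
from wsgiref.simple_server import make_server

# Example tokens from publicly documented AI crawler user agents.
AI_CRAWLER_TOKENS = ("GPTBot", "CCBot", "ClaudeBot", "Bytespider")

def app(environ, start_response):
    ua = environ.get("HTTP_USER_AGENT", "")
    if any(token in ua for token in AI_CRAWLER_TOKENS):
        # Reject known AI scrapers before they reach the expensive backend.
        start_response("403 Forbidden", [("Content-Type", "text/plain")])
        return [b"Automated scraping is not permitted.\n"]
    start_response("200 OK", [("Content-Type", "text/plain")])
    return [b"Hello, human visitor!\n"]

if __name__ == "__main__":
    make_server("", 8000, app).serve_forever()
```

In practice, User-Agent filtering is easily evaded by crawlers that spoof browser headers, which is why site reliability teams typically combine it with rate limiting and IP-level blocking.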