Berliner Boersenzeitung - As AI data scrapers sap websites' revenues, some fight back

EUR -
AED 4.343054
AFN 77.464136
ALL 96.578481
AMD 443.001294
ANG 2.116924
AOA 1084.432259
ARS 1696.425045
AUD 1.722632
AWG 2.13043
AZN 2.015092
BAM 1.955364
BBD 2.363473
BDT 143.548016
BGN 1.986001
BHD 0.442401
BIF 3475.425631
BMD 1.182587
BND 1.500966
BOB 8.109193
BRL 6.256361
BSD 1.173439
BTN 107.717999
BWP 16.277373
BYN 3.32206
BYR 23178.695489
BZD 2.360074
CAD 1.622687
CDF 2578.039008
CHF 0.922409
CLF 0.026073
CLP 1029.489324
CNY 8.24689
CNH 8.21806
COP 4228.657801
CRC 580.770597
CUC 1.182587
CUP 31.338542
CVE 110.240437
CZK 24.267271
DJF 208.973438
DKK 7.466899
DOP 73.933527
DZD 153.154875
EGP 55.703589
ERN 17.738798
ETB 182.791072
FJD 2.661179
FKP 0.870315
GBP 0.866681
GEL 3.18162
GGP 0.870315
GHS 12.79115
GIP 0.870315
GMD 86.329235
GNF 10278.709772
GTQ 9.006993
GYD 245.515296
HKD 9.221278
HNL 30.954103
HRK 7.533317
HTG 153.905708
HUF 382.153287
IDR 19840.785951
ILS 3.707232
IMP 0.870315
INR 108.316693
IQD 1537.357457
IRR 49816.456691
ISK 145.777895
JEP 0.870315
JMD 184.718842
JOD 0.838501
JPY 184.146504
KES 151.256298
KGS 103.416722
KHR 4722.947667
KMF 496.686746
KPW 1064.353704
KRW 1710.387141
KWD 0.362349
KYD 0.977982
KZT 590.738376
LAK 25359.349612
LBP 105085.885516
LKR 363.548997
LRD 217.091629
LSL 18.94048
LTL 3.491871
LVL 0.715335
LYD 7.466336
MAD 10.748905
MDL 19.97255
MGA 5308.817127
MKD 61.616271
MMK 2483.187819
MNT 4218.830116
MOP 9.4253
MRU 46.916546
MUR 54.292994
MVR 18.271409
MWK 2034.84661
MXN 20.533372
MYR 4.736855
MZN 75.57955
NAD 18.94048
NGN 1680.526824
NIO 43.180379
NOK 11.555294
NPR 172.348599
NZD 1.987207
OMR 0.454249
PAB 1.173539
PEN 3.936823
PGK 5.018882
PHP 69.733624
PKR 328.342141
PLN 4.208885
PYG 7847.251532
QAR 4.278347
RON 5.101724
RSD 117.373848
RUB 89.207823
RWF 1711.518652
SAR 4.433442
SBD 9.606873
SCR 16.856244
SDG 711.330129
SEK 10.584272
SGD 1.505082
SHP 0.887246
SLE 28.859447
SLL 24798.24684
SOS 669.450838
SRD 45.081425
STD 24477.153012
STN 24.494542
SVC 10.267712
SYP 13078.904017
SZL 18.935781
THB 36.920787
TJS 10.972155
TMT 4.139053
TND 3.416239
TOP 2.847384
TRY 51.246799
TTD 7.971224
TWD 37.116428
TZS 3004.130641
UAH 50.599026
UGX 4148.075755
USD 1.182587
UYU 44.440098
UZS 14242.826515
VES 416.584326
VND 31036.982812
VUV 141.661813
WST 3.258757
XAF 655.810877
XAG 0.011483
XAU 0.000237
XCD 3.196
XCG 2.114929
XDR 0.815618
XOF 655.810877
XPF 119.331742
YER 281.814608
ZAR 19.0597
ZMK 10644.701884
ZMW 23.02187
ZWL 380.792372
  • SCS

    0.0200

    16.14

    +0.12%

  • RBGPF

    -0.8100

    83.23

    -0.97%

  • NGG

    1.3200

    81.5

    +1.62%

  • VOD

    0.2300

    14.17

    +1.62%

  • RELX

    0.0600

    39.9

    +0.15%

  • CMSC

    0.1000

    23.75

    +0.42%

  • GSK

    0.5000

    49.15

    +1.02%

  • BCE

    0.4900

    25.2

    +1.94%

  • RYCEF

    0.3000

    17.12

    +1.75%

  • BTI

    0.9400

    59.16

    +1.59%

  • RIO

    3.1300

    90.43

    +3.46%

  • BP

    1.1000

    36.53

    +3.01%

  • JRI

    0.0100

    13.68

    +0.07%

  • CMSD

    0.0900

    24.13

    +0.37%

  • BCC

    -1.1800

    84.33

    -1.4%

  • AZN

    1.2600

    92.95

    +1.36%

As AI data scrapers sap websites' revenues, some fight back
As AI data scrapers sap websites' revenues, some fight back / Photo: PATRICIA DE MELO MOREIRA - AFP

As AI data scrapers sap websites' revenues, some fight back

A swarm of AI "crawlers" is running rampant on the internet, scouring billions of websites for data to feed algorithms at leading tech companies -- all without permission or payment, upending the online economy.

Text size:

Before the rise of AI chatbots, websites allowed search engines to access their content in return for increased visibility, a system that rewarded them with traffic and advertising revenues.

But the rapid development of generative AI has allowed tech giants like Google and OpenAI to harvest information for their chatbots with web crawlers, without humans ever needing to visit the original sites.

Traditional content producers, such as media outlets, are being outpaced by AI crawlers, which have cut into their online operations and advertising revenues.

"Sites that gave bots access to their content used to get readers in exchange," said Kurt Muehmel, head of AI strategy at data management firm Dataiku.

But the arrival of generative AI "completely breaks" that model, he told AFP.

Wikipedia's human internet traffic fell by eight percent between 2024 and 2025 because of a rise in AI search engine summaries, the online encyclopaedia reported last month.

"The fundamental tension is that the new business of the internet that is AI-driven doesn't generate traffic," said Matthew Prince, CEO of Cloudflare, an American internet services provider.

- 'No trespassing' -

Cloudflare, which processes more than 20 percent of all internet traffic, announced this summer a new measure aimed at blocking AI crawlers from accessing content without payment or permission from website owners.

"It's basically like putting a speed limit sign or a no trespassing sign," Prince told AFP on the sidelines of the Web Summit in Lisbon.

"Badly behaving bots can get by that, but we can track that... Over time, we can tighten these controls in a way that we're confident the AI companies can't get through."

The measure, which applies to more than 10 million websites, has already "attracted the attention of artificial intelligence giants", he added.

On a smaller scale, American startup TollBit is providing online news publishers with tools to block, monitor and monetise AI crawler traffic.

"The internet is a highway," said CEO and co-founder Toshit Panigrahi, who described the company as a "tollbooth on the internet".

TollBit works with more than 5,600 sites, including USA Today, Time magazine and the Associated Press, allowing media outlets to set their own access fees for their content.

The analytics are free for publishers, but AI companies are charged a "transaction fee for every piece of content they access".

But for Muehmel, the online takeover by AI crawlers cannot be resolved with only "partial measures or by an individual company".

"This is an evolution of the entire internet economy, which will take years," he said.

If the bot swarm continues to roam freely online, "all of the incentives for content creation are going to go away," Prince said.

"That would be a loss, not just for us humans that want to consume it, but actually for the AI companies that need original content in order to train their systems."

(K.Müller--BBZ)