Berliner Boersenzeitung - As AI data scrapers sap websites' revenues, some fight back

As AI data scrapers sap websites' revenues, some fight back / Photo: PATRICIA DE MELO MOREIRA - AFP

A swarm of AI "crawlers" is running rampant on the internet, scouring billions of websites for data to feed algorithms at leading tech companies, all without permission or payment, and upending the online economy.

Before the rise of AI chatbots, websites allowed search engines to access their content in return for increased visibility, a system that rewarded them with traffic and advertising revenues.

But the rapid development of generative AI has allowed tech giants like Google and OpenAI to harvest information for their chatbots with web crawlers, without humans ever needing to visit the original sites.

Traditional content producers, such as media outlets, are being outpaced by AI crawlers, which have cut into their online operations and advertising revenues.

"Sites that gave bots access to their content used to get readers in exchange," said Kurt Muehmel, head of AI strategy at data management firm Dataiku.

But the arrival of generative AI "completely breaks" that model, he told AFP.

Wikipedia's human internet traffic fell by eight percent between 2024 and 2025 because of a rise in AI search engine summaries, the online encyclopaedia reported last month.

"The fundamental tension is that the new business of the internet that is AI-driven doesn't generate traffic," said Matthew Prince, CEO of Cloudflare, an American internet services provider.

- 'No trespassing' -

Cloudflare, which processes more than 20 percent of all internet traffic, announced a new measure this summer aimed at blocking AI crawlers from accessing content without payment or permission from website owners.

"It's basically like putting a speed limit sign or a no trespassing sign," Prince told AFP on the sidelines of the Web Summit in Lisbon.

"Badly behaving bots can get by that, but we can track that... Over time, we can tighten these controls in a way that we're confident the AI companies can't get through."

The measure, which applies to more than 10 million websites, has already "attracted the attention of artificial intelligence giants", he added.

On a smaller scale, American startup TollBit is providing online news publishers with tools to block, monitor and monetise AI crawler traffic.

"The internet is a highway," said CEO and co-founder Toshit Panigrahi, who described the company as a "tollbooth on the internet".

TollBit works with more than 5,600 sites, including USA Today, Time magazine and the Associated Press, allowing media outlets to set their own access fees for their content.

The analytics are free for publishers, but AI companies are charged a "transaction fee for every piece of content they access".

But for Muehmel, the online takeover by AI crawlers cannot be resolved with only "partial measures or by an individual company".

"This is an evolution of the entire internet economy, which will take years," he said.

If the bot swarm continues to roam freely online, "all of the incentives for content creation are going to go away," Prince said.

"That would be a loss, not just for us humans that want to consume it, but actually for the AI companies that need original content in order to train their systems."

(K.Müller--BBZ)