Zürcher Nachrichten - As AI data scrapers sap websites' revenues, some fight back

EUR -
AED 4.278489
AFN 76.301366
ALL 96.530556
AMD 444.389335
ANG 2.085119
AOA 1068.154458
ARS 1670.316609
AUD 1.75427
AWG 2.096704
AZN 1.984845
BAM 1.955415
BBD 2.345238
BDT 142.439297
BGN 1.957372
BHD 0.439074
BIF 3456.06653
BMD 1.164835
BND 1.508396
BOB 8.046379
BRL 6.313529
BSD 1.16437
BTN 104.690912
BWP 15.469884
BYN 3.34764
BYR 22830.773166
BZD 2.341828
CAD 1.611422
CDF 2599.912958
CHF 0.937162
CLF 0.02734
CLP 1072.545921
CNY 8.235507
CNH 8.234944
COP 4446.759008
CRC 568.78787
CUC 1.164835
CUP 30.868137
CVE 110.780379
CZK 24.198994
DJF 207.014999
DKK 7.469472
DOP 74.84113
DZD 151.385181
EGP 55.40272
ERN 17.47253
ETB 180.60972
FJD 2.630723
FKP 0.8723
GBP 0.873382
GEL 3.149553
GGP 0.8723
GHS 13.337819
GIP 0.8723
GMD 85.033396
GNF 10119.511721
GTQ 8.919242
GYD 243.610929
HKD 9.068302
HNL 30.667954
HRK 7.538703
HTG 152.42995
HUF 382.163892
IDR 19442.733022
ILS 3.76907
IMP 0.8723
INR 104.795933
IQD 1525.399284
IRR 49054.133779
ISK 149.006189
JEP 0.8723
JMD 186.373259
JOD 0.825914
JPY 180.836077
KES 150.617641
KGS 101.8653
KHR 4665.166047
KMF 491.560932
KPW 1048.343898
KRW 1715.709753
KWD 0.357232
KYD 0.970405
KZT 588.861385
LAK 25249.913875
LBP 104272.296288
LKR 359.159196
LRD 204.939598
LSL 19.73441
LTL 3.439456
LVL 0.704598
LYD 6.329752
MAD 10.752872
MDL 19.812009
MGA 5193.953775
MKD 61.627851
MMK 2446.083892
MNT 4131.091086
MOP 9.337359
MRU 46.433846
MUR 53.664406
MVR 17.950554
MWK 2019.093291
MXN 21.176696
MYR 4.788683
MZN 74.437324
NAD 19.73441
NGN 1689.139851
NIO 42.851552
NOK 11.767103
NPR 167.505978
NZD 2.016522
OMR 0.447885
PAB 1.164465
PEN 3.914028
PGK 4.940241
PHP 68.699705
PKR 326.441746
PLN 4.232667
PYG 8008.421228
QAR 4.244263
RON 5.093014
RSD 117.420109
RUB 89.113003
RWF 1694.158743
SAR 4.371861
SBD 9.5794
SCR 15.722146
SDG 700.652754
SEK 10.953705
SGD 1.509027
SHP 0.873928
SLE 26.791608
SLL 24426.013032
SOS 664.266196
SRD 44.99647
STD 24109.740275
STN 24.495171
SVC 10.187374
SYP 12881.033885
SZL 19.719113
THB 37.125677
TJS 10.683448
TMT 4.076924
TND 3.415727
TOP 2.804644
TRY 49.510866
TTD 7.893444
TWD 36.432793
TZS 2836.374505
UAH 48.875802
UGX 4119.187948
USD 1.164835
UYU 45.541022
UZS 13930.253805
VES 289.561652
VND 30705.060237
VUV 142.19158
WST 3.250066
XAF 655.824896
XAG 0.019865
XAU 0.000276
XCD 3.148026
XCG 2.098577
XDR 0.815408
XOF 655.723589
XPF 119.331742
YER 277.700931
ZAR 19.720255
ZMK 10484.920268
ZMW 26.920577
ZWL 375.076512
  • CMSC

    -0.0800

    23.4

    -0.34%

  • BCC

    -1.1100

    73.15

    -1.52%

  • GSK

    -0.3270

    48.243

    -0.68%

  • BCE

    0.2500

    23.47

    +1.07%

  • RIO

    -0.3100

    73.42

    -0.42%

  • SCS

    -0.0850

    16.145

    -0.53%

  • NGG

    -0.3900

    75.52

    -0.52%

  • BP

    -0.9650

    36.265

    -2.66%

  • BTI

    -0.8250

    57.215

    -1.44%

  • RBGPF

    0.0000

    78.35

    0%

  • JRI

    0.0300

    13.78

    +0.22%

  • RYCEF

    -0.1400

    14.51

    -0.96%

  • RELX

    -0.1340

    40.406

    -0.33%

  • CMSD

    -0.0550

    23.265

    -0.24%

  • VOD

    -0.1630

    12.47

    -1.31%

  • AZN

    0.2900

    90.32

    +0.32%

As AI data scrapers sap websites' revenues, some fight back
As AI data scrapers sap websites' revenues, some fight back / Photo: PATRICIA DE MELO MOREIRA - AFP

As AI data scrapers sap websites' revenues, some fight back

A swarm of AI "crawlers" is running rampant on the internet, scouring billions of websites for data to feed algorithms at leading tech companies -- all without permission or payment, upending the online economy.

Text size:

Before the rise of AI chatbots, websites allowed search engines to access their content in return for increased visibility, a system that rewarded them with traffic and advertising revenues.

But the rapid development of generative AI has allowed tech giants like Google and OpenAI to harvest information for their chatbots with web crawlers, without humans ever needing to visit the original sites.

Traditional content producers, such as media outlets, are being outpaced by AI crawlers, which have cut into their online operations and advertising revenues.

"Sites that gave bots access to their content used to get readers in exchange," said Kurt Muehmel, head of AI strategy at data management firm Dataiku.

But the arrival of generative AI "completely breaks" that model, he told AFP.

Wikipedia's human internet traffic fell by eight percent between 2024 and 2025 because of a rise in AI search engine summaries, the online encyclopaedia reported last month.

"The fundamental tension is that the new business of the internet that is AI-driven doesn't generate traffic," said Matthew Prince, CEO of Cloudflare, an American internet services provider.

- 'No trespassing' -

Cloudflare, which processes more than 20 percent of all internet traffic, announced this summer a new measure aimed at blocking AI crawlers from accessing content without payment or permission from website owners.

"It's basically like putting a speed limit sign or a no trespassing sign," Prince told AFP on the sidelines of the Web Summit in Lisbon.

"Badly behaving bots can get by that, but we can track that... Over time, we can tighten these controls in a way that we're confident the AI companies can't get through."

The measure, which applies to more than 10 million websites, has already "attracted the attention of artificial intelligence giants", he added.

On a smaller scale, American startup TollBit is providing online news publishers with tools to block, monitor and monetise AI crawler traffic.

"The internet is a highway," said CEO and co-founder Toshit Panigrahi, who described the company as a "tollbooth on the internet".

TollBit works with more than 5,600 sites, including USA Today, Time magazine and the Associated Press, allowing media outlets to set their own access fees for their content.

The analytics are free for publishers, but AI companies are charged a "transaction fee for every piece of content they access".

But for Muehmel, the online takeover by AI crawlers cannot be resolved with only "partial measures or by an individual company".

"This is an evolution of the entire internet economy, which will take years," he said.

If the bot swarm continues to roam freely online, "all of the incentives for content creation are going to go away," Prince said.

"That would be a loss, not just for us humans that want to consume it, but actually for the AI companies that need original content in order to train their systems."

M.J.Baumann--NZN