Zürcher Nachrichten - As AI data scrapers sap websites' revenues, some fight back

EUR -
AED 4.256969
AFN 73.026624
ALL 95.949668
AMD 436.29849
ANG 2.074968
AOA 1062.937298
ARS 1612.956254
AUD 1.648622
AWG 2.089361
AZN 1.97515
BAM 1.955793
BBD 2.330592
BDT 141.989509
BGN 1.981339
BHD 0.437098
BIF 3425.188147
BMD 1.159146
BND 1.479895
BOB 7.995972
BRL 6.159011
BSD 1.157196
BTN 108.180626
BWP 15.778945
BYN 3.510788
BYR 22719.261378
BZD 2.327292
CAD 1.591102
CDF 2637.057544
CHF 0.913917
CLF 0.027244
CLP 1075.745893
CNY 7.982348
CNH 8.005172
COP 4253.385281
CRC 540.49813
CUC 1.159146
CUP 30.717369
CVE 110.264618
CZK 24.515015
DJF 206.059287
DKK 7.48519
DOP 68.689762
DZD 153.294785
EGP 59.995792
ERN 17.38719
ETB 182.369469
FJD 2.566871
FKP 0.868888
GBP 0.86899
GEL 3.147128
GGP 0.868888
GHS 12.613956
GIP 0.868888
GMD 85.201694
GNF 10142.964899
GTQ 8.863969
GYD 242.099162
HKD 9.082199
HNL 30.628894
HRK 7.547552
HTG 151.809475
HUF 393.739159
IDR 19654.711213
ILS 3.60393
IMP 0.868888
INR 108.971952
IQD 1515.894754
IRR 1525001.44174
ISK 144.047519
JEP 0.868888
JMD 181.799371
JOD 0.82188
JPY 184.582853
KES 149.909481
KGS 101.364887
KHR 4623.983998
KMF 494.955743
KPW 1043.265709
KRW 1744.874492
KWD 0.35536
KYD 0.964297
KZT 556.328075
LAK 24848.914008
LBP 103633.441366
LKR 360.978751
LRD 211.759267
LSL 19.520632
LTL 3.422657
LVL 0.701156
LYD 7.407974
MAD 10.813063
MDL 20.15193
MGA 4824.983303
MKD 61.639787
MMK 2432.834089
MNT 4136.040892
MOP 9.340468
MRU 46.32084
MUR 53.912319
MVR 17.920835
MWK 2006.593056
MXN 20.746631
MYR 4.565921
MZN 74.073751
NAD 19.520632
NGN 1572.092184
NIO 42.579853
NOK 11.093021
NPR 173.089401
NZD 1.985179
OMR 0.445696
PAB 1.157196
PEN 4.000686
PGK 4.994983
PHP 69.723065
PKR 323.078682
PLN 4.282755
PYG 7557.973845
QAR 4.231485
RON 5.101986
RSD 117.449594
RUB 96.003268
RWF 1683.694173
SAR 4.352195
SBD 9.33305
SCR 15.877645
SDG 696.647132
SEK 10.831104
SGD 1.486609
SHP 0.86966
SLE 28.486057
SLL 24306.724357
SOS 661.297712
SRD 43.45349
STD 23991.981659
STN 24.499915
SVC 10.124965
SYP 128.330532
SZL 19.526932
THB 38.14522
TJS 11.114462
TMT 4.068602
TND 3.417588
TOP 2.790945
TRY 51.295112
TTD 7.850973
TWD 37.135217
TZS 3008.589588
UAH 50.693025
UGX 4373.984863
USD 1.159146
UYU 46.629839
UZS 14107.951178
VES 527.05282
VND 30499.449254
VUV 137.764445
WST 3.161931
XAF 655.95473
XAG 0.017051
XAU 0.000257
XCD 3.13265
XCG 2.085493
XDR 0.815797
XOF 655.95473
XPF 119.331742
YER 276.576393
ZAR 19.85325
ZMK 10433.709028
ZMW 22.593922
ZWL 373.244535
  • RBGPF

    -13.5000

    69

    -19.57%

  • BCC

    -1.5600

    68.3

    -2.28%

  • NGG

    -3.5400

    81.99

    -4.32%

  • GSK

    -0.5300

    51.84

    -1.02%

  • VOD

    -0.0900

    14.33

    -0.63%

  • BCE

    0.0600

    25.79

    +0.23%

  • RELX

    -0.4600

    33.36

    -1.38%

  • CMSC

    -0.2000

    22.65

    -0.88%

  • RIO

    -2.5000

    83.15

    -3.01%

  • RYCEF

    -1.2600

    15.34

    -8.21%

  • AZN

    -5.3300

    183.6

    -2.9%

  • CMSD

    -0.2420

    22.658

    -1.07%

  • JRI

    -0.3900

    11.77

    -3.31%

  • BTI

    -1.3500

    57.37

    -2.35%

  • BP

    -1.0800

    44.78

    -2.41%

As AI data scrapers sap websites' revenues, some fight back
As AI data scrapers sap websites' revenues, some fight back / Photo: PATRICIA DE MELO MOREIRA - AFP

As AI data scrapers sap websites' revenues, some fight back

A swarm of AI "crawlers" is running rampant on the internet, scouring billions of websites for data to feed algorithms at leading tech companies -- all without permission or payment, upending the online economy.

Text size:

Before the rise of AI chatbots, websites allowed search engines to access their content in return for increased visibility, a system that rewarded them with traffic and advertising revenues.

But the rapid development of generative AI has allowed tech giants like Google and OpenAI to harvest information for their chatbots with web crawlers, without humans ever needing to visit the original sites.

Traditional content producers, such as media outlets, are being outpaced by AI crawlers, which have cut into their online operations and advertising revenues.

"Sites that gave bots access to their content used to get readers in exchange," said Kurt Muehmel, head of AI strategy at data management firm Dataiku.

But the arrival of generative AI "completely breaks" that model, he told AFP.

Wikipedia's human internet traffic fell by eight percent between 2024 and 2025 because of a rise in AI search engine summaries, the online encyclopaedia reported last month.

"The fundamental tension is that the new business of the internet that is AI-driven doesn't generate traffic," said Matthew Prince, CEO of Cloudflare, an American internet services provider.

- 'No trespassing' -

Cloudflare, which processes more than 20 percent of all internet traffic, announced this summer a new measure aimed at blocking AI crawlers from accessing content without payment or permission from website owners.

"It's basically like putting a speed limit sign or a no trespassing sign," Prince told AFP on the sidelines of the Web Summit in Lisbon.

"Badly behaving bots can get by that, but we can track that... Over time, we can tighten these controls in a way that we're confident the AI companies can't get through."

The measure, which applies to more than 10 million websites, has already "attracted the attention of artificial intelligence giants", he added.

On a smaller scale, American startup TollBit is providing online news publishers with tools to block, monitor and monetise AI crawler traffic.

"The internet is a highway," said CEO and co-founder Toshit Panigrahi, who described the company as a "tollbooth on the internet".

TollBit works with more than 5,600 sites, including USA Today, Time magazine and the Associated Press, allowing media outlets to set their own access fees for their content.

The analytics are free for publishers, but AI companies are charged a "transaction fee for every piece of content they access".

But for Muehmel, the online takeover by AI crawlers cannot be resolved with only "partial measures or by an individual company".

"This is an evolution of the entire internet economy, which will take years," he said.

If the bot swarm continues to roam freely online, "all of the incentives for content creation are going to go away," Prince said.

"That would be a loss, not just for us humans that want to consume it, but actually for the AI companies that need original content in order to train their systems."

M.J.Baumann--NZN