Zürcher Nachrichten - Grok shows 'flaws' in fact-checking Israel-Iran war: study

EUR -
AED 4.212777
AFN 72.835586
ALL 94.512843
AMD 422.248264
ANG 2.053494
AOA 1052.895931
ARS 1680.790338
AUD 1.635257
AWG 2.067368
AZN 1.95436
BAM 1.956354
BBD 2.309354
BDT 140.73988
BGN 1.939347
BHD 0.432422
BIF 3423.630825
BMD 1.146945
BND 1.480319
BOB 7.92328
BRL 5.90941
BSD 1.146625
BTN 108.087801
BWP 15.582008
BYN 3.185903
BYR 22480.122
BZD 2.305963
CAD 1.623185
CDF 2615.035015
CHF 0.925648
CLF 0.026299
CLP 1035.072439
CNY 7.764364
CNH 7.780559
COP 3960.034063
CRC 520.14739
CUC 1.146945
CUP 30.394043
CVE 110.569964
CZK 24.190336
DJF 203.835517
DKK 7.474072
DOP 66.986043
DZD 152.939427
EGP 57.331754
ERN 17.204175
ETB 181.647461
FJD 2.564
FKP 0.86699
GBP 0.866531
GEL 3.039852
GGP 0.86699
GHS 12.874504
GIP 0.86699
GMD 84.304874
GNF 10064.442782
GTQ 8.746478
GYD 239.84901
HKD 8.988436
HNL 30.606273
HRK 7.533254
HTG 149.77244
HUF 351.906109
IDR 20445.785654
ILS 3.394682
IMP 0.86699
INR 108.1919
IQD 1502.49795
IRR 1577049.375404
ISK 143.976448
JEP 0.86699
JMD 181.171337
JOD 0.813229
JPY 185.008009
KES 148.419043
KGS 100.300781
KHR 4599.249852
KMF 492.617229
KPW 1032.250901
KRW 1752.130969
KWD 0.353179
KYD 0.955446
KZT 559.543917
LAK 25295.872375
LBP 102708.92515
LKR 382.668433
LRD 208.916469
LSL 18.815678
LTL 3.386631
LVL 0.693776
LYD 7.311819
MAD 10.580612
MDL 20.248208
MGA 4817.169398
MKD 61.628611
MMK 2408.037641
MNT 4105.573741
MOP 9.256923
MRU 45.947051
MUR 54.881752
MVR 17.720734
MWK 1992.243861
MXN 19.872547
MYR 4.745948
MZN 73.301688
NAD 18.814173
NGN 1560.350288
NIO 41.990088
NOK 11.102662
NPR 172.945006
NZD 1.997675
OMR 0.441554
PAB 1.14663
PEN 3.881306
PGK 5.032508
PHP 69.638491
PKR 319.223511
PLN 4.259467
PYG 7041.056554
QAR 4.175458
RON 5.239364
RSD 117.183799
RUB 83.845404
RWF 1679.12748
SAR 4.299026
SBD 9.24601
SCR 15.693948
SDG 688.744688
SEK 10.98638
SGD 1.482316
SHP 0.85631
SLE 28.387314
SLL 24050.86738
SOS 655.483268
SRD 42.898615
STD 23739.445827
STN 24.544623
SVC 10.032843
SYP 126.774237
SZL 18.814083
THB 37.723444
TJS 10.63456
TMT 4.014308
TND 3.339618
TOP 2.761569
TRY 53.262066
TTD 7.775237
TWD 36.375404
TZS 3017.595134
UAH 51.508996
UGX 4173.182519
USD 1.146945
UYU 45.84299
UZS 13769.075108
VES 695.774297
VND 30176.12295
VUV 136.079641
WST 3.156168
XAF 656.142926
XAG 0.017684
XAU 0.000276
XCD 3.099677
XCG 2.066386
XDR 0.807102
XOF 648.024305
XPF 119.331742
YER 273.665193
ZAR 18.876464
ZMK 10323.885445
ZMW 20.552914
ZWL 369.315822
  • CMSC

    0.0500

    22.37

    +0.22%

  • CMSD

    0.0000

    22.29

    0%

  • NGG

    -1.2400

    79.44

    -1.56%

  • BCC

    3.8500

    74.66

    +5.16%

  • BTI

    -0.5800

    58.91

    -0.98%

  • BP

    -1.0400

    39.1

    -2.66%

  • BCE

    0.0000

    23.28

    0%

  • JRI

    0.0500

    12.67

    +0.39%

  • RBGPF

    -0.5300

    60.61

    -0.87%

  • GSK

    -1.4800

    50.67

    -2.92%

  • RELX

    -0.8300

    31.18

    -2.66%

  • RYCEF

    -0.0300

    18.4

    -0.16%

  • RIO

    -2.5900

    100.08

    -2.59%

  • VOD

    -0.2300

    14.3

    -1.61%

  • AZN

    -2.9600

    174.93

    -1.69%

Grok shows 'flaws' in fact-checking Israel-Iran war: study
Grok shows 'flaws' in fact-checking Israel-Iran war: study / Photo: Lionel BONAVENTURE - AFP

Grok shows 'flaws' in fact-checking Israel-Iran war: study

Elon Musk's AI chatbot Grok produced inaccurate and contradictory responses when users sought to fact-check the Israel-Iran conflict, a study said Tuesday, raising fresh doubts about its reliability as a debunking tool.

Text size:

With tech platforms reducing their reliance on human fact-checkers, users are increasingly utilizing AI-powered chatbots -- including xAI's Grok -- in search of reliable information, but their responses are often themselves prone to misinformation.

"The investigation into Grok's performance during the first days of the Israel-Iran conflict exposes significant flaws and limitations in the AI chatbot's ability to provide accurate, reliable, and consistent information during times of crisis," said the study from the Digital Forensic Research Lab (DFRLab) of the Atlantic Council, an American think tank.

"Grok demonstrated that it struggles with verifying already-confirmed facts, analyzing fake visuals, and avoiding unsubstantiated claims."

The DFRLab analyzed around 130,000 posts in various languages on the platform X, where the AI assistant is built in, to find that Grok was "struggling to authenticate AI-generated media."

Following Iran's retaliatory strikes on Israel, Grok offered vastly different responses to similar prompts about an AI-generated video of a destroyed airport that amassed millions of views on X, the study found.

It oscillated -- sometimes within the same minute -- between denying the airport's destruction and confirming it had been damaged by strikes, the study said.

In some responses, Grok cited the a missile launched by Yemeni rebels as the source of the damage. In others, it wrongly identified the AI-generated airport as one in Beirut, Gaza, or Tehran.

When users shared another AI-generated video depicting buildings collapsing after an alleged Iranian strike on Tel Aviv, Grok responded that it appeared to be real, the study said.

The Israel-Iran conflict, which led to US air strikes against Tehran's nuclear program over the weekend, has churned out an avalanche of online misinformation including AI-generated videos and war visuals recycled from other conflicts.

AI chatbots also amplified falsehoods.

As the Israel-Iran war intensified, false claims spread across social media that China had dispatched military cargo planes to Tehran to offer its support.

When users asked the AI-operated X accounts of AI companies Perplexity and Grok about its validity, both wrongly responded that the claims were true, according to disinformation watchdog NewsGuard.

Researchers say Grok has previously made errors verifying information related to crises such as the recent India-Pakistan conflict and anti-immigration protests in Los Angeles.

Last month, Grok was under renewed scrutiny for inserting "white genocide" in South Africa, a far-right conspiracy theory, into unrelated queries.

Musk's startup xAI blamed an "unauthorized modification" for the unsolicited response.

Musk, a South African-born billionaire, has previously peddled the unfounded claim that South Africa's leaders were "openly pushing for genocide" of white people.

Musk himself blasted Grok after it cited Media Matters -- a liberal media watchdog he has targeted in multiple lawsuits -- as a source in some of its responses about misinformation.

"Shame on you, Grok," Musk wrote on X. "Your sourcing is terrible."

A.P.Huber--NZN