Zürcher Nachrichten - Grok shows 'flaws' in fact-checking Israel-Iran war: study

EUR -
AED 4.310221
AFN 82.155324
ALL 97.823133
AMD 450.645235
ANG 2.100475
AOA 1076.234339
ARS 1473.514285
AUD 1.791261
AWG 2.115499
AZN 1.996077
BAM 1.961628
BBD 2.369741
BDT 142.890269
BGN 1.953652
BHD 0.442395
BIF 3447.001039
BMD 1.173647
BND 1.50457
BOB 8.110095
BRL 6.531578
BSD 1.173687
BTN 101.352119
BWP 16.360608
BYN 3.841012
BYR 23003.479863
BZD 2.357604
CAD 1.596576
CDF 3387.145366
CHF 0.93167
CLF 0.029077
CLP 1115.821696
CNY 8.42068
CNH 8.417255
COP 4779.747549
CRC 591.958723
CUC 1.173647
CUP 31.101644
CVE 110.616143
CZK 24.621957
DJF 208.580483
DKK 7.465416
DOP 70.879738
DZD 152.280921
EGP 57.593323
ERN 17.604704
ETB 160.438806
FJD 2.635075
FKP 0.869596
GBP 0.868129
GEL 3.18031
GGP 0.869596
GHS 12.235242
GIP 0.869596
GMD 84.502242
GNF 10159.087821
GTQ 9.007762
GYD 245.549533
HKD 9.212911
HNL 30.925529
HRK 7.533054
HTG 154.01759
HUF 399.0159
IDR 19081.386546
ILS 3.919001
IMP 0.869596
INR 101.336491
IQD 1537.477481
IRR 49425.218326
ISK 142.387226
JEP 0.869596
JMD 188.209174
JOD 0.832173
JPY 172.152847
KES 151.984555
KGS 102.635049
KHR 4715.713436
KMF 495.279152
KPW 1056.318627
KRW 1621.475665
KWD 0.358138
KYD 0.978106
KZT 626.230545
LAK 25309.696554
LBP 105100.083013
LKR 354.034303
LRD 235.902622
LSL 20.679287
LTL 3.465474
LVL 0.709928
LYD 6.367002
MAD 10.596809
MDL 19.905139
MGA 5199.255783
MKD 61.489111
MMK 2463.429779
MNT 4212.545586
MOP 9.490095
MRU 46.734228
MUR 53.389385
MVR 18.070545
MWK 2038.036225
MXN 21.885465
MYR 4.962764
MZN 75.066503
NAD 20.679181
NGN 1796.607129
NIO 43.131273
NOK 11.821875
NPR 162.163792
NZD 1.957138
OMR 0.451263
PAB 1.173687
PEN 4.178539
PGK 4.869168
PHP 66.751132
PKR 334.401423
PLN 4.25183
PYG 8925.250959
QAR 4.27278
RON 5.067772
RSD 117.118129
RUB 92.132882
RWF 1688.291111
SAR 4.4027
SBD 9.723776
SCR 16.576525
SDG 704.776571
SEK 11.168471
SGD 1.500596
SHP 0.922302
SLE 26.993899
SLL 24610.793968
SOS 670.744383
SRD 42.9948
STD 24292.121904
STN 24.881315
SVC 10.269204
SYP 15259.59054
SZL 20.679677
THB 37.756437
TJS 11.267275
TMT 4.119501
TND 3.37717
TOP 2.748801
TRY 47.479094
TTD 7.970681
TWD 34.502525
TZS 3063.218208
UAH 49.028302
UGX 4211.5125
USD 1.173647
UYU 47.390799
UZS 14934.657142
VES 140.209264
VND 30684.999032
VUV 139.392575
WST 3.09389
XAF 657.911671
XAG 0.029898
XAU 0.000342
XCD 3.17184
XCG 2.115285
XDR 0.819039
XOF 659.589833
XPF 119.331742
YER 282.790417
ZAR 20.611981
ZMK 10564.229246
ZMW 27.141638
ZWL 377.913833
  • CMSC

    0.0900

    22.314

    +0.4%

  • CMSD

    0.0250

    22.285

    +0.11%

  • RBGPF

    0.0000

    69.04

    0%

  • SCS

    0.0400

    10.74

    +0.37%

  • RELX

    0.0300

    53

    +0.06%

  • RIO

    -0.1400

    59.33

    -0.24%

  • GSK

    0.1300

    41.45

    +0.31%

  • NGG

    0.2700

    71.48

    +0.38%

  • BP

    0.1750

    30.4

    +0.58%

  • BTI

    0.7150

    48.215

    +1.48%

  • BCC

    0.7900

    91.02

    +0.87%

  • JRI

    0.0200

    13.13

    +0.15%

  • VOD

    0.0100

    9.85

    +0.1%

  • BCE

    -0.0600

    22.445

    -0.27%

  • RYCEF

    0.1000

    12

    +0.83%

  • AZN

    -0.1200

    73.71

    -0.16%

Grok shows 'flaws' in fact-checking Israel-Iran war: study
Grok shows 'flaws' in fact-checking Israel-Iran war: study / Photo: Lionel BONAVENTURE - AFP

Grok shows 'flaws' in fact-checking Israel-Iran war: study

Elon Musk's AI chatbot Grok produced inaccurate and contradictory responses when users sought to fact-check the Israel-Iran conflict, a study said Tuesday, raising fresh doubts about its reliability as a debunking tool.

Text size:

With tech platforms reducing their reliance on human fact-checkers, users are increasingly utilizing AI-powered chatbots -- including xAI's Grok -- in search of reliable information, but their responses are often themselves prone to misinformation.

"The investigation into Grok's performance during the first days of the Israel-Iran conflict exposes significant flaws and limitations in the AI chatbot's ability to provide accurate, reliable, and consistent information during times of crisis," said the study from the Digital Forensic Research Lab (DFRLab) of the Atlantic Council, an American think tank.

"Grok demonstrated that it struggles with verifying already-confirmed facts, analyzing fake visuals, and avoiding unsubstantiated claims."

The DFRLab analyzed around 130,000 posts in various languages on the platform X, where the AI assistant is built in, to find that Grok was "struggling to authenticate AI-generated media."

Following Iran's retaliatory strikes on Israel, Grok offered vastly different responses to similar prompts about an AI-generated video of a destroyed airport that amassed millions of views on X, the study found.

It oscillated -- sometimes within the same minute -- between denying the airport's destruction and confirming it had been damaged by strikes, the study said.

In some responses, Grok cited the a missile launched by Yemeni rebels as the source of the damage. In others, it wrongly identified the AI-generated airport as one in Beirut, Gaza, or Tehran.

When users shared another AI-generated video depicting buildings collapsing after an alleged Iranian strike on Tel Aviv, Grok responded that it appeared to be real, the study said.

The Israel-Iran conflict, which led to US air strikes against Tehran's nuclear program over the weekend, has churned out an avalanche of online misinformation including AI-generated videos and war visuals recycled from other conflicts.

AI chatbots also amplified falsehoods.

As the Israel-Iran war intensified, false claims spread across social media that China had dispatched military cargo planes to Tehran to offer its support.

When users asked the AI-operated X accounts of AI companies Perplexity and Grok about its validity, both wrongly responded that the claims were true, according to disinformation watchdog NewsGuard.

Researchers say Grok has previously made errors verifying information related to crises such as the recent India-Pakistan conflict and anti-immigration protests in Los Angeles.

Last month, Grok was under renewed scrutiny for inserting "white genocide" in South Africa, a far-right conspiracy theory, into unrelated queries.

Musk's startup xAI blamed an "unauthorized modification" for the unsolicited response.

Musk, a South African-born billionaire, has previously peddled the unfounded claim that South Africa's leaders were "openly pushing for genocide" of white people.

Musk himself blasted Grok after it cited Media Matters -- a liberal media watchdog he has targeted in multiple lawsuits -- as a source in some of its responses about misinformation.

"Shame on you, Grok," Musk wrote on X. "Your sourcing is terrible."

A.P.Huber--NZN