Zürcher Nachrichten - ChatGPT's taste for literary nonsense sparks alarm

Photo: Anna Moneymaker - GETTY IMAGES NORTH AMERICA/AFP


OpenAI's GPT models can often be fooled into declaring that "pseudo-literary" nonsense is great, a German researcher has found.

Christoph Heilig said he discovered that the models consistently rated "nonsense" higher than plainer text -- including when their so-called "reasoning" features were activated -- which could have stark implications for the development of artificial intelligence.

"It's very important that we talk about what happens when we don't build AI as a neutral, robotic helper or assistant" and seek to instil human-like aesthetic and moral judgements, the academic at Munich's Ludwig Maximilian University told AFP.

His research presented the models with increasingly far-fetched variations of a simple text, asking them to rate sentences out of 10 for literary quality.

He started with a very simple text: "The man walked down the street. It was raining. He saw a surveillance camera."

He repeated the tests many times, altering the phrases to include words drawn from categories such as bodily references, film noir-style atmosphere and technical jargon.

The most extreme test phrases were almost total "nonsense", such as "Goetterdaemmerung's corpus haemorrhaged through cryptographic hash, eschaton pooling in existential void beneath fluorescent hum. Photons whispering prayers" -- which the models rated highly.

"Nonsense" could also positively or negatively influence GPT's responses when it was added to an argument the AI was asked to evaluate.

"What my experiment definitely shows is that the more we move towards independently acting (AI) agents... the more we bring aesthetics into play, the more we'll have agents that seem irrational to us human beings," Heilig said.

He added that since AI models are increasingly used to judge each other's work as companies develop new systems, this and similar effects could be passed on through multiple versions -- as he found in his testing.

His research, which has yet to be peer-reviewed, tested OpenAI's recent GPT models, from GPT-5 -- released in August -- to the latest, GPT-5.4.

After publishing details of a similar experiment in August, Heilig said he noticed GPT calling some of his specific test phrases a "literary experiment" -- suggesting someone at OpenAI had taken notice and modified the chatbot to recognise them.

- 'Ripe for exploitation' -

"This is a way in which AI can have its rational judgment short-circuited," said Henry Shevlin, associate director of the University of Cambridge's Leverhulme Centre for the Future of Intelligence, who was not involved in the research.

"But it's just not clear to me that it's so very different for human beings," he added.

"We should expect LLMs (large language models) to have reasoning and cognitive biases and limitations... because almost all forms of intelligence, almost all forms of reasoning are going to exhibit blind spots and biases."

The specific effect found by Heilig could mean that "processes with little human oversight" of AI work are left "ripe for exploitation", Shevlin said -- giving the example of academic journals that use LLMs to review submissions.

A.Senn--NZN