Zürcher Nachrichten - Anthropic's Claude AI gets smarter -- and mischievous

Anthropic's Claude AI gets smarter -- and mischievous / Photo: Julie JAMMOT - AFP

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

“We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” the Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is for AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written," Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with society left to decide how evenly wealth is distributed, Amodei reasoned.

P.Gashi--NZN