Berliner Boersenzeitung - Anthropic's Claude AI gets smarter -- and mischievious

EUR -
AED 4.178426
AFN 79.167405
ALL 98.060105
AMD 436.693803
ANG 2.036005
AOA 1043.791486
ARS 1347.252549
AUD 1.759577
AWG 2.049169
AZN 1.910892
BAM 1.953039
BBD 2.298032
BDT 139.074868
BGN 1.955683
BHD 0.428824
BIF 3388.066486
BMD 1.137637
BND 1.466514
BOB 7.864814
BRL 6.409417
BSD 1.138181
BTN 97.511887
BWP 15.278204
BYN 3.724802
BYR 22297.685477
BZD 2.286248
CAD 1.561105
CDF 3259.330522
CHF 0.936956
CLF 0.027864
CLP 1069.276332
CNY 8.195875
CNH 8.180412
COP 4694.720795
CRC 579.375992
CUC 1.137637
CUP 30.147381
CVE 110.105017
CZK 24.891196
DJF 202.180553
DKK 7.458914
DOP 67.20501
DZD 149.875728
EGP 56.505179
ERN 17.064555
ETB 155.405078
FJD 2.56344
FKP 0.839728
GBP 0.841209
GEL 3.117211
GGP 0.839728
GHS 11.64344
GIP 0.839728
GMD 81.910185
GNF 9864.666646
GTQ 8.741107
GYD 238.121336
HKD 8.925001
HNL 29.655084
HRK 7.532635
HTG 148.99809
HUF 403.609734
IDR 18587.509883
ILS 4.004539
IMP 0.839728
INR 97.50744
IQD 1490.992566
IRR 47922.959241
ISK 144.605271
JEP 0.839728
JMD 181.553385
JOD 0.806578
JPY 163.677557
KES 147.039767
KGS 99.4862
KHR 4564.488169
KMF 494.301134
KPW 1023.8033
KRW 1566.912621
KWD 0.34897
KYD 0.948447
KZT 582.940922
LAK 24583.037173
LBP 101979.96065
LKR 340.69748
LRD 227.066061
LSL 20.384234
LTL 3.359146
LVL 0.688145
LYD 6.196242
MAD 10.466093
MDL 19.576072
MGA 5172.643292
MKD 61.499701
MMK 2388.355188
MNT 4069.813709
MOP 9.197619
MRU 44.991407
MUR 51.682917
MVR 17.587556
MWK 1973.593089
MXN 21.911026
MYR 4.829247
MZN 72.706455
NAD 20.385486
NGN 1800.549212
NIO 41.880069
NOK 11.54164
NPR 156.020103
NZD 1.895605
OMR 0.43742
PAB 1.138181
PEN 4.120803
PGK 4.676205
PHP 63.373191
PKR 322.141749
PLN 4.27755
PYG 9094.145937
QAR 4.14997
RON 5.057479
RSD 117.214173
RUB 89.845321
RWF 1610.402553
SAR 4.267057
SBD 9.500142
SCR 16.756107
SDG 683.151078
SEK 10.944521
SGD 1.466613
SHP 0.894004
SLE 25.846723
SLL 23855.679611
SOS 650.474873
SRD 42.260376
STD 23546.789313
SVC 9.95853
SYP 14791.345992
SZL 20.376021
THB 37.132267
TJS 11.267874
TMT 3.987418
TND 3.388011
TOP 2.664462
TRY 44.512313
TTD 7.723016
TWD 34.134226
TZS 3060.243236
UAH 47.272613
UGX 4145.141077
USD 1.137637
UYU 47.451054
UZS 14607.774913
VES 107.900918
VND 29641.132404
VUV 137.46876
WST 3.141781
XAF 655.022526
XAG 0.03295
XAU 0.000339
XCD 3.074521
XDR 0.81106
XOF 655.005278
XPF 119.331742
YER 277.413054
ZAR 20.335376
ZMK 10240.097137
ZMW 30.559537
ZWL 366.318654
  • RBGPF

    -1.5000

    67.5

    -2.22%

  • CMSC

    0.0470

    22.117

    +0.21%

  • RYCEF

    0.1600

    12.04

    +1.33%

  • NGG

    -0.5350

    71.395

    -0.75%

  • AZN

    0.2500

    72.18

    +0.35%

  • VOD

    -0.0750

    10.325

    -0.73%

  • GSK

    -1.1450

    40.51

    -2.83%

  • BTI

    1.0550

    46.445

    +2.27%

  • RIO

    -0.6700

    58.91

    -1.14%

  • BP

    0.0120

    29.577

    +0.04%

  • RELX

    -0.5500

    54.03

    -1.02%

  • CMSD

    0.0128

    22.0789

    +0.06%

  • BCE

    -0.3800

    21.9

    -1.74%

  • BCC

    1.9750

    87.075

    +2.27%

  • SCS

    0.3700

    10.56

    +3.5%

  • JRI

    0.0340

    12.95

    +0.26%

Anthropic's Claude AI gets smarter -- and mischievious
Anthropic's Claude AI gets smarter -- and mischievious / Photo: Julie JAMMOT - AFP

Anthropic's Claude AI gets smarter -- and mischievious

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

Text size:

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.

(H.Schneide--BBZ)