Berliner Boersenzeitung - Anthropic's Claude AI gets smarter -- and mischievious

EUR -
AED 4.343084
AFN 77.459286
ALL 96.579317
AMD 443.005126
ANG 2.116942
AOA 1084.441581
ARS 1696.432015
AUD 1.709698
AWG 2.130448
AZN 2.007214
BAM 1.955381
BBD 2.363494
BDT 143.549257
BGN 1.986018
BHD 0.442405
BIF 3475.455694
BMD 1.182597
BND 1.500979
BOB 8.109263
BRL 6.25641
BSD 1.173449
BTN 107.718931
BWP 16.277514
BYN 3.322089
BYR 23178.895993
BZD 2.360095
CAD 1.622777
CDF 2578.061108
CHF 0.92885
CLF 0.026073
CLP 1029.497459
CNY 8.246959
CNH 8.220248
COP 4228.69438
CRC 580.775621
CUC 1.182597
CUP 31.338813
CVE 110.241391
CZK 24.243347
DJF 208.975246
DKK 7.466902
DOP 73.934166
DZD 153.1562
EGP 55.657722
ERN 17.738951
ETB 182.792653
FJD 2.661203
FKP 0.866824
GBP 0.867324
GEL 3.181352
GGP 0.866824
GHS 12.791261
GIP 0.866824
GMD 86.329097
GNF 10278.798686
GTQ 9.007071
GYD 245.51742
HKD 9.221356
HNL 30.954371
HRK 7.533375
HTG 153.907039
HUF 381.677781
IDR 19840.957581
ILS 3.707263
IMP 0.866824
INR 108.317628
IQD 1537.370756
IRR 49816.887621
ISK 145.778454
JEP 0.866824
JMD 184.72044
JOD 0.838427
JPY 184.148094
KES 151.257607
KGS 103.41761
KHR 4722.988522
KMF 496.691175
KPW 1064.460543
KRW 1710.401436
KWD 0.362346
KYD 0.977991
KZT 590.743486
LAK 25359.568979
LBP 105086.794547
LKR 363.552141
LRD 217.093507
LSL 18.940644
LTL 3.491901
LVL 0.715341
LYD 7.466401
MAD 10.748998
MDL 19.972723
MGA 5308.863051
MKD 61.616804
MMK 2482.620837
MNT 4215.294549
MOP 9.425381
MRU 46.916952
MUR 54.293134
MVR 18.271037
MWK 2034.864212
MXN 20.593728
MYR 4.736893
MZN 75.57967
NAD 18.940644
NGN 1680.541045
NIO 43.180752
NOK 11.543747
NPR 172.350089
NZD 1.990578
OMR 0.454253
PAB 1.173549
PEN 3.936857
PGK 5.018925
PHP 69.734175
PKR 328.344981
PLN 4.206148
PYG 7847.319413
QAR 4.278384
RON 5.10168
RSD 117.374863
RUB 88.771554
RWF 1711.533457
SAR 4.43348
SBD 9.606956
SCR 16.85639
SDG 711.331576
SEK 10.578186
SGD 1.50509
SHP 0.887254
SLE 28.85216
SLL 24798.461354
SOS 669.456629
SRD 45.081813
STD 24477.364748
STN 24.494754
SVC 10.267801
SYP 13079.017154
SZL 18.935945
THB 36.920482
TJS 10.97225
TMT 4.139089
TND 3.416268
TOP 2.847409
TRY 51.247241
TTD 7.971293
TWD 37.116742
TZS 3004.156628
UAH 50.599464
UGX 4148.111638
USD 1.182597
UYU 44.440483
UZS 14242.949721
VES 416.587929
VND 31037.251293
VUV 141.325014
WST 3.258752
XAF 655.81655
XAG 0.011483
XAU 0.000237
XCD 3.196027
XCG 2.114947
XDR 0.815625
XOF 655.81655
XPF 119.331742
YER 281.816102
ZAR 19.042528
ZMK 10644.788392
ZMW 23.02207
ZWL 380.795666
  • SCS

    0.0200

    16.14

    +0.12%

  • RBGPF

    -0.8100

    83.23

    -0.97%

  • GSK

    0.5000

    49.15

    +1.02%

  • BP

    1.1000

    36.53

    +3.01%

  • NGG

    1.3200

    81.5

    +1.62%

  • RELX

    0.0600

    39.9

    +0.15%

  • CMSC

    0.1000

    23.75

    +0.42%

  • RYCEF

    0.3000

    17.12

    +1.75%

  • BTI

    0.9400

    59.16

    +1.59%

  • RIO

    3.1300

    90.43

    +3.46%

  • BCE

    0.4900

    25.2

    +1.94%

  • CMSD

    0.0900

    24.13

    +0.37%

  • JRI

    0.0100

    13.68

    +0.07%

  • BCC

    -1.1800

    84.33

    -1.4%

  • VOD

    0.2300

    14.17

    +1.62%

  • AZN

    1.2600

    92.95

    +1.36%

Anthropic's Claude AI gets smarter -- and mischievious
Anthropic's Claude AI gets smarter -- and mischievious / Photo: Julie JAMMOT - AFP

Anthropic's Claude AI gets smarter -- and mischievious

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

Text size:

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.

(H.Schneide--BBZ)