Berliner Boersenzeitung - Anthropic's Claude AI gets smarter -- and mischievious

EUR -
AED 4.194964
AFN 71.962329
ALL 95.607711
AMD 430.633064
ANG 2.044746
AOA 1047.455718
ARS 1597.169164
AUD 1.633408
AWG 2.056074
AZN 1.937983
BAM 1.947808
BBD 2.300461
BDT 140.154929
BGN 1.952481
BHD 0.434119
BIF 3390.787219
BMD 1.142263
BND 1.461603
BOB 7.892616
BRL 6.088954
BSD 1.142213
BTN 105.429411
BWP 15.564139
BYN 3.379633
BYR 22388.358054
BZD 2.297075
CAD 1.567813
CDF 2578.08801
CHF 0.903144
CLF 0.026534
CLP 1047.694983
CNY 7.877735
CNH 7.885249
COP 4206.042137
CRC 537.395008
CUC 1.142263
CUP 30.269974
CVE 109.814419
CZK 24.467299
DJF 203.395446
DKK 7.469419
DOP 70.172076
DZD 152.519197
EGP 59.872586
ERN 17.133947
ETB 178.286769
FJD 2.545759
FKP 0.858727
GBP 0.862717
GEL 3.11834
GGP 0.858727
GHS 12.404105
GIP 0.858727
GMD 83.968708
GNF 10012.915882
GTQ 8.759061
GYD 238.959522
HKD 8.941847
HNL 30.233947
HRK 7.538477
HTG 149.765495
HUF 392.485624
IDR 19372.840409
ILS 3.591663
IMP 0.858727
INR 105.756264
IQD 1496.260674
IRR 1509757.783542
ISK 144.265995
JEP 0.858727
JMD 179.214662
JOD 0.809911
JPY 182.340006
KES 147.733832
KGS 99.89057
KHR 4580.206897
KMF 492.315655
KPW 1028.036787
KRW 1716.535902
KWD 0.351166
KYD 0.951795
KZT 559.16568
LAK 24474.578059
LBP 102280.730681
LKR 355.471461
LRD 209.012399
LSL 19.183488
LTL 3.372806
LVL 0.690944
LYD 7.288096
MAD 10.757561
MDL 19.925244
MGA 4742.540823
MKD 61.388118
MMK 2398.079178
MNT 4076.711204
MOP 9.207521
MRU 45.698494
MUR 53.127208
MVR 17.648461
MWK 1980.473893
MXN 20.473351
MYR 4.4988
MZN 73.001428
NAD 19.183488
NGN 1582.606175
NIO 42.027556
NOK 11.156005
NPR 168.686859
NZD 1.97274
OMR 0.439203
PAB 1.142114
PEN 3.938839
PGK 4.994507
PHP 68.067038
PKR 318.920633
PLN 4.270853
PYG 7368.76513
QAR 4.151964
RON 5.097466
RSD 116.890386
RUB 91.630602
RWF 1666.761092
SAR 4.286839
SBD 9.197181
SCR 17.439444
SDG 686.500582
SEK 10.812155
SGD 1.46549
SHP 0.856993
SLE 28.04248
SLL 23952.699825
SOS 651.626304
SRD 42.88965
STD 23642.541316
STN 24.399885
SVC 9.993994
SYP 126.248614
SZL 19.177313
THB 36.927147
TJS 10.94768
TMT 3.997921
TND 3.37784
TOP 2.750295
TRY 50.476467
TTD 7.746216
TWD 36.774707
TZS 2975.292043
UAH 50.368233
UGX 4294.379678
USD 1.142263
UYU 45.881742
UZS 13791.412287
VES 505.685187
VND 30034.667687
VUV 135.076355
WST 3.124335
XAF 653.276533
XAG 0.014183
XAU 0.000227
XCD 3.087023
XCG 2.058454
XDR 0.812466
XOF 653.276533
XPF 119.331742
YER 272.486584
ZAR 19.332787
ZMK 10281.734904
ZMW 22.231781
ZWL 367.808273
  • RBGPF

    0.1000

    82.5

    +0.12%

  • CMSD

    -0.1100

    22.99

    -0.48%

  • BCC

    0.3800

    70

    +0.54%

  • BCE

    -0.1100

    25.57

    -0.43%

  • JRI

    -0.2300

    12.59

    -1.83%

  • NGG

    0.0900

    90.9

    +0.1%

  • GSK

    -0.8900

    53.39

    -1.67%

  • CMSC

    -0.1500

    22.99

    -0.65%

  • RYCEF

    -1.1300

    16.12

    -7.01%

  • RELX

    -0.0400

    34.14

    -0.12%

  • RIO

    -2.8700

    87.83

    -3.27%

  • VOD

    0.1000

    14.41

    +0.69%

  • AZN

    -2.6000

    189.9

    -1.37%

  • BTI

    0.0400

    59.93

    +0.07%

  • BP

    0.5100

    42.67

    +1.2%

Anthropic's Claude AI gets smarter -- and mischievious
Anthropic's Claude AI gets smarter -- and mischievious / Photo: Julie JAMMOT - AFP

Anthropic's Claude AI gets smarter -- and mischievious

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

Text size:

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.

(H.Schneide--BBZ)