Anthropic's Claude AI gets smarter -- and mischievious

Berliner Boersenzeitung - Anthropic's Claude AI gets smarter -- and mischievious

Berlin 21°C

EUR -

AED 4.236833

AFN 76.142197

ALL 93.45107

AMD 420.704816

AOA 1059.063247

ARS 1713.99192

AUD 1.649976

AWG 2.076594

AZN 1.965807

BAM 1.955822

BBD 2.316627

BDT 142.031626

BHD 0.433705

BIF 3413.239071

BMD 1.153663

BND 1.476217

BOB 13.658156

BRL 5.861191

BSD 1.150213

BTN 109.651255

BWP 15.68318

BYN 3.345938

BYR 22611.798836

BZD 2.313326

CAD 1.617379

CDF 2624.584211

CHF 0.93183

CLF 0.027184

CLP 1073.37276

CNY 7.788731

CNH 7.790705

COP 3653.005322

CRC 522.40598

CUC 1.153663

CUP 30.572075

CVE 110.266262

CZK 24.240891

DJF 204.822345

DKK 7.480664

DOP 66.730764

DZD 153.303755

EGP 59.017676

ERN 17.304948

ETB 183.804404

FJD 2.554153

FKP 0.856626

GBP 0.855675

GEL 3.016876

GGP 0.856626

GHS 13.446154

GIP 0.856626

GMD 84.798688

GNF 10097.115581

GTQ 8.7761

GYD 240.602754

HKD 9.047262

HNL 30.819353

HRK 7.538386

HTG 150.391722

HUF 365.192535

IDR 20799.39394

ILS 3.533613

IMP 0.856626

INR 110.048515

IQD 1506.817248

IRR 1586431.11648

ISK 142.062532

JEP 0.856626

JMD 182.092084

JOD 0.817993

JPY 181.627012

KES 148.781703

KGS 100.888291

KHR 4660.799756

KMF 492.614593

KRW 1664.690302

KWD 0.356644

KYD 0.958511

KZT 545.026239

LAK 26048.298174

LBP 103004.479089

LKR 386.12442

LRD 207.612377

LSL 19.024918

LTL 3.406468

LVL 0.69784

LYD 7.359084

MAD 10.744823

MDL 20.10023

MGA 4919.056308

MKD 61.525704

MMK 2422.354323

MNT 4147.524662

MOP 9.292006

MRU 46.226529

MUR 54.222569

MVR 17.836069

MWK 1994.42283

MXN 20.013177

MYR 4.712949

MZN 73.731051

NAD 19.024918

NGN 1574.012366

NIO 42.330485

NOK 10.926464

NPR 175.442008

NZD 1.961512

OMR 0.443645

PAB 1.150213

PEN 3.898045

PGK 5.150059

PHP 70.667684

PKR 319.439457

PLN 4.312336

PYG 6858.078504

QAR 4.204648

RON 5.247557

RSD 117.498334

RUB 91.443047

RWF 1688.519328

SAR 4.320672

SBD 9.322873

SCR 15.583178

SDG 692.198315

SEK 10.985993

SGD 1.479693

SLE 28.499711

SOS 657.307524

SRD 43.587131

STD 23878.499126

STN 24.50028

SVC 10.064115

SZL 19.022218

THB 38.676603

TJS 10.616322

TMT 4.049358

TND 3.381339

TRY 54.813428

TTD 7.810089

TWD 37.273824

TZS 3047.993231

UAH 51.336988

UGX 4319.04944

USD 1.153663

UYU 46.28053

UZS 13768.157604

VES 860.282504

VND 30341.919148

VUV 137.643052

WST 3.154711

XAF 655.964509

XAG 0.020031

XAU 0.000285

XCD 3.117833

XCG 2.072924

XDR 0.815809

XOF 655.964509

XPF 119.331742

YER 274.922082

ZAR 19.102044

ZMK 10384.357384

ZMW 21.606247

ZWL 371.479082

CMSC

0.0300

21.84

+0.14%
BCC

1.0000

76.38

+1.31%
NGG

-0.4200

79.97

-0.53%
RBGPF

0.0000

69.21

0%
RIO

-0.3300

96.85

-0.34%
JRI

0.0900

12.96

+0.69%
BCE

-0.0200

21.68

-0.09%
RELX

-1.1900

35.42

-3.36%
RYCEF

-0.3100

19.55

-1.59%
CMSD

0.0900

22.11

+0.41%
VOD

-0.3600

15.78

-2.28%
GSK

-0.3800

51.69

-0.74%
AZN

-1.7000

169.64

-1%
BTI

-1.0400

60.65

-1.71%
BP

1.0000

45.22

+2.21%

Anthropic's Claude AI gets smarter -- and mischievious / Photo: Julie JAMMOT - AFP

Anthropic's Claude AI gets smarter -- and mischievious

TECHNOLOGY 23.05.2025

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

Text size:

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The start-up, with Amazon as a significant backer, is valued at over $61 billion, and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers’ intentions,” The Apollo Research team warned.

“All these attempts would likely not have been effective in practice,” it added.

Anthropic says in the report that it implemented “safeguards” and “additional monitoring of harmful behavior” in the version that it released.

Still, Claude Opus 4 “sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down.”

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is on AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called “artificial general intelligence” (capable of human-level thinking) would arrive within 2-3 years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written", Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a “huge amount of inequality,” with it up to society how evenly wealth is distributed, Amodei reasoned.

(H.Schneide--BBZ)

Berliner Boersenzeitung - Anthropic's Claude AI gets smarter -- and mischievious

Anthropic's Claude AI gets smarter -- and mischievious

Featured

Warsaw and Kyiv exhume Volyn victims at centre of diplomatic quarrel

Japan probe made closest-ever asteroid flyby: space agency

Anthropic's models gained unauthorized 'real-world' access during testing

Apple tops estimates in CEO Cook's final quarter, but shares fall