Berliner Boersenzeitung - Anthropic's Claude AI gets smarter -- and mischievous

Anthropic's Claude AI gets smarter -- and mischievous / Photo: Julie JAMMOT - AFP

Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.

"Claude Opus 4 is our most powerful model yet, and the best coding model in the world," Anthropic chief executive Dario Amodei said at the San Francisco-based startup's first developers conference.

Opus 4 and Sonnet 4 were described as "hybrid" models capable of quick responses as well as more thoughtful results that take a little time to get things right.

Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.

Unlike ChatGPT and Google's Gemini, its Claude chatbot does not generate images, and is very limited when it comes to multimodal functions (understanding and generating different media, such as sound or video).

The startup, which counts Amazon as a significant backer, is valued at over $61 billion and promotes the responsible and competitive development of generative AI.

Under that dual mantra, Anthropic's commitment to transparency is rare in Silicon Valley.

On Thursday, the company published a report on the security tests carried out on Claude 4, including the conclusions of an independent research institute, which had recommended against deploying an early version of the model.

"We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers' intentions," the Apollo Research team warned.

"All these attempts would likely not have been effective in practice," it added.

Anthropic says in the report that it implemented "safeguards" and "additional monitoring of harmful behavior" in the version that it released.

Still, Claude Opus 4 "sometimes takes extremely harmful actions like attempting to (…) blackmail people it believes are trying to shut it down."

It also has the potential to report law-breaking users to the police.

The scheming misbehavior was rare and took effort to trigger, but was more common than in earlier versions of Claude, according to the company.

- AI future -

Since OpenAI's ChatGPT burst onto the scene in late 2022, various GenAI models have been vying for supremacy.

Anthropic's gathering came on the heels of annual developer conferences from Google and Microsoft at which the tech giants showcased their latest AI innovations.

GenAI tools answer questions or tend to tasks based on simple, conversational prompts.

The current craze in Silicon Valley is for AI "agents" tailored to independently handle computer or online tasks.

"We're going to focus on agents beyond the hype," said Anthropic chief product officer Mike Krieger, a recent hire and co-founder of Instagram.

Anthropic is no stranger to hyping up the prospects of AI.

In 2023, Dario Amodei predicted that so-called "artificial general intelligence" (capable of human-level thinking) would arrive within two to three years. At the end of 2024, he extended this horizon to 2026 or 2027.

He also estimated that AI will soon be writing most, if not all, computer code, making possible one-person tech startups with digital agents cranking out the software.

At Anthropic, already "something like over 70 percent of (suggested modifications in the code) are now Claude Code written," Krieger told journalists.

"In the long term, we're all going to have to contend with the idea that everything humans do is eventually going to be done by AI systems," Amodei added.

"This will happen."

GenAI fulfilling its potential could lead to strong economic growth and a "huge amount of inequality," leaving it up to society to decide how evenly wealth is distributed, Amodei reasoned.

(H.Schneide--BBZ)