Berliner Boersenzeitung - Inner workings of AI an enigma - even to its creators

EUR -
AED 4.093938
AFN 78.583086
ALL 98.028692
AMD 430.600233
ANG 1.994759
AOA 1022.079983
ARS 1273.430123
AUD 1.741515
AWG 2.00905
AZN 1.899229
BAM 1.94552
BBD 2.249414
BDT 135.364744
BGN 1.956714
BHD 0.420123
BIF 3271.32339
BMD 1.114591
BND 1.446656
BOB 7.698323
BRL 6.321517
BSD 1.114113
BTN 95.244734
BWP 15.065396
BYN 3.645935
BYR 21845.97562
BZD 2.237875
CAD 1.559224
CDF 3199.989995
CHF 0.935258
CLF 0.027458
CLP 1053.70095
CNY 8.035645
CNH 8.038634
COP 4662.053802
CRC 564.318188
CUC 1.114591
CUP 29.536651
CVE 110.906104
CZK 24.903343
DJF 198.085479
DKK 7.461114
DOP 65.653715
DZD 148.43807
EGP 55.871534
ERN 16.718859
ETB 147.687571
FJD 2.53497
FKP 0.838643
GBP 0.839916
GEL 3.054414
GGP 0.838643
GHS 13.765629
GIP 0.838643
GMD 80.81211
GNF 9646.781977
GTQ 8.553802
GYD 233.08838
HKD 8.709991
HNL 28.97975
HRK 7.536532
HTG 145.779712
HUF 402.65743
IDR 18381.159303
ILS 3.965402
IMP 0.838643
INR 95.414086
IQD 1460.113677
IRR 46938.200596
ISK 145.92263
JEP 0.838643
JMD 177.601568
JOD 0.790584
JPY 162.626614
KES 144.061263
KGS 97.471376
KHR 4480.654574
KMF 492.095975
KPW 1003.1886
KRW 1560.505279
KWD 0.342741
KYD 0.928494
KZT 568.03853
LAK 24097.449007
LBP 99811.587981
LKR 333.35856
LRD 222.528437
LSL 20.152223
LTL 3.291097
LVL 0.674205
LYD 6.147011
MAD 10.374056
MDL 19.407453
MGA 5055.783316
MKD 61.538345
MMK 2340.055112
MNT 3992.834027
MOP 8.968014
MRU 44.193939
MUR 51.394194
MVR 17.231992
MWK 1933.815063
MXN 21.719028
MYR 4.788324
MZN 71.226495
NAD 20.152218
NGN 1785.931219
NIO 40.961624
NOK 11.595348
NPR 152.391774
NZD 1.896961
OMR 0.429073
PAB 1.114113
PEN 4.107496
PGK 4.533876
PHP 62.209206
PKR 313.72729
PLN 4.265005
PYG 8894.999537
QAR 4.060644
RON 5.107393
RSD 116.613822
RUB 90.282633
RWF 1581.046756
SAR 4.180621
SBD 9.296163
SCR 16.161751
SDG 669.315748
SEK 10.911162
SGD 1.449007
SHP 0.875893
SLE 25.305293
SLL 23372.407676
SOS 636.992606
SRD 40.7734
STD 23069.774923
SVC 9.74849
SYP 14491.834225
SZL 20.15221
THB 37.238883
TJS 11.486208
TMT 3.90664
TND 3.365399
TOP 2.610487
TRY 43.296314
TTD 7.557069
TWD 33.726439
TZS 3006.612171
UAH 46.245634
UGX 4076.460311
USD 1.114591
UYU 46.354857
UZS 14420.01983
VES 105.001372
VND 28891.860053
VUV 133.745898
WST 3.094337
XAF 652.509194
XAG 0.034583
XAU 0.000349
XCD 3.012237
XDR 0.81882
XOF 641.450893
XPF 119.331742
YER 272.075566
ZAR 20.132906
ZMK 10032.656842
ZMW 29.946764
ZWL 358.897716
  • CMSC

    -0.0450

    22.055

    -0.2%

  • RIO

    -0.2000

    62.55

    -0.32%

  • BTI

    1.2680

    42.638

    +2.97%

  • NGG

    1.2100

    71.24

    +1.7%

  • CMSD

    0.0672

    22.08

    +0.3%

  • BP

    0.1550

    29.785

    +0.52%

  • RYCEF

    -0.0900

    10.7

    -0.84%

  • RBGPF

    1.5000

    64.5

    +2.33%

  • GSK

    0.3941

    37.535

    +1.05%

  • AZN

    0.7400

    68.7

    +1.08%

  • BCC

    0.8950

    91.885

    +0.97%

  • SCS

    0.0250

    10.525

    +0.24%

  • VOD

    0.1650

    9.435

    +1.75%

  • JRI

    0.0900

    12.83

    +0.7%

  • RELX

    0.4350

    54.475

    +0.8%

  • BCE

    -0.0350

    21.595

    -0.16%

Inner workings of AI an enigma - even to its creators
Inner workings of AI an enigma - even to its creators / Photo: Kirill KUDRYAVTSEV - AFP

Inner workings of AI an enigma - even to its creators

Even the greatest human minds building generative artificial intelligence that is poised to change the world admit they do not comprehend how digital minds think.

Text size:

"People outside the field are often surprised and alarmed to learn that we do not understand how our own AI creations work," Anthropic co-founder Dario Amodei wrote in an essay posted online in April.

"This lack of understanding is essentially unprecedented in the history of technology."

Unlike traditional software programs that follow pre-ordained paths of logic dictated by programmers, generative AI (gen AI) models are trained to find their own way to success once prompted.

In a recent podcast Chris Olah, who was part of ChatGPT-maker OpenAI before joining Anthropic, described gen AI as "scaffolding" on which circuits grow.

Olah is considered an authority in so-called mechanistic interpretability, a method of reverse engineering AI models to figure out how they work.

This science, born about a decade ago, seeks to determine exactly how AI gets from a query to an answer.

"Grasping the entirety of a large language model is an incredibly ambitious task," said Neel Nanda, a senior research scientist at the Google DeepMind AI lab.

It was "somewhat analogous to trying to fully understand the human brain," Nanda added to AFP, noting neuroscientists have yet to succeed on that front.

Delving into digital minds to understand their inner workings has gone from a little-known field just a few years ago to being a hot area of academic study.

"Students are very much attracted to it because they perceive the impact that it can have," said Boston University computer science professor Mark Crovella.

The area of study is also gaining traction due to its potential to make gen AI even more powerful, and because peering into digital brains can be intellectually exciting, the professor added.

- Keeping AI honest -

Mechanistic interpretability involves studying not just results served up by gen AI but scrutinizing calculations performed while the technology mulls queries, according to Crovella.

"You could look into the model...observe the computations that are being performed and try to understand those," the professor explained.

Startup Goodfire uses AI software capable of representing data in the form of reasoning steps to better understand gen AI processing and correct errors.

The tool is also intended to prevent gen AI models from being used maliciously or from deciding on their own to deceive humans about what they are up to.

"It does feel like a race against time to get there before we implement extremely intelligent AI models into the world with no understanding of how they work," said Goodfire chief executive Eric Ho.

In his essay, Amodei said recent progress has made him optimistic that the key to fully deciphering AI will be found within two years.

"I agree that by 2027, we could have interpretability that reliably detects model biases and harmful intentions," said Auburn University associate professor Anh Nguyen.

According to Boston University's Crovella, researchers can already access representations of every digital neuron in AI brains.

"Unlike the human brain, we actually have the equivalent of every neuron instrumented inside these models", the academic said. "Everything that happens inside the model is fully known to us. It's a question of discovering the right way to interrogate that."

Harnessing the inner workings of gen AI minds could clear the way for its adoption in areas where tiny errors can have dramatic consequences, like national security, Amodei said.

For Nanda, better understanding what gen AI is doing could also catapult human discoveries, much like DeepMind's chess-playing AI, AlphaZero, revealed entirely new chess moves that none of the grand masters had ever thought about.

Properly understood, a gen AI model with a stamp of reliability would grab competitive advantage in the market.

Such a breakthrough by a US company would also be a win for the nation in its technology rivalry with China.

"Powerful AI will shape humanity's destiny," Amodei wrote.

"We deserve to understand our own creations before they radically transform our economy, our lives, and our future."

(A.Berg--BBZ)