[Question] Non-ultimatum game problem

numpyNaN 8 Apr 2024 23:25 UTC
9 points
4 comments 2 min read LW link

Pandemic Identification Simulator

jefftk 8 Apr 2024 19:00 UTC
22 points
0 comments 1 min read LW link
(www.jefftk.com)

How We Picture Bayesian Agents

8 Apr 2024 18:12 UTC
62 points
14 comments 7 min read LW link

CEA seeks co-founder for AI safety group support spin-off

agucova 8 Apr 2024 15:42 UTC
18 points
0 comments 1 min read LW link

Investigating the role of agency in AI x-risk

Corin Katzke 8 Apr 2024 15:12 UTC
10 points
0 comments 1 min read LW link

Measuring Learned Optimization in Small Transformer Models

J Bostock 8 Apr 2024 14:41 UTC
22 points
0 comments 11 min read LW link

[Question] Can singularity emerge from transformers?

MP 8 Apr 2024 14:26 UTC
−3 points
1 comment 1 min read LW link

Gated Attention Blocks: Preliminary Progress toward Removing Attention Head Superposition

8 Apr 2024 11:14 UTC
36 points
4 comments 15 min read LW link

Math-to-English Cheat Sheet

nahoj 8 Apr 2024 9:19 UTC
54 points
5 comments 6 min read LW link

[Question] What does it take to transfer the knowledge to action?

EL_File4138 8 Apr 2024 6:23 UTC
3 points
7 comments 1 min read LW link

Normalizing Sparse Autoencoders

Fengyuan Hu 8 Apr 2024 6:17 UTC
13 points
17 comments 13 min read LW link

A Dozen Ways to Get More Dakka

Davidmanheim 8 Apr 2024 4:45 UTC
121 points
10 comments 3 min read LW link

[Crosspost] Introducing the Hypermanifest: Redefining AI’s Role in Human Connection and Interaction

Suzie. EXE 7 Apr 2024 17:21 UTC
4 points
0 comments 5 min read LW link

On hiatus

SashaWu 7 Apr 2024 16:21 UTC
3 points
0 comments 1 min read LW link

Applications Open: Elevate Your Mental Wellbeing with Rethink Wellbeing’s CBT Program

Inga G. 7 Apr 2024 14:03 UTC
13 points
2 comments 1 min read LW link

The Poker Theory of Poker Night

omark 7 Apr 2024 9:47 UTC
29 points
13 comments 9 min read LW link
(www.codeandbugs.com)

Centrists are (probably) less biased

Kevin Dorst 7 Apr 2024 6:40 UTC
1 point
2 comments 5 min read LW link
(kevindorst.substack.com)

on the dollar-yen exchange rate

bhauth 7 Apr 2024 4:49 UTC
50 points
21 comments 10 min read LW link
(www.bhauth.com)

Conflict in Posthuman Literature

Martín Soto 6 Apr 2024 22:26 UTC
39 points
1 comment 2 min read LW link
(twitter.com)

“Fractal Strategy” workshop report

Raemon 6 Apr 2024 21:26 UTC
54 points
18 comments 10 min read LW link

The 2nd Demographic Transition

Maxwell Tabarrok 6 Apr 2024 14:10 UTC
68 points
17 comments 4 min read LW link
(www.maximum-progress.com)

My intellectual journey to (dis)solve the hard problem of consciousness

Charbel-Raphaël 6 Apr 2024 9:32 UTC
42 points
41 comments 30 min read LW link

Measuring Predictability of Persona Evaluations

6 Apr 2024 8:46 UTC
19 points
0 comments 7 min read LW link

Privacy and writing

Neil 6 Apr 2024 8:20 UTC
20 points
1 comment 5 min read LW link

[Question] How does the ever-increasing use of AI in the military for the direct purpose of murdering people affect your p(doom)?

Justausername 6 Apr 2024 6:31 UTC
19 points
16 comments 1 min read LW link

Two tools for rethinking existential risk

Arepo 6 Apr 2024 2:55 UTC
2 points
0 comments 25 min read LW link

Exploring Whole Brain Emulation

PeterMcCluskey 6 Apr 2024 2:38 UTC
12 points
1 comment 2 min read LW link
(bayesianinvestor.com)

Koan: divining alien datastructures from RAM activations

TsviBT 5 Apr 2024 18:04 UTC
32 points
0 comments 21 min read LW link

On the 2nd CWT with Jonathan Haidt

Zvi 5 Apr 2024 17:30 UTC
29 points
3 comments 33 min read LW link
(thezvi.wordpress.com)

End-to-end hacking with language models

tchauvin 5 Apr 2024 15:06 UTC
23 points
0 comments 8 min read LW link

Partial value takeover without world takeover

KatjaGrace 5 Apr 2024 6:20 UTC
89 points
23 comments 3 min read LW link
(worldspiritsockpuppet.com)

On Complexity Science

Garrett Baker 5 Apr 2024 2:24 UTC
50 points
19 comments 4 min read LW link

Using game theory to elect a centrist in the 2024 US Presidential Election

Ebenezer Dukakis 5 Apr 2024 0:46 UTC
−3 points
0 comments 8 min read LW link

New report: A review of the empirical evidence for existential risk from AI via misaligned power-seeking

4 Apr 2024 23:41 UTC
30 points
5 comments 1 min read LW link
(blog.aiimpacts.org)

Quick evidence review of bulking & cutting

jp 4 Apr 2024 21:43 UTC
31 points
5 comments 4 min read LW link

LLMs for Alignment Research: a safety priority?

abramdemski 4 Apr 2024 20:03 UTC
142 points
24 comments 11 min read LW link

Seeking uniqueness where design flourish

Itay Dreyfus 4 Apr 2024 19:12 UTC
2 points
0 comments 3 min read LW link
(productidentity.co)

On Leif Wenar’s Absurdly Unconvincing Critique Of Effective Altruism

omnizoid 4 Apr 2024 19:01 UTC
8 points
2 comments 14 min read LW link

Run evals on base models too!

orthonormal 4 Apr 2024 18:43 UTC
47 points
6 comments 1 min read LW link

Let’s Fund: Impact of our $1M crowdfunded grant to the Center for Clean Energy Innovation

Hauke Hillebrandt 4 Apr 2024 16:28 UTC
5 points
0 comments 1 min read LW link
(lets-fund.org)

The Buck­ling World Hy­poth­e­sis—Vi­su­al­is­ing Vuln­er­a­ble Worlds

Rosco-Hunter 4 Apr 2024 15:51 UTC
−5 points
2 comments 4 min read LW link

Can AI Transform the Electorate into a Citizen’s Assembly?

Rosco-Hunter 4 Apr 2024 15:45 UTC
−6 points
0 comments 4 min read LW link

AI Discrimination Requirements: A Regulatory Review

4 Apr 2024 15:43 UTC
7 points
0 comments 6 min read LW link

Trying to Do More Good

jefftk 4 Apr 2024 14:20 UTC
18 points
0 comments 12 min read LW link
(www.jefftk.com)

Language and Capabilities: Testing LLM Mathematical Abilities Across Languages

Ethan Edwards 4 Apr 2024 13:18 UTC
21 points
1 comment 36 min read LW link

AI #58: Stargate AGI

Zvi 4 Apr 2024 13:10 UTC
49 points
9 comments 60 min read LW link
(thezvi.wordpress.com)

Cult of equilibrium

Templarrr 4 Apr 2024 9:19 UTC
11 points
2 comments 1 min read LW link

[Question] Should you refuse this bet in Technicolor Sleeping Beauty?

Ape in the coat 4 Apr 2024 8:55 UTC
14 points
14 comments 1 min read LW link

[Question] What’s with all the bans recently?

Gerald Monroe 4 Apr 2024 6:16 UTC
65 points
83 comments 4 min read LW link

Fun Things to Think About Designs For: Search Engine Alternatives

Jacob Watts 4 Apr 2024 3:00 UTC
2 points
0 comments 2 min read LW link