[Question] Up­com­ing un­am­bigu­ously good tech pos­si­bil­ities? (Like eg in­door plumb­ing)

lukehmiles11 Apr 2024 23:14 UTC
9 points
7 comments1 min readLW link

Leave No Con­text Be­hind—A Comment

Gunnar_Zarncke11 Apr 2024 22:50 UTC
17 points
0 comments2 min readLW link

AXRP Epi­sode 27 - AI Con­trol with Buck Sh­legeris and Ryan Greenblatt

DanielFilan11 Apr 2024 21:30 UTC
69 points
10 comments107 min readLW link

ChatGPT defines 10 con­crete terms: gener­i­cally, for 5- and 11-year-olds, and for a sci­en­tist

Bill Benzon11 Apr 2024 20:27 UTC
3 points
9 comments6 min readLW link

Ack­shually, many wor­lds is wrong

tailcalled11 Apr 2024 20:23 UTC
28 points
42 comments4 min readLW link

[Question] What does Eliezer Yud­kowsky think of the mean­ing of life now?

metaqualia11 Apr 2024 18:36 UTC
−7 points
3 comments1 min readLW link

Struc­tured Trans­parency: a frame­work for ad­dress­ing use/​mis-use trade-offs when shar­ing information

habryka11 Apr 2024 18:35 UTC
23 points
0 comments2 min readLW link
(arxiv.org)

Deon­tic Ex­plo­ra­tions In “Pay­ing To Talk To Slaves”

JenniferRM11 Apr 2024 18:23 UTC
4 points
10 comments31 min readLW link

Ex­pe­rience Re­port—ML4Good AI Safety Bootcamp

Kieron Kretschmar11 Apr 2024 18:03 UTC
28 points
0 comments4 min readLW link

A Gen­tle In­tro­duc­tion to Risk Frame­works Beyond Forecasting

pendingsurvival11 Apr 2024 18:03 UTC
73 points
10 comments27 min readLW link

[Question] AGI com­pute allocation

janna11 Apr 2024 18:02 UTC
−11 points
0 comments1 min readLW link

Boundaries Up­date #1

Chipmonk11 Apr 2024 16:07 UTC
2 points
2 comments1 min readLW link
(formalizingboundaries.substack.com)

An­nounc­ing At­las Computing

miyazono11 Apr 2024 15:56 UTC
44 points
4 comments4 min readLW link

AI #59: Model Updates

Zvi11 Apr 2024 14:20 UTC
30 points
2 comments63 min readLW link
(thezvi.wordpress.com)

The the­ory of Prox­i­mal Policy Op­ti­mi­sa­tion implementations

salman.mohammadi11 Apr 2024 13:00 UTC
3 points
1 comment6 min readLW link
(salmanmohammadi.github.io)

[Question] Things can be difficult in 3 ways: Pain­ful, time-con­sum­ing, or un­con­trol­lable. Is this rea­son­able to say?

SpectrumDT11 Apr 2024 12:50 UTC
2 points
4 comments1 min readLW link

Learn­ing Writ­ten Hindi From Scratch

Morpheus11 Apr 2024 11:13 UTC
3 points
0 comments2 min readLW link

[Question] What is the best AI gen­er­ated mu­sic about ra­tio­nal­ity/​ai/​tran­shu­man­ism?

Nathan Young11 Apr 2024 9:34 UTC
4 points
2 comments1 min readLW link

[Question] Work ethic af­ter 2020?

TeaTieAndHat11 Apr 2024 3:32 UTC
15 points
11 comments1 min readLW link

Repug­nance and replacement

MichaelStJules11 Apr 2024 2:41 UTC
2 points
0 comments1 min readLW link

Re­v­erse Reg­u­la­tory Capture

Chris_Leong11 Apr 2024 2:40 UTC
12 points
3 comments1 min readLW link

[Question] Is LLM Trans­la­tion Without Rosetta Stone pos­si­ble?

cubefox11 Apr 2024 0:36 UTC
14 points
14 comments1 min readLW link

[Question] What should we tell an AI if it asks why it was cre­ated?

cSkeleton10 Apr 2024 20:37 UTC
1 point
1 comment1 min readLW link

RTFB: On the New Pro­posed CAIP AI Bill

Zvi10 Apr 2024 18:30 UTC
119 points
14 comments34 min readLW link
(thezvi.wordpress.com)

(Ra­tional) De­ci­sion-Mak­ing In Wartime

Danylo Zhyrko10 Apr 2024 18:08 UTC
15 points
2 comments5 min readLW link

Think­ing harder doesn’t work

Jakob Greenfeld10 Apr 2024 18:00 UTC
−9 points
6 comments6 min readLW link
(jakobgreenfeld.com)

Scal­ing Laws and Superposition

Pavan Katta10 Apr 2024 15:36 UTC
8 points
4 comments5 min readLW link
(www.pavankatta.com)

Re­spon­si­ble Ad­vanced Ar­tifi­cial In­tel­li­gence Act

Anon4210 Apr 2024 14:35 UTC
4 points
0 comments1 min readLW link
(assets.caip.org)

Ap­ply to the Pivotal Re­search Fel­low­ship (AI Safety & Biose­cu­rity)

10 Apr 2024 12:08 UTC
18 points
0 comments1 min readLW link

Is Con­scious­ness Si­mu­lated?

Daniele De Nuntiis10 Apr 2024 9:02 UTC
−1 points
2 comments5 min readLW link

AI DOESN’T NEED TO KILL HUMANITY TO EXIST. IT WILL JUST SEE US IMPLODE. OR NOT. [2024]

X O10 Apr 2024 8:52 UTC
−37 points
0 comments27 min readLW link

How I se­lect al­ign­ment re­search projects

10 Apr 2024 4:33 UTC
34 points
4 comments24 min readLW link

[Question] How to ac­cel­er­ate re­cov­ery from sleep debt with bio­hack­ing?

exanova10 Apr 2024 1:27 UTC
10 points
2 comments1 min readLW link

[Question] What are some posthu­man­ist/​more-than-hu­man ap­proaches to defi­ni­tions of in­tel­li­gence and agency? Par­tic­u­larly in ap­pli­ca­tion to AI re­search.

Eli Hiton9 Apr 2024 21:52 UTC
1 point
0 comments1 min readLW link

Ophiol­ogy (or, how the Mamba ar­chi­tec­ture works)

9 Apr 2024 19:31 UTC
66 points
8 comments10 min readLW link

Ap­ply to LASR Labs: a Lon­don-based tech­ni­cal AI safety re­search programme

9 Apr 2024 17:34 UTC
44 points
1 comment3 min readLW link

“De­cen­tral­ized Au­tonomous Ed­u­ca­tion”—Call for Re­view­ers (Seeds of Science)

rogersbacon9 Apr 2024 14:39 UTC
6 points
0 comments1 min readLW link

D&D.Sci: The Mad Tyrant’s Pet Tur­tles [Eval­u­a­tion and Rule­set]

abstractapplic9 Apr 2024 14:01 UTC
44 points
6 comments3 min readLW link

Med­i­cal Roundup #2

Zvi9 Apr 2024 13:40 UTC
37 points
18 comments16 min readLW link
(thezvi.wordpress.com)

PIBBSS is hiring in a va­ri­ety of roles (al­ign­ment re­search and in­cu­ba­tion pro­gram)

9 Apr 2024 8:12 UTC
54 points
0 comments3 min readLW link

Any ev­i­dence or rea­son to ex­pect a mul­ti­verse /​ Everett branches?

lukehmiles9 Apr 2024 5:26 UTC
12 points
122 comments1 min readLW link

Fer­ment­ing Form

koratkar9 Apr 2024 2:46 UTC
19 points
2 comments4 min readLW link
(careerscouting.substack.com)

[Question] Non-ul­ti­ma­tum game problem

numpyNaN8 Apr 2024 23:25 UTC
9 points
4 comments2 min readLW link

Pan­demic Iden­ti­fi­ca­tion Simulator

jefftk8 Apr 2024 19:00 UTC
22 points
0 comments1 min readLW link
(www.jefftk.com)

How We Pic­ture Bayesian Agents

8 Apr 2024 18:12 UTC
62 points
14 comments7 min readLW link

CEA seeks co-founder for AI safety group sup­port spin-off

agucova8 Apr 2024 15:42 UTC
18 points
0 comments1 min readLW link

In­ves­ti­gat­ing the role of agency in AI x-risk

Corin Katzke8 Apr 2024 15:12 UTC
10 points
0 comments1 min readLW link

Mea­sur­ing Learned Op­ti­miza­tion in Small Trans­former Models

J Bostock8 Apr 2024 14:41 UTC
22 points
0 comments11 min readLW link

[Question] Can sin­gu­lar­ity emerge from trans­form­ers?

MP8 Apr 2024 14:26 UTC
−3 points
1 comment1 min readLW link

Gated At­ten­tion Blocks: Pre­limi­nary Progress to­ward Re­mov­ing At­ten­tion Head Superposition

8 Apr 2024 11:14 UTC
36 points
4 comments15 min readLW link