[Question] Is a random box of gas predictable after 20 seconds?

Jan 24, 2024, 11:00 PM
37 points
35 comments · 1 min read · LW link

[Question] Will quantum randomness affect the 2028 election?

Jan 24, 2024, 10:54 PM
66 points
52 comments · 1 min read · LW link

AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes

Jan 24, 2024, 7:38 PM
27 points
1 comment · 6 min read · LW link
(newsletter.safe.ai)

Krueger Lab AI Safety Internship 2024

Joey Bream · Jan 24, 2024, 7:17 PM
3 points
0 comments · 1 min read · LW link

Agents that act for reasons: a thought experiment

Michele Campolo · Jan 24, 2024, 4:47 PM
3 points
0 comments · 3 min read · LW link

Impact Assessment of AI Safety Camp (Arb Research)

Samuel Holton · Jan 24, 2024, 4:19 PM
10 points
0 comments · 11 min read · LW link
(forum.effectivealtruism.org)

The case for ensuring that powerful AIs are controlled

Jan 24, 2024, 4:11 PM
276 points
73 comments · 28 min read · LW link

LLMs can strategically deceive while doing gain-of-function research

Igor Ivanov · Jan 24, 2024, 3:45 PM
36 points
4 comments · 11 min read · LW link

Monthly Roundup #14: January 2024

Zvi · Jan 24, 2024, 12:50 PM
38 points
22 comments · 44 min read · LW link
(thezvi.wordpress.com)

This might be the last AI Safety Camp

Jan 24, 2024, 9:33 AM
196 points
34 comments · 1 min read · LW link

Global LessWrong/AC10 Meetup on VRChat

Jan 24, 2024, 5:44 AM
15 points
2 comments · 1 min read · LW link

Humans aren’t fleeb.

Charlie Steiner · Jan 24, 2024, 5:31 AM
37 points
5 comments · 2 min read · LW link

A Paradigm Shift in Sustainability

Jose Miguel Cruz y Celis · Jan 23, 2024, 11:34 PM
5 points
0 comments · 18 min read · LW link

From Finite Factors to Bayes Nets

J Bostock · Jan 23, 2024, 8:03 PM
38 points
7 comments · 8 min read · LW link

Institutional economics through the lens of scale-free regulative development, morphogenesis, and cognitive science

Roman Leventov · Jan 23, 2024, 7:42 PM
8 points
0 comments · 14 min read · LW link

Making a Secular Solstice Songbook

jefftk · Jan 23, 2024, 7:40 PM
38 points
6 comments · 1 min read · LW link
(www.jefftk.com)

Simple Appreciations

Jonathan Moregård · Jan 23, 2024, 4:23 PM
17 points
11 comments · 4 min read · LW link
(open.substack.com)

[Question] What environmental cues had you not seen them would have ended in disaster?

koratkar · Jan 23, 2024, 2:59 PM
11 points
1 comment · 1 min read · LW link

Loneliness and suicide mitigation for students using GPT3-enabled chatbots (survey of Replika users in Nature)

Kaj_Sotala · Jan 23, 2024, 2:05 PM
45 points
2 comments · 2 min read · LW link
(www.nature.com)

“Safety as a Scientific Pursuit” (2024)

technicalities · Jan 23, 2024, 12:40 PM
17 points
3 comments · 2 min read · LW link
(banburismus.substack.com)

Brainstorming: Slow Takeoff

David Piepgrass · Jan 23, 2024, 6:58 AM
3 points
0 comments · 51 min read · LW link

Reframing Acausal Trolling as Acausal Patronage

StrivingForLegibility · Jan 23, 2024, 3:04 AM
14 points
0 comments · 2 min read · LW link

Orthogonality or the “Human Worth Hypothesis”?

Jeffs · Jan 23, 2024, 12:57 AM
21 points
31 comments · 3 min read · LW link

the subreddit size threshold

bhauth · Jan 23, 2024, 12:38 AM
32 points
3 comments · 4 min read · LW link
(www.bhauth.com)

Starting in mechanistic interpretability

Jakub Smékal · Jan 22, 2024, 11:40 PM
1 point
0 comments · 3 min read · LW link
(jakubsmekal.com)

We need a Science of Evals

Jan 22, 2024, 8:30 PM
71 points
13 comments · 9 min read · LW link

Announcing the SoS Research Collective for independent researchers (and academics thinking independently)

rogersbacon · Jan 22, 2024, 8:13 PM
15 points
0 comments · 8 min read · LW link
(www.theseedsofscience.pub)

A Brief Assessment of OpenAI’s Preparedness Framework & Some Suggestions for Improvement

simeon_c · Jan 22, 2024, 8:08 PM
14 points
0 comments · 6 min read · LW link
(uploads-ssl.webflow.com)

D&D.Sci(-fi): Colonizing the SuperHyperSphere [Evaluation and Ruleset]

abstractapplic · Jan 22, 2024, 7:20 PM
40 points
7 comments · 3 min read · LW link

′ petertodd’’s last stand: The final days of open GPT-3 research

mwatkins · Jan 22, 2024, 6:47 PM
109 points
16 comments · 45 min read · LW link

InterLab – a toolkit for experiments with multi-agent interactions

Jan 22, 2024, 6:23 PM
69 points
0 comments · 8 min read · LW link
(acsresearch.org)

San Fernando Valley Rationalist Meetup

Thomas Broadley · Jan 22, 2024, 4:49 PM
3 points
1 comment · 1 min read · LW link

Who Organizes Dances?

jefftk · Jan 22, 2024, 2:30 PM
12 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Values Darwinism

pchvykov · Jan 22, 2024, 10:44 AM
11 points
13 comments · 3 min read · LW link

[Question] The akrasia doom loop and executive function disorders: a question

TeaTieAndHat · Jan 22, 2024, 7:01 AM
18 points
7 comments · 2 min read · LW link

Predicting AGI by the Turing Test

Yuxi_Liu · Jan 22, 2024, 4:22 AM
21 points
2 comments · 10 min read · LW link
(yuxi-liu-wired.github.io)

Incorporating Justice Theory into Decision Theory

StrivingForLegibility · Jan 21, 2024, 7:17 PM
13 points
20 comments · 5 min read · LW link

Deliberate Dysentery: Q&A about Human Challenge Trials

Niko_McCarty · Jan 21, 2024, 7:05 PM
16 points
1 comment · 18 min read · LW link
(www.asimov.press)

When Does Altruism Strengthen Altruism?

jefftk · Jan 21, 2024, 6:50 PM
44 points
2 comments · 3 min read · LW link
(www.jefftk.com)

A Shutdown Problem Proposal

Jan 21, 2024, 6:12 PM
125 points
61 comments · 6 min read · LW link

Is principled mass-outreach possible, for AGI X-risk?

Nicholas / Heather Kross · Jan 21, 2024, 5:45 PM
9 points
5 comments · 3 min read · LW link

Vacuum: Theory and Technologies

nomagicpill · Jan 21, 2024, 5:23 PM
33 points
0 comments · 25 min read · LW link
(210ethan.github.io)

Another Non-Anthropic Paradox: The Unsurprising Rareness of Rare Events

Ape in the coat · Jan 21, 2024, 3:58 PM
19 points
16 comments · 6 min read · LW link

Book review: Cuisine and Empire

eukaryote · Jan 21, 2024, 6:15 AM
40 points
2 comments · 12 min read · LW link
(eukaryotewritesblog.com)

Catalogue of POLITICO Reports and Other Cited Articles on Effective Altruism and AI Safety Connections in Washington, DC

Evan_Gaensbauer · Jan 21, 2024, 2:15 AM
4 points
0 comments · LW link
(docs.google.com)

You can rack up massive amounts of data quickly by asking questions to all your friends

Neil · Jan 21, 2024, 1:27 AM
14 points
2 comments · 2 min read · LW link

[Question] Party for biomedical rejuvenation research: European parliament elections

Iakov Dudinsky · Jan 21, 2024, 12:35 AM
1 point
0 comments · 1 min read · LW link

[Question] Why have insurance markets succeeded where prediction markets have not?

JNank · Jan 21, 2024, 12:35 AM
13 points
13 comments · 1 min read · LW link

[linkpost] Self-Rewarding Language Models

Jacob G-W · Jan 21, 2024, 12:30 AM
13 points
2 comments · 1 min read · LW link
(arxiv.org)

Why Improving Dialogue Feels So Hard

matto · Jan 20, 2024, 9:26 PM
21 points
8 comments · 3 min read · LW link