CCS: Counterfactual Civilization Simulation

Morphism 2 May 2024 22:54 UTC
3 points
0 comments 2 min read LW link

Let’s De­sign A School, Part 2.1 School as Ed­u­ca­tion—Structure

Sable 2 May 2024 22:04 UTC
26 points
3 comments 10 min read LW link
(affablyevil.substack.com)

Why I’m not doing PauseAI

kwiat.dev 2 May 2024 22:00 UTC
−8 points
5 comments 4 min read LW link

AI #61: Meta Trouble

Zvi 2 May 2024 18:40 UTC
29 points
0 comments 52 min read LW link
(thezvi.wordpress.com)

Why is AGI/ASI Inevitable?

DeathlessAmaranth 2 May 2024 18:27 UTC
14 points
6 comments 1 min read LW link

AISN #34: New Military AI Systems Plus, AI Labs Fail to Uphold Voluntary Commitments to UK AI Safety Institute, and New AI Policy Proposals in the US Senate

2 May 2024 16:12 UTC
6 points
0 comments 8 min read LW link
(newsletter.safe.ai)

Ai Salon: Trustworthy AI Futures #1

Ian Eisenberg 2 May 2024 16:07 UTC
1 point
0 comments 1 min read LW link

How to write Pseudocode and why you should

Johannes C. Mayer 2 May 2024 15:53 UTC
8 points
5 comments 3 min read LW link

AI #62: Too Soon to Tell

Zvi 2 May 2024 15:40 UTC
30 points
8 comments 31 min read LW link
(thezvi.wordpress.com)

Whiteboard Program Traceing: Debug a Program Before you have the Code

Johannes C. Mayer 2 May 2024 15:30 UTC
3 points
0 comments 1 min read LW link

[Question] Which skincare products are evidence-based?

Vanessa Kosoy 2 May 2024 15:22 UTC
123 points
48 comments 1 min read LW link

Q&A on Proposed SB 1047

Zvi 2 May 2024 15:10 UTC
74 points
8 comments 44 min read LW link
(thezvi.wordpress.com)

[Question] What are the Activities that make up your Research Process?

Johannes C. Mayer 2 May 2024 15:01 UTC
4 points
0 comments 1 min read LW link

[Question] How do you Select the Right Research Acitivity in the Right Moment?

Johannes C. Mayer 2 May 2024 14:45 UTC
6 points
1 comment 1 min read LW link

[Question] Can stealth aircraft be detected optically?

Yair Halberstadt 2 May 2024 7:47 UTC
20 points
28 comments 1 min read LW link

An explanation of evil in an organized world

KatjaGrace 2 May 2024 5:20 UTC
27 points
9 comments 2 min read LW link
(worldspiritsockpuppet.com)

Why I stopped working on AI safety

jbkjr 2 May 2024 5:08 UTC
−5 points
0 comments 4 min read LW link
(jbkjr.me)

[Linkpost] Silver Bulletin: For most people, politics is about fitting in

Gunnar_Zarncke 1 May 2024 18:12 UTC
18 points
4 comments 1 min read LW link
(www.natesilver.net)

Launching applications for AI Safety Careers Course India 2024

Axiom_Futures 1 May 2024 17:55 UTC
4 points
1 comment 1 min read LW link

[Question] Shane Legg’s necessary properties for every AGI Safety plan

jacquesthibs 1 May 2024 17:15 UTC
58 points
12 comments 1 min read LW link

KAN: Kolmogorov-Arnold Networks

Gunnar_Zarncke 1 May 2024 16:50 UTC
18 points
15 comments 1 min read LW link
(arxiv.org)

Manifund Q1 Retro: Learnings from impact certs

Austin Chen 1 May 2024 16:48 UTC
40 points
1 comment 15 min read LW link

ACX Covid Origins Post convinced readers

ErnestScribbler 1 May 2024 13:06 UTC
77 points
7 comments 2 min read LW link

LessWrong Community Weekend 2024, open for applications

1 May 2024 10:18 UTC
79 points
2 comments 7 min read LW link

Take SCIFs, it’s dangerous to go alone

1 May 2024 8:02 UTC
43 points
1 comment 3 min read LW link

AXRP Episode 30 - AI Security with Jeffrey Ladish

DanielFilan 1 May 2024 2:50 UTC
25 points
0 comments 79 min read LW link

Neuro/BCI/WBE for Safe AI Workshop

Allison Duettmann 1 May 2024 0:46 UTC
3 points
0 comments 1 min read LW link

AGI: Cryptography, Security & Multipolar Scenarios Workshop

Allison Duettmann 1 May 2024 0:42 UTC
8 points
1 comment 1 min read LW link

The formal goal is a pointer

Morphism 1 May 2024 0:27 UTC
25 points
10 comments 1 min read LW link

“Open Source AI” is a lie, but it doesn’t have to be

jacobhaimes 30 Apr 2024 23:10 UTC
19 points
5 comments 6 min read LW link
(jacob-haimes.github.io)

Questions for labs

Zach Stein-Perlman 30 Apr 2024 22:15 UTC
77 points
11 comments 8 min read LW link

Reality comprehensibility: are there illogical things in reality?

DDthinker 30 Apr 2024 21:30 UTC
−3 points
0 comments 10 min read LW link

Mechanistically Eliciting Latent Behaviors in Language Models

30 Apr 2024 18:51 UTC
225 points
44 comments 45 min read LW link 1 review

[Question] What is the easiest/funnest way to build up a comprehensive understanding of AI and AI Safety?

Jordan Arel 30 Apr 2024 18:41 UTC
4 points
2 comments 1 min read LW link

Transcoders enable fine-grained interpretable circuit analysis for language models

30 Apr 2024 17:58 UTC
76 points
14 comments 17 min read LW link

Announcing the 2024 Roots of Progress Blog-Building Intensive

jasoncrawford 30 Apr 2024 17:37 UTC
14 points
0 comments 2 min read LW link
(rootsofprogress.org)

The Intentional Stance, LLMs Edition

Eleni Angelou 30 Apr 2024 17:12 UTC
36 points
5 comments 8 min read LW link

Introducing AI Lab Watch

Zach Stein-Perlman 30 Apr 2024 17:00 UTC
226 points
31 comments 1 min read LW link
(ailabwatch.org)

Why I’m doing PauseAI

Joseph Miller 30 Apr 2024 16:21 UTC
113 points
16 comments 4 min read LW link

LLMs could be as conscious as human emulations, potentially

Canaletto 30 Apr 2024 11:36 UTC
15 points
15 comments 3 min read LW link

An interesting mathematical model of how LLMs work

Bill Benzon 30 Apr 2024 11:01 UTC
5 points
0 comments 1 min read LW link

Towards Multimodal Interpretability: Learning Sparse Interpretable Features in Vision Transformers

hugofry 29 Apr 2024 20:57 UTC
94 points
9 comments 11 min read LW link

Towards a formalization of the agent structure problem

Alex_Altair 29 Apr 2024 20:28 UTC
56 points
6 comments 14 min read LW link

Ironing Out the Squiggles

Zack_M_Davis 29 Apr 2024 16:13 UTC
170 points
37 comments 11 min read LW link

Super additivity of consciousness

Arturo Macias 29 Apr 2024 15:41 UTC
−2 points
13 comments 2 min read LW link

AISC9 has ended and there will be an AISC10

Linda Linsefors 29 Apr 2024 10:53 UTC
75 points
4 comments 2 min read LW link

Open-Source AI: A Regulatory Review

29 Apr 2024 10:10 UTC
18 points
0 comments 8 min read LW link

Big-endian is better than little-endian

Menotim 29 Apr 2024 2:30 UTC
38 points
18 comments 3 min read LW link

The Prop-room and Stage Cognitive Architecture

Robert Kralisch 29 Apr 2024 0:48 UTC
14 points
4 comments 14 min read LW link

How are Simulators and Agents related?

Robert Kralisch 29 Apr 2024 0:22 UTC
6 points
0 comments 7 min read LW link