[Linkpost] Silver Bul­letin: For most peo­ple, poli­tics is about fit­ting in

Gunnar_ZarnckeMay 1, 2024, 6:12 PM
18 points
4 comments1 min readLW link
(www.natesilver.net)

Launch­ing ap­pli­ca­tions for AI Safety Ca­reers Course In­dia 2024

Axiom_FuturesMay 1, 2024, 5:55 PM
4 points
1 comment1 min readLW link

[Question] Shane Legg’s nec­es­sary prop­er­ties for ev­ery AGI Safety plan

jacquesthibsMay 1, 2024, 5:15 PM
58 points
12 comments1 min readLW link

KAN: Kol­mogorov-Arnold Networks

Gunnar_ZarnckeMay 1, 2024, 4:50 PM
18 points
15 comments1 min readLW link
(arxiv.org)

Man­i­fund Q1 Retro: Learn­ings from im­pact certs

Austin ChenMay 1, 2024, 4:48 PM
40 points
1 commentLW link

ACX Covid Ori­gins Post con­vinced readers

ErnestScribblerMay 1, 2024, 1:06 PM
77 points
7 comments2 min readLW link

LessWrong Com­mu­nity Week­end 2024, open for applications

May 1, 2024, 10:18 AM
79 points
2 comments7 min readLW link

Take SCIFs, it’s dan­ger­ous to go alone

May 1, 2024, 8:02 AM
42 points
1 comment3 min readLW link

AXRP Epi­sode 30 - AI Se­cu­rity with Jeffrey Ladish

DanielFilanMay 1, 2024, 2:50 AM
25 points
0 comments79 min readLW link

Neuro/​BCI/​WBE for Safe AI Workshop

Allison DuettmannMay 1, 2024, 12:46 AM
3 points
0 comments1 min readLW link

AGI: Cryp­tog­ra­phy, Se­cu­rity & Mul­tipo­lar Sce­nar­ios Workshop

Allison DuettmannMay 1, 2024, 12:42 AM
8 points
1 comment1 min readLW link

The for­mal goal is a pointer

MorphismMay 1, 2024, 12:27 AM
20 points
10 comments1 min readLW link

Arch-an­ar­chy:The­ory and practice

Peter lawless Apr 30, 2024, 11:20 PM
−6 points
0 comments2 min readLW link

“Open Source AI” is a lie, but it doesn’t have to be

jacobhaimesApr 30, 2024, 11:10 PM
19 points
5 comments6 min readLW link
(jacob-haimes.github.io)

Ques­tions for labs

Zach Stein-PerlmanApr 30, 2024, 10:15 PM
77 points
11 comments8 min readLW link

Real­ity com­pre­hen­si­bil­ity: are there illog­i­cal things in re­al­ity?

DDthinkerApr 30, 2024, 9:30 PM
−3 points
0 comments10 min readLW link

Mechanis­ti­cally Elic­it­ing La­tent Be­hav­iors in Lan­guage Models

Apr 30, 2024, 6:51 PM
210 points
43 comments45 min readLW link

[Question] What is the eas­iest/​funnest way to build up a com­pre­hen­sive un­der­stand­ing of AI and AI Safety?

Jordan ArelApr 30, 2024, 6:41 PM
4 points
2 comments1 min readLW link

Transcoders en­able fine-grained in­ter­pretable cir­cuit anal­y­sis for lan­guage models

Apr 30, 2024, 5:58 PM
74 points
14 comments17 min readLW link

An­nounc­ing the 2024 Roots of Progress Blog-Build­ing Intensive

jasoncrawfordApr 30, 2024, 5:37 PM
14 points
0 comments2 min readLW link
(rootsofprogress.org)

The In­ten­tional Stance, LLMs Edition

Eleni AngelouApr 30, 2024, 5:12 PM
30 points
3 comments8 min readLW link

In­tro­duc­ing AI Lab Watch

Zach Stein-PerlmanApr 30, 2024, 5:00 PM
225 points
30 comments1 min readLW link
(ailabwatch.org)

Why I’m do­ing PauseAI

Joseph MillerApr 30, 2024, 4:21 PM
108 points
16 comments4 min readLW link

LLMs could be as con­scious as hu­man em­u­la­tions, potentially

CanalettoApr 30, 2024, 11:36 AM
15 points
15 comments3 min readLW link

An in­ter­est­ing math­e­mat­i­cal model of how LLMs work

Bill BenzonApr 30, 2024, 11:01 AM
5 points
0 comments1 min readLW link

Towards Mul­ti­modal In­ter­pretabil­ity: Learn­ing Sparse In­ter­pretable Fea­tures in Vi­sion Transformers

hugofryApr 29, 2024, 8:57 PM
93 points
8 comments11 min readLW link

Towards a for­mal­iza­tion of the agent struc­ture problem

Alex_AltairApr 29, 2024, 8:28 PM
55 points
6 comments14 min readLW link

Iron­ing Out the Squiggles

Zack_M_DavisApr 29, 2024, 4:13 PM
157 points
36 comments11 min readLW link

Su­per ad­di­tivity of consciousness

Arturo MaciasApr 29, 2024, 3:41 PM
−2 points
13 comments2 min readLW link

AISC9 has ended and there will be an AISC10

Linda LinseforsApr 29, 2024, 10:53 AM
75 points
4 comments2 min readLW link

Open-Source AI: A Reg­u­la­tory Review

Apr 29, 2024, 10:10 AM
18 points
0 comments8 min readLW link

Big-en­dian is bet­ter than lit­tle-endian

MenotimApr 29, 2024, 2:30 AM
29 points
17 comments3 min readLW link

The Prop-room and Stage Cog­ni­tive Architecture

Robert KralischApr 29, 2024, 12:48 AM
14 points
4 comments14 min readLW link

How are Si­mu­la­tors and Agents re­lated?

Robert KralischApr 29, 2024, 12:22 AM
6 points
0 comments7 min readLW link

Ex­tended Embodiment

Robert KralischApr 29, 2024, 12:18 AM
8 points
1 comment3 min readLW link

Refer­en­tial Containment

Robert KralischApr 29, 2024, 12:16 AM
2 points
4 comments3 min readLW link

Disen­tan­gling Com­pe­tence and Intelligence

Robert KralischApr 29, 2024, 12:12 AM
23 points
7 comments6 min readLW link

List your AI X-Risk cruxes!

Aryeh EnglanderApr 28, 2024, 6:26 PM
42 points
7 comments2 min readLW link

Things I tell my­self to be more agentic

DMMFApr 28, 2024, 5:44 PM
9 points
0 comments3 min readLW link
(danfrank.ca)

Es­ti­mat­ing the Num­ber of Play­ers from Game Re­sult Percentages

Daniel LApr 28, 2024, 5:42 PM
1 point
2 comments1 min readLW link

The Science Al­gorithm—AISC 2024 Fi­nal Presentation

Johannes C. MayerApr 28, 2024, 2:55 PM
4 points
0 comments1 min readLW link
(www.youtube.com)

[Aspira­tion-based de­signs] Out­look: deal­ing with complexity

Apr 28, 2024, 1:06 PM
13 points
3 comments2 min readLW link

[Aspira­tion-based de­signs] 3. Perfor­mance and safety crite­ria, and as­pira­tion intervals

Jobst HeitzigApr 28, 2024, 1:04 PM
10 points
0 comments12 min readLW link

[Aspira­tion-based de­signs] 2. For­mal frame­work, ba­sic algorithm

28 Apr 2024 13:02 UTC
18 points
2 comments16 min readLW link

[Aspira­tion-based de­signs] 1. In­for­mal in­tro­duc­tion

28 Apr 2024 13:00 UTC
44 points
4 comments8 min readLW link

Play­ing North­boro with Lily and Rick

jefftk28 Apr 2024 2:40 UTC
10 points
1 comment2 min readLW link
(www.jefftk.com)

Re­lease of UN’s draft re­lated to the gov­er­nance of AI (a sum­mary of the Si­mon In­sti­tute’s re­sponse)

Sebastian Schmidt27 Apr 2024 18:34 UTC
7 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

Mercy to the Ma­chine: Thoughts & Rights

False Name27 Apr 2024 16:36 UTC
7 points
6 comments17 min readLW link

Con­structabil­ity: Plainly-coded AGIs may be fea­si­ble in the near future

27 Apr 2024 16:04 UTC
91 points
13 comments13 min readLW link

So What’s Up With PUFAs Chem­i­cally?

J Bostock27 Apr 2024 13:32 UTC
57 points
23 comments6 min readLW link