Arch-an­ar­chy:The­ory and practice

Peter lawless 30 Apr 2024 23:20 UTC
−6 points
0 comments2 min readLW link

“Open Source AI” is a lie, but it doesn’t have to be

jacobhaimes30 Apr 2024 23:10 UTC
19 points
5 comments6 min readLW link
(jacob-haimes.github.io)

Ques­tions for labs

Zach Stein-Perlman30 Apr 2024 22:15 UTC
77 points
11 comments8 min readLW link

Real­ity com­pre­hen­si­bil­ity: are there illog­i­cal things in re­al­ity?

DDthinker30 Apr 2024 21:30 UTC
−3 points
0 comments10 min readLW link

Mechanis­ti­cally Elic­it­ing La­tent Be­hav­iors in Lan­guage Models

30 Apr 2024 18:51 UTC
215 points
43 comments45 min readLW link

[Question] What is the eas­iest/​funnest way to build up a com­pre­hen­sive un­der­stand­ing of AI and AI Safety?

Jordan Arel30 Apr 2024 18:41 UTC
4 points
2 comments1 min readLW link

Transcoders en­able fine-grained in­ter­pretable cir­cuit anal­y­sis for lan­guage models

30 Apr 2024 17:58 UTC
75 points
14 comments17 min readLW link

An­nounc­ing the 2024 Roots of Progress Blog-Build­ing Intensive

jasoncrawford30 Apr 2024 17:37 UTC
14 points
0 comments2 min readLW link
(rootsofprogress.org)

The In­ten­tional Stance, LLMs Edition

Eleni Angelou30 Apr 2024 17:12 UTC
30 points
3 comments8 min readLW link

In­tro­duc­ing AI Lab Watch

Zach Stein-Perlman30 Apr 2024 17:00 UTC
225 points
30 comments1 min readLW link
(ailabwatch.org)

Why I’m do­ing PauseAI

Joseph Miller30 Apr 2024 16:21 UTC
104 points
16 comments4 min readLW link

LLMs could be as con­scious as hu­man em­u­la­tions, potentially

Canaletto30 Apr 2024 11:36 UTC
15 points
15 comments3 min readLW link

An in­ter­est­ing math­e­mat­i­cal model of how LLMs work

Bill Benzon30 Apr 2024 11:01 UTC
5 points
0 comments1 min readLW link

Towards Mul­ti­modal In­ter­pretabil­ity: Learn­ing Sparse In­ter­pretable Fea­tures in Vi­sion Transformers

hugofry29 Apr 2024 20:57 UTC
94 points
9 comments11 min readLW link

Towards a for­mal­iza­tion of the agent struc­ture problem

Alex_Altair29 Apr 2024 20:28 UTC
55 points
6 comments14 min readLW link

Iron­ing Out the Squiggles

Zack_M_Davis29 Apr 2024 16:13 UTC
157 points
36 comments11 min readLW link

Su­per ad­di­tivity of consciousness

Arturo Macias29 Apr 2024 15:41 UTC
−2 points
13 comments2 min readLW link

AISC9 has ended and there will be an AISC10

Linda Linsefors29 Apr 2024 10:53 UTC
75 points
4 comments2 min readLW link

Open-Source AI: A Reg­u­la­tory Review

29 Apr 2024 10:10 UTC
18 points
0 comments8 min readLW link

Big-en­dian is bet­ter than lit­tle-endian

Menotim29 Apr 2024 2:30 UTC
32 points
17 comments3 min readLW link

The Prop-room and Stage Cog­ni­tive Architecture

Robert Kralisch29 Apr 2024 0:48 UTC
14 points
4 comments14 min readLW link

How are Si­mu­la­tors and Agents re­lated?

Robert Kralisch29 Apr 2024 0:22 UTC
6 points
0 comments7 min readLW link

Ex­tended Embodiment

Robert Kralisch29 Apr 2024 0:18 UTC
8 points
1 comment3 min readLW link

Refer­en­tial Containment

Robert Kralisch29 Apr 2024 0:16 UTC
2 points
4 comments3 min readLW link

Disen­tan­gling Com­pe­tence and Intelligence

Robert Kralisch29 Apr 2024 0:12 UTC
23 points
7 comments6 min readLW link

List your AI X-Risk cruxes!

Aryeh Englander28 Apr 2024 18:26 UTC
42 points
7 comments2 min readLW link

Things I tell my­self to be more agentic

DMMF28 Apr 2024 17:44 UTC
9 points
0 comments3 min readLW link
(danfrank.ca)

Es­ti­mat­ing the Num­ber of Play­ers from Game Re­sult Percentages

Daniel L28 Apr 2024 17:42 UTC
1 point
2 comments1 min readLW link

The Science Al­gorithm—AISC 2024 Fi­nal Presentation

Johannes C. Mayer28 Apr 2024 14:55 UTC
4 points
0 comments1 min readLW link
(www.youtube.com)

[Aspira­tion-based de­signs] Out­look: deal­ing with complexity

28 Apr 2024 13:06 UTC
13 points
3 comments2 min readLW link

[Aspira­tion-based de­signs] 3. Perfor­mance and safety crite­ria, and as­pira­tion intervals

Jobst Heitzig28 Apr 2024 13:04 UTC
10 points
0 comments12 min readLW link

[Aspira­tion-based de­signs] 2. For­mal frame­work, ba­sic algorithm

28 Apr 2024 13:02 UTC
18 points
2 comments16 min readLW link

[Aspira­tion-based de­signs] 1. In­for­mal in­tro­duc­tion

28 Apr 2024 13:00 UTC
44 points
4 comments8 min readLW link

Play­ing North­boro with Lily and Rick

jefftk28 Apr 2024 2:40 UTC
10 points
1 comment2 min readLW link
(www.jefftk.com)

Re­lease of UN’s draft re­lated to the gov­er­nance of AI (a sum­mary of the Si­mon In­sti­tute’s re­sponse)

Sebastian Schmidt27 Apr 2024 18:34 UTC
7 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

Mercy to the Ma­chine: Thoughts & Rights

False Name27 Apr 2024 16:36 UTC
7 points
6 comments17 min readLW link

Con­structabil­ity: Plainly-coded AGIs may be fea­si­ble in the near future

27 Apr 2024 16:04 UTC
91 points
15 comments13 min readLW link

So What’s Up With PUFAs Chem­i­cally?

J Bostock27 Apr 2024 13:32 UTC
57 points
25 comments6 min readLW link

Link: Let’s Think Dot by Dot: Hid­den Com­pu­ta­tion in Trans­former Lan­guage Models by Ja­cob Pfau, William Mer­rill & Sa­muel R. Bowman

Chris_Leong27 Apr 2024 13:22 UTC
12 points
0 comments1 min readLW link
(twitter.com)

Two Ver­nor Vinge Book Reviews

Maxwell Tabarrok27 Apr 2024 12:14 UTC
17 points
0 comments2 min readLW link
(www.maximum-progress.com)

Re­fusal in LLMs is me­di­ated by a sin­gle direction

27 Apr 2024 11:13 UTC
252 points
95 comments10 min readLW link

[Question] Plau­si­bil­ity of Get­ting Early Warn­ing Shots be­cause AIs can’t co­or­di­nate?

hmys27 Apr 2024 8:02 UTC
12 points
0 comments1 min readLW link

AI Safety Sphere

Myles H27 Apr 2024 1:49 UTC
6 points
2 comments2 min readLW link

Ex­plor­ing the Eso­teric Path­ways to AI Sen­tience (Part One)

jeffreycaruso27 Apr 2024 1:02 UTC
−11 points
6 comments2 min readLW link

Su­per­po­si­tion is not “just” neu­ron polysemanticity

LawrenceC26 Apr 2024 23:22 UTC
68 points
4 comments13 min readLW link

D&D.Sci Long War: Defen­der of Data-mocracy

aphyer26 Apr 2024 22:30 UTC
44 points
20 comments4 min readLW link

On Not Pul­ling The Lad­der Up Be­hind You

Screwtape26 Apr 2024 21:58 UTC
190 points
21 comments9 min readLW link

We are headed into an ex­treme com­pute overhang

devrandom26 Apr 2024 21:38 UTC
54 points
34 comments2 min readLW link

[Con­cept Depen­dency] Edge Reg­u­lar Lat­tice Graph

Johannes C. Mayer26 Apr 2024 21:14 UTC
9 points
1 comment1 min readLW link

[Con­cept Depen­dency] Con­cept Depen­dency Posts

Johannes C. Mayer26 Apr 2024 20:57 UTC
11 points
3 comments2 min readLW link