An Unintentional Compliment

28 Apr 2024 20:04 UTC
21 points
1 comment
4 min read
LW link

Unintentionally Creating Value

28 Apr 2024 20:05 UTC
19 points
0 comments
2 min read
LW link

List your AI X-Risk cruxes!

Aryeh Englander
28 Apr 2024 18:26 UTC
23 points
3 comments
2 min read
LW link

How are Simulators and Agents related?

Robert Kralisch
29 Apr 2024 0:22 UTC
4 points
0 comments
7 min read
LW link

The Prop-room and Stage Cognitive Architecture

Robert Kralisch
29 Apr 2024 0:48 UTC
2 points
0 comments
14 min read
LW link

Extended Embodiment

Robert Kralisch
29 Apr 2024 0:18 UTC
4 points
0 comments
3 min read
LW link

Refusal in LLMs is mediated by a single direction

27 Apr 2024 11:13 UTC
127 points
38 comments
10 min read
LW link

Referential Containment

Robert Kralisch
29 Apr 2024 0:16 UTC
2 points
0 comments
3 min read
LW link

Disentangling Competence and Intelligence

Robert Kralisch
29 Apr 2024 0:12 UTC
2 points
0 comments
6 min read
LW link

[Aspiration-based designs] 1. Informal introduction

28 Apr 2024 13:00 UTC
30 points
2 comments
8 min read
LW link

On Not Pulling The Ladder Up Behind You

Screwtape
26 Apr 2024 21:58 UTC
104 points
5 comments
9 min read
LW link

Constructability: Plainly-coded AGIs may be feasible in the near future

27 Apr 2024 16:04 UTC
58 points
9 comments
13 min read
LW link

[Aspiration-based designs] 2. Formal framework, basic algorithm

28 Apr 2024 13:02 UTC
16 points
0 comments
16 min read
LW link

Things I tell myself to be more agentic

DMMF
28 Apr 2024 17:44 UTC
5 points
0 comments
3 min read
LW link
(danfrank.ca)

So What’s Up With PUFAs Chemically?

J Bostock
27 Apr 2024 13:32 UTC
51 points
22 comments
6 min read
LW link

[Aspiration-based designs] Outlook: dealing with complexity

28 Apr 2024 13:06 UTC
11 points
0 comments
2 min read
LW link

[Aspiration-based designs] 3. Performance and safety criteria, and aspiration intervals

Jobst Heitzig
28 Apr 2024 13:04 UTC
11 points
0 comments
12 min read
LW link

The Science Algorithm—AISC 2024 Final Presentation

Johannes C. Mayer
28 Apr 2024 14:55 UTC
7 points
0 comments
1 min read
LW link
(www.youtube.com)

Duct Tape security

Isaac King
26 Apr 2024 18:57 UTC
66 points
8 comments
5 min read
LW link

[Question] Examples of Highly Counterfactual Discoveries?

johnswentworth
23 Apr 2024 22:19 UTC
167 points
86 comments
1 min read
LW link