RSS

List your AI X-Risk cruxes!

Aryeh Englander28 Apr 2024 18:26 UTC
14 points
0 comments2 min readLW link

Things I tell my­self to be more agentic

DMMF28 Apr 2024 17:44 UTC
4 points
0 comments3 min readLW link
(danfrank.ca)

[Aspira­tion-based de­signs] 1. In­for­mal in­tro­duc­tion

28 Apr 2024 13:00 UTC
22 points
0 comments8 min readLW link

Re­fusal in LLMs is me­di­ated by a sin­gle direction

27 Apr 2024 11:13 UTC
126 points
29 comments10 min readLW link

Es­ti­mat­ing the Num­ber of Play­ers from Game Re­sult Percentages

Daniel L28 Apr 2024 17:42 UTC
1 point
0 comments1 min readLW link

[Aspira­tion-based de­signs] 2. For­mal frame­work, ba­sic algorithm

28 Apr 2024 13:02 UTC
14 points
0 comments16 min readLW link

The Science Al­gorithm—AISC 2024 Fi­nal Presentation

Johannes C. Mayer28 Apr 2024 14:55 UTC
7 points
0 comments1 min readLW link
(www.youtube.com)

[Aspira­tion-based de­signs] Out­look: deal­ing with complexity

28 Apr 2024 13:06 UTC
11 points
0 comments2 min readLW link

[Aspira­tion-based de­signs] 3. Perfor­mance and safety crite­ria, and as­pira­tion intervals

Jobst Heitzig28 Apr 2024 13:04 UTC
9 points
0 comments12 min readLW link

Con­structabil­ity: Plainly-coded AGIs may be fea­si­ble in the near future

27 Apr 2024 16:04 UTC
59 points
8 comments13 min readLW link

On Not Pul­ling The Lad­der Up Be­hind You

Screwtape26 Apr 2024 21:58 UTC
101 points
5 comments9 min readLW link

So What’s Up With PUFAs Chem­i­cally?

J Bostock27 Apr 2024 13:32 UTC
51 points
21 comments6 min readLW link

Duct Tape security

Isaac King26 Apr 2024 18:57 UTC
66 points
8 comments5 min readLW link

[Question] Ex­am­ples of Highly Coun­ter­fac­tual Dis­cov­er­ies?

johnswentworth23 Apr 2024 22:19 UTC
160 points
85 comments1 min readLW link

Su­per­po­si­tion is not “just” neu­ron polysemanticity

LawrenceC26 Apr 2024 23:22 UTC
47 points
3 comments13 min readLW link

Thoughts on seed oil

dynomight20 Apr 2024 12:29 UTC
261 points
82 comments17 min readLW link
(dynomight.net)

The first fu­ture and the best future

KatjaGrace25 Apr 2024 6:40 UTC
97 points
9 comments1 min readLW link
(worldspiritsockpuppet.com)

Spa­tial at­ten­tion as a “tell” for em­pa­thetic simu­la­tion?

Steven Byrnes26 Apr 2024 15:10 UTC
49 points
9 comments8 min readLW link

D&D.Sci Long War: Defen­der of Data-mocracy

aphyer26 Apr 2024 22:30 UTC
38 points
8 comments3 min readLW link

We are headed into an ex­treme com­pute overhang

devrandom26 Apr 2024 21:38 UTC
38 points
14 comments2 min readLW link