[Question] How have analo­gous In­dus­tries solved In­ter­ested > Trained > Em­ployed bot­tle­necks?

yanni kyriacosMay 30, 2024, 11:59 PM
4 points
1 comment1 min readLW link

Duck­bill Masks Bet­ter?

jefftkMay 30, 2024, 11:40 PM
20 points
3 comments1 min readLW link
(www.jefftk.com)

OpenAI: He­len Toner Speaks

ZviMay 30, 2024, 9:10 PM
86 points
8 comments13 min readLW link
(thezvi.wordpress.com)

Non-Dis­par­age­ment Ca­naries for OpenAI

May 30, 2024, 7:20 PM
288 points
51 comments2 min readLW link

Clar­ify­ing METR’s Au­dit­ing Role

Beth BarnesMay 30, 2024, 6:41 PM
108 points
1 comment2 min readLW link

A civ­i­liza­tion ran by amateurs

Olli JärviniemiMay 30, 2024, 5:57 PM
61 points
8 comments6 min readLW link

One week left to ap­ply for the Roots of Progress Blog-Build­ing Intensive

jasoncrawfordMay 30, 2024, 4:55 PM
8 points
0 comments3 min readLW link
(rootsofprogress.org)

Get­ting started with AI Align­ment re­search: how to re­pro­duce an ex­per­i­ment from re­search paper

Alexander230May 30, 2024, 2:51 PM
3 points
0 comments3 min readLW link

AI #66: Oh to Be Less Online

ZviMay 30, 2024, 2:20 PM
37 points
6 comments56 min readLW link
(thezvi.wordpress.com)

The 27 papers

WitheringWeightsMay 30, 2024, 8:46 AM
18 points
2 comments1 min readLW link

The Mar­ket Sin­gu­lar­ity: A New Perspective

azsantoskMay 30, 2024, 7:05 AM
1 point
0 comments15 min readLW link

Awakening

lsusrMay 30, 2024, 7:03 AM
124 points
79 comments9 min readLW link

Value Claims (In Par­tic­u­lar) Are Usu­ally Bullshit

johnswentworthMay 30, 2024, 6:26 AM
144 points
18 comments2 min readLW link

The Pearly Gates

lsusrMay 30, 2024, 4:01 AM
127 points
6 comments3 min readLW link

AXRP Epi­sode 32 - Un­der­stand­ing Agency with Jan Kulveit

DanielFilanMay 30, 2024, 3:50 AM
20 points
0 comments53 min readLW link

US Pres­i­den­tial Elec­tion: Tractabil­ity, Im­por­tance, and Ur­gency

kuhanjMay 29, 2024, 11:52 PM
42 points
2 comments3 min readLW link

Thoughts on SB-1047

ryan_greenblattMay 29, 2024, 11:26 PM
60 points
1 comment11 min readLW link

How I de­signed my own writ­ing sys­tem, VJScript

vkethanaMay 29, 2024, 11:18 PM
2 points
1 comment1 min readLW link
(www.vkethana.com)

AI and integrity

Nathan YoungMay 29, 2024, 8:45 PM
10 points
0 comments2 min readLW link
(nathanpmyoung.substack.com)

MIRI 2024 Com­mu­ni­ca­tions Strategy

Gretta DulebaMay 29, 2024, 7:33 PM
325 points
216 comments7 min readLW link

2024 Sum­mer AI Safety In­tro Fel­low­ship and So­cials in Boston

KevinWeiMay 29, 2024, 6:27 PM
8 points
0 comments1 min readLW link

Apollo Re­search 1-year update

May 29, 2024, 5:44 PM
93 points
0 comments7 min readLW link

Re­sponse to nos­talge­braist: proudly wav­ing my moral-an­tire­al­ist bat­tle flag

Steven ByrnesMay 29, 2024, 4:48 PM
103 points
29 comments11 min readLW link

Look­ing be­yond Everett in mul­ti­ver­sal views of LLMs

kromemMay 29, 2024, 12:35 PM
10 points
0 comments8 min readLW link

[Question] Invit­ing dis­cus­sion of “Beat AI: A con­test us­ing philo­soph­i­cal con­cepts”

David JamesMay 29, 2024, 11:55 AM
2 points
1 comment1 min readLW link

AI com­pa­nies’ commitments

Zach Stein-PerlmanMay 29, 2024, 11:00 AM
36 points
0 comments1 min readLW link

One way vi­o­linists fail

Solenoid_EntityMay 29, 2024, 4:08 AM
33 points
5 comments3 min readLW link

Hardshipification

Jonathan MoregårdMay 28, 2024, 8:02 PM
88 points
17 comments2 min readLW link
(honestliving.substack.com)

When Are Cir­cu­lar Defi­ni­tions A Prob­lem?

johnswentworthMay 28, 2024, 8:00 PM
68 points
15 comments3 min readLW link

Notes on Gracefulness

David GrossMay 28, 2024, 6:40 PM
20 points
2 comments25 min readLW link

[Question] What’s a bet­ter term now that “AGI” is too vague?

Seth HerdMay 28, 2024, 6:02 PM
15 points
9 comments2 min readLW link

Re­ward hack­ing be­hav­ior can gen­er­al­ize across tasks

May 28, 2024, 4:33 PM
79 points
5 comments21 min readLW link

Quick Ad­vice on Writ­ing Essays

Niko_McCartyMay 28, 2024, 3:02 PM
11 points
0 comments3 min readLW link
(www.nikomccarty.com)

[Linkpost] The Ex­pres­sive Ca­pac­ity of State Space Models: A For­mal Lan­guage Perspective

Bogdan Ionut CirsteaMay 28, 2024, 1:49 PM
4 points
3 comments1 min readLW link
(arxiv.org)

OpenAI: Fallout

ZviMay 28, 2024, 1:20 PM
204 points
25 comments36 min readLW link
(thezvi.wordpress.com)

2024 State of the AI Reg­u­la­tory Land­scape

May 28, 2024, 11:59 AM
30 points
0 comments2 min readLW link
(www.convergenceanalysis.org)

Find­ing Back­ward Chain­ing Cir­cuits in Trans­form­ers Trained on Tree Search

May 28, 2024, 5:29 AM
50 points
1 comment9 min readLW link
(arxiv.org)

[Question] How to get nerds fas­ci­nated about mys­te­ri­ous chronic ill­ness re­search?

riceissaMay 27, 2024, 10:58 PM
95 points
50 comments2 min readLW link

Un­der­stand­ing Gödel’s com­plete­ness theorem

jessicataMay 27, 2024, 6:55 PM
39 points
0 comments15 min readLW link
(unstableontology.com)

Publi­cly dis­clos­ing com­pute ex­pen­di­ture daily as a safety regulation

teraflipflopMay 27, 2024, 6:28 PM
−4 points
0 comments2 min readLW link

In­tran­si­tive Trust

ScrewtapeMay 27, 2024, 4:55 PM
41 points
15 comments10 min readLW link

Overview of in­tro­duc­tory re­sources in AI Governance

Lucie PhilipponMay 27, 2024, 4:21 PM
19 points
0 comments6 min readLW link

I am the Golden Gate Bridge

ZviMay 27, 2024, 2:40 PM
95 points
6 comments27 min readLW link
(thezvi.wordpress.com)

Maybe An­thropic’s Long-Term Benefit Trust is powerless

Zach Stein-PerlmanMay 27, 2024, 1:00 PM
202 points
21 comments2 min readLW link

Real Life Sort by Controversial

EloMay 27, 2024, 12:22 PM
5 points
19 comments20 min readLW link

Ju­lia Tasks 101

SatvikBeriMay 27, 2024, 11:32 AM
1 point
0 comments4 min readLW link

De­bates how to defeat ag­ing: Aubrey de Grey vs. Peter Fedichev.

avturchinMay 27, 2024, 10:25 AM
17 points
0 comments1 min readLW link

Be­ing against in­vol­un­tary death and be­ing open to change are compatible

Andy_McKenzieMay 27, 2024, 6:37 AM
32 points
5 comments2 min readLW link

If you’re an AI Safety move­ment builder con­sider ask­ing your mem­bers these ques­tions in an interview

yanni kyriacosMay 27, 2024, 5:46 AM
4 points
0 commentsLW link

Book re­view: Every­thing Is Predictable

PeterMcCluskeyMay 27, 2024, 3:33 AM
46 points
1 comment2 min readLW link
(bayesianinvestor.com)