AI Risk and the US Presidential Candidates

Zane
Jan 6, 2024, 8:18 PM
41 points
22 comments
6 min read

A Challenge to Effective Altruism’s Premises

False Name
Jan 6, 2024, 6:46 PM
−26 points
3 comments
3 min read

Lack of Spider-Man is evidence against the simulation hypothesis

RamblinDash
Jan 6, 2024, 6:17 PM
7 points
23 comments
1 min read

A Land Tax For Britain

A.H.
Jan 6, 2024, 3:52 PM
6 points
9 comments
4 min read

Book review: Trick or treatment (2008)

Fleece Minutia
Jan 6, 2024, 3:40 PM
1 point
0 comments
2 min read

Are we inside a black hole?

Jay
Jan 6, 2024, 1:30 PM
2 points
5 comments
1 min read

Survey of 2,778 AI authors: six parts in pictures

KatjaGrace
Jan 6, 2024, 4:43 AM
80 points
1 comment
2 min read

Project ideas: Epistemics

Lukas Finnveden
Jan 5, 2024, 11:41 PM
43 points
4 comments
(www.forethought.org)

Almost everyone I’ve met would be well-served thinking more about what to focus on

Henrik Karlsson
Jan 5, 2024, 9:01 PM
96 points
8 comments
11 min read
(www.henrikkarlsson.xyz)

The Next ChatGPT Moment: AI Avatars

Jan 5, 2024, 8:14 PM
43 points
10 comments
1 min read

AI Impacts 2023 Expert Survey on Progress in AI

habryka
Jan 5, 2024, 7:42 PM
28 points
2 comments
7 min read
(wiki.aiimpacts.org)

Technology path dependence and evaluating expertise

Jan 5, 2024, 7:21 PM
25 points
2 comments
15 min read

The Hippie Rabbit Hole - Nuggets of Gold in Rivers of Bullshit

Jonathan Moregård
Jan 5, 2024, 6:27 PM
39 points
20 comments
8 min read
(honestliving.substack.com)

[Question] What technical topics could help with boundaries/membranes?

Chipmonk
Jan 5, 2024, 6:14 PM
15 points
25 comments
1 min read

Catching AIs red-handed

Jan 5, 2024, 5:43 PM
111 points
27 comments
17 min read

AI Impacts Survey: December 2023 Edition

Zvi
Jan 5, 2024, 2:40 PM
34 points
6 comments
10 min read
(thezvi.wordpress.com)

Forecast your 2024 with Fatebook

Sage Future
Jan 5, 2024, 2:07 PM
19 points
0 comments
1 min read
(fatebook.io)

Predictive model agents are sort of corrigible

Raymond Douglas
Jan 5, 2024, 2:05 PM
35 points
6 comments
3 min read

Strik­ing Im­pli­ca­tions for Learn­ing The­ory, In­ter­pretabil­ity — and Safety?

RogerDearnaley
Jan 5, 2024, 8:46 AM
37 points
4 comments
2 min read

If I ran the zoo

Optimization Process
Jan 5, 2024, 5:14 AM
18 points
1 comment
2 min read

Does AI care about reality or just its own perception?

RedFishBlueFish
Jan 5, 2024, 4:05 AM
−6 points
8 comments
1 min read

MIRI 2024 Mission and Strategy Update

Malo
Jan 5, 2024, 12:20 AM
223 points
44 comments
8 min read

Project ideas: Governance during explosive technological growth

Lukas Finnveden
Jan 4, 2024, 11:51 PM
14 points
0 comments
(www.forethought.org)

Hello

S Benfield
Jan 4, 2024, 11:35 PM
6 points
0 comments
2 min read

Using Threats to Achieve Socially Optimal Outcomes

StrivingForLegibility
Jan 4, 2024, 11:30 PM
8 points
0 comments
3 min read

Best-Responding Is Not Always the Best Response

StrivingForLegibility
Jan 4, 2024, 11:30 PM
10 points
0 comments
3 min read

Safety Data Sheets for Optimization Processes

StrivingForLegibility
Jan 4, 2024, 11:30 PM
15 points
1 comment
4 min read

The Gears of Argmax

StrivingForLegibility
Jan 4, 2024, 11:30 PM
11 points
0 comments
3 min read

Cellular reprogramming, pneumatic launch systems, and terraforming Mars: Some things I learned about at Foresight Vision Weekend

jasoncrawford
Jan 4, 2024, 7:33 PM
28 points
0 comments
8 min read
(rootsofprogress.org)

Deep atheism and AI risk

Joe Carlsmith
Jan 4, 2024, 6:58 PM
153 points
22 comments
27 min read

Some Vacation Photos

johnswentworth
Jan 4, 2024, 5:15 PM
83 points
0 comments
1 min read

AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright Infringement, and Congressional Questions about Research Standards in AI Safety

Jan 4, 2024, 4:09 PM
8 points
0 comments
6 min read
(newsletter.safe.ai)

EAG Bay Area Satellite event: AI Institution Design Hackathon 2024

beatrice@foresight.org
Jan 4, 2024, 3:02 PM
1 point
0 comments
1 min read

AI #45: To Be Determined

Zvi
Jan 4, 2024, 3:00 PM
52 points
4 comments
31 min read
(thezvi.wordpress.com)

Screen-supported Portable Monitor

jefftk
Jan 4, 2024, 1:50 PM
16 points
10 comments
1 min read
(www.jefftk.com)

[Question] Which investments for aligned-AI outcomes?

tailcalled
Jan 4, 2024, 1:28 PM
8 points
9 comments
2 min read

Non-alignment project ideas for making transformative AI go well

Lukas Finnveden
Jan 4, 2024, 7:23 AM
44 points
1 comment
(www.forethought.org)

Fact Checking and Retaliation Against Sources

jefftk
Jan 4, 2024, 12:41 AM
7 points
2 comments
4 min read
(www.jefftk.com)

Investigating Alternative Futures: Human and Superintelligence Interaction Scenarios

Hiroshi Yamakawa
Jan 3, 2024, 11:46 PM
1 point
0 comments
17 min read

“At­ti­tudes Toward Ar­tifi­cial Gen­eral In­tel­li­gence: Re­sults from Amer­i­can Adults 2021 and 2023”—call for re­view­ers (Seeds of Science)

rogersbacon
Jan 3, 2024, 8:11 PM
4 points
0 comments
1 min read

What’s up with LLMs representing XORs of arbitrary features?

Sam Marks
Jan 3, 2024, 7:44 PM
158 points
63 comments
16 min read

Spirit Airlines Merger Play

sapphire
Jan 3, 2024, 7:25 PM
5 points
12 comments
1 min read

$300 for the best sci-fi prompt: the results

RomanS
Jan 3, 2024, 7:10 PM
16 points
19 comments
7 min read

Agent membranes/boundaries and formalizing “safety”

Chipmonk
Jan 3, 2024, 5:55 PM
26 points
46 comments
3 min read

Safety First: safety before full alignment. The deontic sufficiency hypothesis.

Chipmonk
Jan 3, 2024, 5:55 PM
48 points
3 comments
3 min read

Practically A Book Review: Appendix to “Nonlinear’s Evidence: Debunking False and Misleading Claims” (ThingOfThings)

tailcalled
Jan 3, 2024, 5:07 PM
111 points
25 comments
2 min read
(thingofthings.substack.com)

Trivial Mathematics as a Path Forward

ACrackedPot
Jan 3, 2024, 4:41 PM
−4 points
2 comments
2 min read

Copyright Confrontation #1

Zvi
Jan 3, 2024, 3:50 PM
34 points
7 comments
18 min read
(thezvi.wordpress.com)

[Question] Theoretically, could we balance the budget painlessly?

Logan Zoellner
Jan 3, 2024, 2:46 PM
4 points
12 comments
1 min read

Johannes’ Biography

Johannes C. Mayer
Jan 3, 2024, 1:27 PM
24 points
0 comments
10 min read