Truthseeking is the ground in which other principles grow

Elizabeth · May 27, 2024, 1:09 AM
248 points
16 comments · 16 min read · LW link

If trying to communicate about AI risks, make it vivid

MNoetel · May 27, 2024, 1:00 AM
11 points
1 comment · 2 min read · LW link

Luna Lovegood and the Chamber of Secrets, Part 1

May 26, 2024, 10:17 PM
24 points
0 comments · 3 min read · LW link

If you are also the worst at politics

lemonhope · May 26, 2024, 8:07 PM
32 points
8 comments · 1 min read · LW link

Review: Conor Moreton’s “Civilization & Cooperation”

Duncan Sabien (Inactive) · May 26, 2024, 7:32 PM
80 points
8 comments · 38 min read · LW link

The necessity of “Guardian AI” and two conditions for its achievement

Proica · May 26, 2024, 5:39 PM
−2 points
0 comments · 15 min read · LW link

Notifications Received in 30 Minutes of Class

tanagrabeast · May 26, 2024, 5:02 PM
358 points
16 comments · 8 min read · LW link

Show LW: HackerNews but for research papers

sleno · May 26, 2024, 3:14 PM
6 points
1 comment · 1 min read · LW link

Disproving and partially fixing a fully homomorphic encryption scheme with perfect secrecy

Lysandre Terrisse · May 26, 2024, 2:56 PM
16 points
1 comment · 18 min read · LW link

The AI Revolution in Biology

Roman Leventov · May 26, 2024, 9:30 AM
13 points
0 comments · 1 min read · LW link
(www.cognitiverevolution.ai)

[Question] Who does the artwork for LessWrong?

Edwin Evans · May 26, 2024, 5:55 AM
10 points
1 comment · 1 min read · LW link

[Question] Is there an idiom for bonding over shared trials/trauma?

CstineSublime · May 26, 2024, 1:18 AM
2 points
1 comment · 1 min read · LW link

Moloch—An Illustrated Primer

James Stephen Brown · May 26, 2024, 1:04 AM
5 points
0 comments · 7 min read · LW link
(nonzerosum.games)

[Question] Is CDT with precommitment enough?

martinkunev · May 25, 2024, 9:40 PM
10 points
17 comments · 1 min read · LW link

Complex systems theory in human performance. New model for conceptualizing training, adaptation and long-term development

Matěj Nekoranec · May 25, 2024, 8:17 PM
1 point
0 comments · 7 min read · LW link

Blindspot in Sport’s Data-Driven Age

Matěj Nekoranec · May 25, 2024, 8:17 PM
2 points
0 comments · 7 min read · LW link

LMSR subsidy parameter is the price of information

Abhimanyu Pallavi Sudhir · May 25, 2024, 6:05 PM
5 points
0 comments · 1 min read · LW link

Low Fertility is a Degrowth Paradise

Maxwell Tabarrok · May 25, 2024, 5:35 PM
7 points
2 comments · 3 min read · LW link
(www.maximum-progress.com)

Episode: Austin vs Linch on OpenAI

Austin Chen · May 25, 2024, 4:15 PM
20 points
25 comments · LW link
(manifund.substack.com)

Training-time domain authorization could be helpful for safety

May 25, 2024, 3:10 PM
15 points
4 comments · 7 min read · LW link

Level up your spreadsheeting

angelinahli · May 25, 2024, 2:57 PM
44 points
11 comments · 3 min read · LW link
(docs.google.com)

“Successful language model evals” by Jason Wei

Arjun Panickssery · May 25, 2024, 9:34 AM
7 points
0 comments · 1 min read · LW link
(www.jasonwei.net)

Beta Tester Request: Rallypoint Bounties

lukemarks · May 25, 2024, 9:11 AM
25 points
4 comments · 1 min read · LW link

[Question] What should the norms around AI voices be?

ChristianKl · May 25, 2024, 6:29 AM
17 points
6 comments · 1 min read · LW link

Secret US natsec project with intel revealed

Nathan Helm-Burger · May 25, 2024, 4:22 AM
27 points
1 comment · 1 min read · LW link
(www.politico.com)

Launch & Grow Your University Group: Apply now to OSP & FSP!

agucova · May 25, 2024, 1:03 AM
3 points
0 comments · LW link

Computational Mechanics Hackathon (June 1 & 2)

Adam Shai · May 24, 2024, 10:18 PM
34 points
5 comments · 1 min read · LW link

[Question] Request for comments/opinions/ideas on safety/ethics for use of tool AI in a large healthcare system.

bokov · May 24, 2024, 8:53 PM
5 points
2 comments · 1 min read · LW link

NYU Code Debates Update/Postmortem

David Rein · May 24, 2024, 4:08 PM
27 points
4 comments · 10 min read · LW link

AI companies aren’t really using external evaluators

Zach Stein-Perlman · May 24, 2024, 4:01 PM
242 points
15 comments · 4 min read · LW link

The Schumer Report on AI (RTFB)

Zvi · May 24, 2024, 3:10 PM
34 points
3 comments · 36 min read · LW link
(thezvi.wordpress.com)

minutes from a human-alignment meeting

bhauth · May 24, 2024, 5:01 AM
67 points
4 comments · 2 min read · LW link

Talent Needs of Technical AI Safety Teams

May 24, 2024, 12:36 AM
117 points
65 comments · 14 min read · LW link

How to Give Coming AGI’s the Best Chance of Figuring Out Ethics for Us

sweenesm · May 23, 2024, 7:44 PM
1 point
2 comments · 10 min read · LW link

Mentorship in AGI Safety (MAGIS) call for mentors

May 23, 2024, 6:28 PM
31 points
3 comments · 2 min read · LW link

Quick Thoughts on Scaling Monosemanticity

Joel Burget · May 23, 2024, 4:22 PM
28 points
1 comment · 4 min read · LW link
(transformer-circuits.pub)

The case for stopping AI safety research

catubc · May 23, 2024, 3:55 PM
53 points
38 comments · 1 min read · LW link

[Question] SAE sparse feature graph using only residual layers

Jaehyuk Lim · May 23, 2024, 1:32 PM
0 points
3 comments · 1 min read · LW link

[Question] Are most people deeply confused about “love”, or am I missing a human universal?

SpectrumDT · May 23, 2024, 1:22 PM
13 points
28 comments · 3 min read · LW link

Executive Dysfunction 101

DaystarEld · May 23, 2024, 12:43 PM
28 points
1 comment · 3 min read · LW link
(daystareld.com)

AI #65: I Spy With My AI

Zvi · May 23, 2024, 12:40 PM
28 points
7 comments · 43 min read · LW link
(thezvi.wordpress.com)

What mistakes has the AI safety movement made?

EuanMcLean · May 23, 2024, 11:19 AM
64 points
29 comments · 12 min read · LW link

What should AI safety be trying to achieve?

EuanMcLean · May 23, 2024, 11:17 AM
17 points
1 comment · 13 min read · LW link

What will the first human-level AI look like, and how might things go wrong?

EuanMcLean · May 23, 2024, 11:17 AM
20 points
2 comments · 15 min read · LW link

Big Picture AI Safety: Introduction

EuanMcLean · May 23, 2024, 11:15 AM
46 points
7 comments · 5 min read · LW link

Paper in Science: Managing extreme AI risks amid rapid progress

JanB · May 23, 2024, 8:40 AM
50 points
2 comments · 1 min read · LW link

Power Law Policy

Ben Turtel · May 23, 2024, 5:28 AM
4 points
7 comments · 6 min read · LW link
(bturtel.substack.com)

Why entropy means you might not have to worry as much about superintelligent AI

Ron J · May 23, 2024, 3:52 AM
−26 points
1 comment · 2 min read · LW link

Quick Thoughts on Our First Sampling Run

jefftk · May 23, 2024, 12:20 AM
29 points
3 comments · 2 min read · LW link
(www.jefftk.com)

AI Safety proposal—Influencing the superintelligence explosion

Morgan · May 22, 2024, 11:31 PM
0 points
2 comments · 7 min read · LW link