All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 678 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

New survey: 46% of Americans are concerned about extinction from AI; 69% support a six-month pause in AI development

Orpheus16Apr 5, 2023, 1:26 AM

46 points

9 comments1 min readLW link

(today.yougov.com)

Is AGI suicidality the golden ray of hope?

Alex KirkoApr 4, 2023, 11:29 PM

−18 points

4 comments1 min readLW link

Recontextualizing the Risks of AI in More Predictable Outcomes

ignorepeterApr 4, 2023, 11:28 PM

−19 points

2 comments5 min readLW link

LW Team is adjusting moderation policy

RaemonApr 4, 2023, 8:41 PM

304 points

185 comments3 min readLW link

Excessive AI growth-rate yields little socio-economic benefit.

Cleo NardoApr 4, 2023, 7:13 PM

27 points

22 comments4 min readLW link

Penalize Model Complexity Via Self-Distillation

research_prime_spaceApr 4, 2023, 6:52 PM

15 points

7 comments1 min readLW link

The One Heresy to Rule Them All

rogersbaconApr 4, 2023, 6:23 PM

−22 points

0 comments3 min readLW link

(www.secretorum.life)

Giant (In)scrutable Matrices: (Maybe) the Best of All Possible Worlds

1a3ornApr 4, 2023, 5:39 PM

211 points

38 comments5 min readLW link 1 review

Play My Futarchy/Prediction Market Mafia Game

Arjun PanicksseryApr 4, 2023, 4:12 PM

21 points

2 comments1 min readLW link

(arjunpanickssery.substack.com)

[Question] Steelman / Ideological Turing Test of Yann LeCun’s AI X-Risk argument?

Aryeh EnglanderApr 4, 2023, 3:53 PM

26 points

14 comments1 min readLW link

Given the Restrict Act, Don’t Ban TikTok

ZviApr 4, 2023, 2:40 PM

97 points

9 comments4 min readLW link

(thezvi.wordpress.com)

Running many AI variants to find correct goal generalization

avturchinApr 4, 2023, 2:16 PM

20 points

3 comments1 min readLW link

Invocations: The Other Capabilities Overhang?

Robert_AIZIApr 4, 2023, 1:38 PM

29 points

4 comments4 min readLW link

(aizi.substack.com)

Wanted: Mental Health Program Manager at Rethink Wellbeing

Inga G.Apr 4, 2023, 11:49 AM

7 points

0 comments2 min readLW link

Where Free Will and Determinism Meet

David BravoApr 4, 2023, 10:59 AM

0 points

0 comments3 min readLW link

Strategies to Prevent AI Annihilation

lastchanceformankindApr 4, 2023, 8:59 AM

−2 points

0 comments4 min readLW link

ACX Meetup Madrid

Pablo VillalobosApr 4, 2023, 8:53 AM

5 points

2 comments1 min readLW link

[Question] Best Ways to Try to Get Funding for Alignment Research?

RGRGRGApr 4, 2023, 6:35 AM

9 points

6 comments1 min readLW link

Consider applying to a 2-week alignment project with former GitHub CEO

Bird ConceptApr 4, 2023, 6:20 AM

42 points

0 comments1 min readLW link

(twitter.com)

On how it feels generating art with DALL-E

cortrinkauApr 4, 2023, 4:13 AM

5 points

0 comments3 min readLW link

(cortrinkau.bearblog.dev)

AI Summer Harvest

Cleo NardoApr 4, 2023, 3:35 AM

130 points

10 comments1 min readLW link

How to respond to the recent condemnations of the rationalist community

Christopher KingApr 4, 2023, 1:42 AM

−2 points

7 comments4 min readLW link

Steering systems

Max HApr 4, 2023, 12:56 AM

50 points

1 comment15 min readLW link

ChatGPT Suggests Listening To Russell & Yudkowsky

JenniferRMApr 4, 2023, 12:30 AM

9 points

1 comment17 min readLW link

Complex Systems are Hard to Control

jsteinhardtApr 4, 2023, 12:00 AM

42 points

5 comments10 min readLW link

(bounded-regret.ghost.io)

Apply to the Cavendish Labs Fellowship (by 4/15)

Apr 3, 2023, 11:09 PM

11 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

Twin Cities ACX Meetup—April 2023

Timothy M.Apr 3, 2023, 11:07 PM

5 points

3 comments1 min readLW link

Communicating effectively under Knightian norms

Richard_NgoApr 3, 2023, 10:39 PM

96 points

54 comments6 min readLW link

If interpretability research goes well, it may get dangerous

So8resApr 3, 2023, 9:48 PM

201 points

11 comments2 min readLW link

Towards empathy in RL agents and beyond: Insights from cognitive science for AI Alignment

Marc CarauleanuApr 3, 2023, 7:59 PM

15 points

6 comments1 min readLW link

(clipchamp.com)

Monthly Roundup #5: April 2023

ZviApr 3, 2023, 6:50 PM

26 points

12 comments14 min readLW link

(thezvi.wordpress.com)

Exploring non-anthropocentric aspects of AI existential safety

mishkaApr 3, 2023, 6:07 PM

8 points

0 comments3 min readLW link

[Question] GJP on AGI

Suh_Prance_AlotApr 3, 2023, 5:21 PM

2 points

0 comments1 min readLW link

Do we have a plan for the “first critical try” problem?

Christopher KingApr 3, 2023, 4:27 PM

−3 points

14 comments1 min readLW link

Exploratory Analysis of RLHF Transformers with TransformerLens

Curt TiggesApr 3, 2023, 4:09 PM

21 points

2 comments11 min readLW link

(blog.eleuther.ai)

AWS Has Raised Prices Before

jefftkApr 3, 2023, 4:00 PM

7 points

3 comments1 min readLW link

(www.jefftk.com)

Mati’s introduction to pausing giant AI experiments

Mati_RoyApr 3, 2023, 3:56 PM

7 points

0 comments2 min readLW link

Superintelligence will outsmart us or it isn’t superintelligence

Neil Apr 3, 2023, 3:01 PM

−4 points

4 comments1 min readLW link

AI-kills-everyone scenarios require robotic infrastructure, but not necessarily nanotech

avturchinApr 3, 2023, 12:45 PM

53 points

47 comments4 min readLW link

Orthogonality is expensive

berenApr 3, 2023, 10:20 AM

43 points

9 comments3 min readLW link

Repeated Play of Imperfect Newcomb’s Paradox in Infra-Bayesian Physicalism

Sven NilsenApr 3, 2023, 10:06 AM

2 points

0 comments2 min readLW link

Effective Altruism Virtual Programs Apr-May 2023

Yve Nichols-EvansApr 3, 2023, 6:40 AM

1 point

0 comments1 min readLW link

Board Game Theory

Optimization ProcessApr 3, 2023, 6:23 AM

8 points

0 comments3 min readLW link

Planecrash Podcast

planecrashpodcastApr 3, 2023, 4:34 AM

10 points

5 comments1 min readLW link

[Question] I’m just starting to grasp Shard Theory. Is that a normal feeling?

twkaiserApr 3, 2023, 3:08 AM

−20 points

1 comment1 min readLW link

Rules for living in a 99.9+% lizardman world

at_the_zooApr 3, 2023, 2:39 AM

−1 points

12 comments1 min readLW link

The Friendly Drunk Fool Alignment Strategy

JenniferRMApr 3, 2023, 1:26 AM

29 points

19 comments11 min readLW link

Slack Group: Rationalist Startup Founders

Adam ZernerApr 3, 2023, 12:44 AM

31 points

2 comments3 min readLW link

Orthogonality is Expensive

DragonGodApr 3, 2023, 12:43 AM

21 points

3 comments1 min readLW link

(www.beren.io)

GTP4 capable of limited recursive improving?

Boris KashirinApr 2, 2023, 9:38 PM

2 points

3 comments1 min readLW link