All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 456 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Is AGI suicidality the golden ray of hope?

Alex Kirko4 Apr 2023 23:29 UTC

−18 points

4 comments1 min readLW link

Recontextualizing the Risks of AI in More Predictable Outcomes

ignorepeter4 Apr 2023 23:28 UTC

−19 points

2 comments5 min readLW link

LW Team is adjusting moderation policy

Raemon4 Apr 2023 20:41 UTC

305 points

185 comments3 min readLW link

Excessive AI growth-rate yields little socio-economic benefit.

Cleo Nardo4 Apr 2023 19:13 UTC

27 points

22 comments4 min readLW link

Penalize Model Complexity Via Self-Distillation

research_prime_space4 Apr 2023 18:52 UTC

15 points

7 comments1 min readLW link

The One Heresy to Rule Them All

rogersbacon4 Apr 2023 18:23 UTC

−22 points

0 comments3 min readLW link

(www.secretorum.life)

Giant (In)scrutable Matrices: (Maybe) the Best of All Possible Worlds

1a3orn4 Apr 2023 17:39 UTC

214 points

38 comments5 min readLW link 1 review

Play My Futarchy/Prediction Market Mafia Game

Arjun Panickssery4 Apr 2023 16:12 UTC

21 points

2 comments1 min readLW link

(arjunpanickssery.substack.com)

[Question] Steelman / Ideological Turing Test of Yann LeCun’s AI X-Risk argument?

Aryeh Englander4 Apr 2023 15:53 UTC

26 points

14 comments1 min readLW link

Given the Restrict Act, Don’t Ban TikTok

Zvi4 Apr 2023 14:40 UTC

97 points

9 comments4 min readLW link

(thezvi.wordpress.com)

Running many AI variants to find correct goal generalization

avturchin4 Apr 2023 14:16 UTC

20 points

3 comments1 min readLW link

Invocations: The Other Capabilities Overhang?

Robert_AIZI4 Apr 2023 13:38 UTC

29 points

4 comments4 min readLW link

(aizi.substack.com)

Wanted: Mental Health Program Manager at Rethink Wellbeing

Inga G.4 Apr 2023 11:49 UTC

7 points

0 comments2 min readLW link

Strategies to Prevent AI Annihilation

lastchanceformankind4 Apr 2023 8:59 UTC

−2 points

0 comments4 min readLW link

ACX Meetup Madrid

Pablo Villalobos4 Apr 2023 8:53 UTC

5 points

2 comments1 min readLW link

[Question] Best Ways to Try to Get Funding for Alignment Research?

RGRGRG4 Apr 2023 6:35 UTC

10 points

6 comments1 min readLW link

Consider applying to a 2-week alignment project with former GitHub CEO

Bird Concept4 Apr 2023 6:20 UTC

42 points

0 comments1 min readLW link

(twitter.com)

On how it feels generating art with DALL-E

cortrinkau4 Apr 2023 4:13 UTC

5 points

0 comments3 min readLW link

(cortrinkau.bearblog.dev)

AI Summer Harvest

Cleo Nardo4 Apr 2023 3:35 UTC

130 points

10 comments1 min readLW link

How to respond to the recent condemnations of the rationalist community

Christopher King4 Apr 2023 1:42 UTC

−2 points

7 comments4 min readLW link

Steering systems

Max H4 Apr 2023 0:56 UTC

51 points

1 comment15 min readLW link

ChatGPT Suggests Listening To Russell & Yudkowsky

JenniferRM4 Apr 2023 0:30 UTC

9 points

1 comment17 min readLW link

Complex Systems are Hard to Control

jsteinhardt4 Apr 2023 0:00 UTC

42 points

5 comments10 min readLW link

(bounded-regret.ghost.io)

Apply to the Cavendish Labs Fellowship (by 4/15)

agg and derikk

3 Apr 2023 23:09 UTC

11 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

Twin Cities ACX Meetup—April 2023

Timothy M.3 Apr 2023 23:07 UTC

5 points

3 comments1 min readLW link

Communicating effectively under Knightian norms

Richard_Ngo3 Apr 2023 22:39 UTC

97 points

54 comments6 min readLW link

If interpretability research goes well, it may get dangerous

So8res3 Apr 2023 21:48 UTC

201 points

11 comments2 min readLW link

Towards empathy in RL agents and beyond: Insights from cognitive science for AI Alignment

Marc Carauleanu3 Apr 2023 19:59 UTC

15 points

6 comments1 min readLW link

(clipchamp.com)

Monthly Roundup #5: April 2023

Zvi3 Apr 2023 18:50 UTC

26 points

12 comments14 min readLW link

(thezvi.wordpress.com)

Exploring non-anthropocentric aspects of AI existential safety

mishka3 Apr 2023 18:07 UTC

11 points

1 comment3 min readLW link

[Question] GJP on AGI

Suh_Prance_Alot3 Apr 2023 17:21 UTC

2 points

0 comments1 min readLW link

Do we have a plan for the “first critical try” problem?

Christopher King3 Apr 2023 16:27 UTC

−3 points

14 comments1 min readLW link

Exploratory Analysis of RLHF Transformers with TransformerLens

Curt Tigges3 Apr 2023 16:09 UTC

21 points

2 comments11 min readLW link

(blog.eleuther.ai)

AWS Has Raised Prices Before

jefftk3 Apr 2023 16:00 UTC

7 points

3 comments1 min readLW link

(www.jefftk.com)

Mati’s introduction to pausing giant AI experiments

Mati_Roy3 Apr 2023 15:56 UTC

7 points

0 comments2 min readLW link

Superintelligence will outsmart us or it isn’t superintelligence

Neil 3 Apr 2023 15:01 UTC

−4 points

4 comments1 min readLW link

AI-kills-everyone scenarios require robotic infrastructure, but not necessarily nanotech

avturchin3 Apr 2023 12:45 UTC

54 points

47 comments4 min readLW link

Orthogonality is expensive

beren3 Apr 2023 10:20 UTC

43 points

9 comments3 min readLW link

Repeated Play of Imperfect Newcomb’s Paradox in Infra-Bayesian Physicalism

Sven Nilsen3 Apr 2023 10:06 UTC

2 points

0 comments2 min readLW link

Effective Altruism Virtual Programs Apr-May 2023

Yve Nichols-Evans3 Apr 2023 6:40 UTC

1 point

0 comments1 min readLW link

Board Game Theory

Optimization Process3 Apr 2023 6:23 UTC

8 points

0 comments3 min readLW link

Planecrash Podcast

planecrashpodcast3 Apr 2023 4:34 UTC

10 points

5 comments1 min readLW link

[Question] I’m just starting to grasp Shard Theory. Is that a normal feeling?

twkaiser3 Apr 2023 3:08 UTC

−20 points

1 comment1 min readLW link

Rules for living in a 99.9+% lizardman world

at_the_zoo3 Apr 2023 2:39 UTC

−1 points

12 comments1 min readLW link

The Friendly Drunk Fool Alignment Strategy

JenniferRM3 Apr 2023 1:26 UTC

31 points

19 comments11 min readLW link

Slack Group: Rationalist Startup Founders

Adam Zerner3 Apr 2023 0:44 UTC

31 points

2 comments3 min readLW link

Orthogonality is Expensive

DragonGod3 Apr 2023 0:43 UTC

21 points

3 comments1 min readLW link

(www.beren.io)

GTP4 capable of limited recursive improving?

Boris Kashirin2 Apr 2023 21:38 UTC

2 points

3 comments1 min readLW link

[Question] Scared about the future of AI

eitan weiss2 Apr 2023 20:37 UTC

−1 points

0 comments1 min readLW link

“a dialogue with myself concerning eliezer yudkowsky” (not author)

the gears to ascension2 Apr 2023 20:12 UTC

13 points

18 comments3 min readLW link