All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 567 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Auto-GPT: Open-sourced disaster?

awg5 Apr 2023 22:46 UTC

23 points

18 comments1 min readLW link

(github.com)

The Orthogonality Thesis is Not Obviously True

Bentham's Bulldog5 Apr 2023 21:06 UTC

3 points

80 comments9 min readLW link

Williams-Beuren Syndrome: Frendly Mutations

Takk5 Apr 2023 20:59 UTC

−1 points

1 comment1 min readLW link

OpenAI: Our approach to AI safety

Jacob G-W5 Apr 2023 20:26 UTC

1 point

1 comment1 min readLW link

(openai.com)

Why Are Maximum Entropy Distributions So Ubiquitous?

johnswentworth5 Apr 2023 20:12 UTC

69 points

6 comments9 min readLW link

“On Living in an Atomic Age”, by C.S. Lewis (1948)

tjaffee5 Apr 2023 18:34 UTC

18 points

3 comments8 min readLW link

(hebrew-streams.org)

Eliezer Yudkowsky’s Letter in Time Magazine

Zvi5 Apr 2023 18:00 UTC

217 points

86 comments14 min readLW link

(thezvi.wordpress.com)

Dark Artificial Intelligence

FrankAI5 Apr 2023 17:37 UTC

0 points

0 comments4 min readLW link

[Question] Best arguments against instrumental convergence?

luke_emberson5 Apr 2023 17:06 UTC

5 points

7 comments1 min readLW link

Progress links and tweets, 2023-04-05

jasoncrawford5 Apr 2023 16:18 UTC

20 points

0 comments2 min readLW link

(rootsofprogress.org)

Universality and Hidden Information in Concept Bottleneck Models

Hoagy5 Apr 2023 14:00 UTC

23 points

0 comments11 min readLW link

AI safety and the security mindset: user interface design, red-teams, formal verification

Allison Duettmann5 Apr 2023 11:33 UTC

35 points

0 comments8 min readLW link

ICA Simulacra

Ozyrus5 Apr 2023 6:41 UTC

26 points

2 comments7 min readLW link

AGI deployment as an act of aggression

dr_s5 Apr 2023 6:39 UTC

28 points

30 comments13 min readLW link

A Brief Introduction to Algorithmic Common Intelligence, ACI . 1

Akira Pyinya5 Apr 2023 5:43 UTC

−2 points

1 comment2 min readLW link

46% of US adults at least “somewhat concerned” about AI extinction risk.

Foyle5 Apr 2023 5:25 UTC

1 point

0 comments1 min readLW link

[Question] Has anyone thought about how to proceed now that AI notkilleveryoneism is becoming more relevant/is approaching the Overton window?

metachirality5 Apr 2023 3:06 UTC

11 points

8 comments1 min readLW link

Empathy bandaid for immediate AI catastrophe

installgentoo5 Apr 2023 2:12 UTC

1 point

2 comments1 min readLW link

“Corrigibility at some small length” by dath ilan

Christopher King5 Apr 2023 1:47 UTC

33 points

3 comments9 min readLW link

(www.glowfic.com)

New survey: 46% of Americans are concerned about extinction from AI; 69% support a six-month pause in AI development

Orpheus165 Apr 2023 1:26 UTC

47 points

9 comments1 min readLW link

(today.yougov.com)

Is AGI suicidality the golden ray of hope?

Alex Kirko4 Apr 2023 23:29 UTC

−18 points

4 comments1 min readLW link

Recontextualizing the Risks of AI in More Predictable Outcomes

ignorepeter4 Apr 2023 23:28 UTC

−19 points

2 comments5 min readLW link

LW Team is adjusting moderation policy

Raemon4 Apr 2023 20:41 UTC

307 points

185 comments3 min readLW link

Excessive AI growth-rate yields little socio-economic benefit.

Cleo Nardo4 Apr 2023 19:13 UTC

27 points

22 comments4 min readLW link

Penalize Model Complexity Via Self-Distillation

research_prime_space4 Apr 2023 18:52 UTC

15 points

7 comments1 min readLW link

The One Heresy to Rule Them All

rogersbacon4 Apr 2023 18:23 UTC

−22 points

0 comments3 min readLW link

(www.secretorum.life)

Giant (In)scrutable Matrices: (Maybe) the Best of All Possible Worlds

1a3orn4 Apr 2023 17:39 UTC

214 points

38 comments5 min readLW link 1 review

Play My Futarchy/Prediction Market Mafia Game

Arjun Panickssery4 Apr 2023 16:12 UTC

21 points

2 comments1 min readLW link

(arjunpanickssery.substack.com)

[Question] Steelman / Ideological Turing Test of Yann LeCun’s AI X-Risk argument?

Aryeh Englander4 Apr 2023 15:53 UTC

26 points

14 comments1 min readLW link

Given the Restrict Act, Don’t Ban TikTok

Zvi4 Apr 2023 14:40 UTC

97 points

9 comments4 min readLW link

(thezvi.wordpress.com)

Running many AI variants to find correct goal generalization

avturchin4 Apr 2023 14:16 UTC

20 points

3 comments1 min readLW link

Invocations: The Other Capabilities Overhang?

Robert_AIZI4 Apr 2023 13:38 UTC

29 points

4 comments4 min readLW link

(aizi.substack.com)

Wanted: Mental Health Program Manager at Rethink Wellbeing

Inga G.4 Apr 2023 11:49 UTC

7 points

0 comments2 min readLW link

Strategies to Prevent AI Annihilation

lastchanceformankind4 Apr 2023 8:59 UTC

−2 points

0 comments4 min readLW link

ACX Meetup Madrid

Pablo Villalobos4 Apr 2023 8:53 UTC

5 points

2 comments1 min readLW link

[Question] Best Ways to Try to Get Funding for Alignment Research?

RGRGRG4 Apr 2023 6:35 UTC

10 points

6 comments1 min readLW link

Consider applying to a 2-week alignment project with former GitHub CEO

Bird Concept4 Apr 2023 6:20 UTC

42 points

0 comments1 min readLW link

(twitter.com)

On how it feels generating art with DALL-E

cortrinkau4 Apr 2023 4:13 UTC

5 points

0 comments3 min readLW link

(cortrinkau.bearblog.dev)

AI Summer Harvest

Cleo Nardo4 Apr 2023 3:35 UTC

130 points

10 comments1 min readLW link

How to respond to the recent condemnations of the rationalist community

Christopher King4 Apr 2023 1:42 UTC

−2 points

7 comments4 min readLW link

Steering systems

Max H4 Apr 2023 0:56 UTC

51 points

1 comment15 min readLW link

ChatGPT Suggests Listening To Russell & Yudkowsky

JenniferRM4 Apr 2023 0:30 UTC

9 points

1 comment17 min readLW link

Complex Systems are Hard to Control

jsteinhardt4 Apr 2023 0:00 UTC

43 points

5 comments10 min readLW link

(bounded-regret.ghost.io)

Apply to the Cavendish Labs Fellowship (by 4/15)

agg and derikk

3 Apr 2023 23:09 UTC

11 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

Twin Cities ACX Meetup—April 2023

Timothy M.3 Apr 2023 23:07 UTC

5 points

3 comments1 min readLW link

Communicating effectively under Knightian norms

Richard_Ngo3 Apr 2023 22:39 UTC

99 points

54 comments6 min readLW link

If interpretability research goes well, it may get dangerous

So8res3 Apr 2023 21:48 UTC

202 points

11 comments2 min readLW link

Towards empathy in RL agents and beyond: Insights from cognitive science for AI Alignment

Marc Carauleanu3 Apr 2023 19:59 UTC

15 points

6 comments1 min readLW link

(clipchamp.com)

Monthly Roundup #5: April 2023

Zvi3 Apr 2023 18:50 UTC

26 points

12 comments14 min readLW link

(thezvi.wordpress.com)

Exploring non-anthropocentric aspects of AI existential safety

mishka3 Apr 2023 18:07 UTC

11 points

1 comment3 min readLW link