All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 567 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Who Aligns the Alignment Researchers?

Ben Smith5 Mar 2023 23:22 UTC

49 points

0 comments11 min readLW link

Startups are like firewood

Adam Zerner5 Mar 2023 23:09 UTC

26 points

2 comments3 min readLW link

A concerning observation from media coverage of AI industry dynamics

Justin Olive5 Mar 2023 21:38 UTC

8 points

3 comments3 min readLW link

Steven Pinker on ChatGPT and AGI (Feb 2023)

Evan R. Murphy5 Mar 2023 21:34 UTC

11 points

8 comments1 min readLW link

(news.harvard.edu)

Is it time to talk about AI doomsday prepping yet?

bokov5 Mar 2023 21:17 UTC

0 points

8 comments1 min readLW link

Coordination explosion before intelligence explosion...?

tailcalled5 Mar 2023 20:48 UTC

47 points

9 comments2 min readLW link

The Ogdoad

Tristan Miano5 Mar 2023 20:01 UTC

−15 points

1 comment37 min readLW link

[Question] What are some good ways to heighten my emotions?

oh543215 Mar 2023 18:06 UTC

5 points

5 comments1 min readLW link

Research proposal: Leveraging Jungian archetypes to create values-based models

MiguelDev5 Mar 2023 17:39 UTC

5 points

2 comments2 min readLW link

Abusing Snap Circuits IC

jefftk5 Mar 2023 17:00 UTC

19 points

3 comments3 min readLW link

(www.jefftk.com)

Do humans derive values from fictitious imputed coherence?

TsviBT5 Mar 2023 15:23 UTC

56 points

11 comments14 min readLW link

The Inner-Compass Theorem

Tristan Miano5 Mar 2023 15:21 UTC

−18 points

12 comments16 min readLW link

Halifax Monthly Meetup: AI Safety Discussion

Ideopunk5 Mar 2023 12:42 UTC

10 points

0 comments1 min readLW link

Why kill everyone?

arisAlexis5 Mar 2023 11:53 UTC

7 points

5 comments2 min readLW link

Selective, Corrective, Structural: Three Ways of Making Social Systems Work

Said Achmiz5 Mar 2023 8:45 UTC

105 points

13 comments2 min readLW link

Substitute goods for leisure are abundant

Adam Zerner5 Mar 2023 3:45 UTC

20 points

7 comments5 min readLW link

[Question] Does polyamory at a workplace turn nepotism up to eleven?

Viliam5 Mar 2023 0:57 UTC

52 points

11 comments2 min readLW link

Why We MUST Build an (aligned) Artificial Superintelligence That Takes Over Human Society—A Thought Experiment

twkaiser5 Mar 2023 0:47 UTC

−13 points

12 comments2 min readLW link

Forecasts on Moore v Harper from Samotsvety

gregjustice5 Mar 2023 0:47 UTC

7 points

0 comments1 min readLW link

(samotsvety.org)

Why Not Just… Build Weak AI Tools For AI Alignment Research?

johnswentworth5 Mar 2023 0:12 UTC

188 points

18 comments6 min readLW link

Consciousness is irrelevant—instead solve alignment by asking this question

Oliver Siegel4 Mar 2023 22:06 UTC

−10 points

6 comments1 min readLW link

More money with less risk: sell services instead of model access

lemonhope4 Mar 2023 20:51 UTC

9 points

3 comments1 min readLW link

Contra “Strong Coherence”

DragonGod4 Mar 2023 20:05 UTC

39 points

24 comments1 min readLW link

The Practitioner’s Path 2.0: A new framework for structured self-improvement

Evenflair4 Mar 2023 19:19 UTC

32 points

2 comments11 min readLW link

(guildoftherose.org)

The Benefits of Distillation in Research

Jonas Hallgren4 Mar 2023 17:45 UTC

15 points

2 comments5 min readLW link

Optimal Music Choice

mbazzani4 Mar 2023 17:26 UTC

5 points

0 comments1 min readLW link

Why don’t more people talk about ecological psychology?

Ppau4 Mar 2023 17:03 UTC

21 points

10 comments7 min readLW link

Switching to Electric Mandolin

jefftk4 Mar 2023 15:40 UTC

16 points

1 comment1 min readLW link

(www.jefftk.com)

Predictive Performance on Metaculus vs. Manifold Markets

nikos4 Mar 2023 8:10 UTC

18 points

0 comments5 min readLW link

Contra Hanson on AI Risk

Liron4 Mar 2023 8:02 UTC

36 points

23 comments8 min readLW link

Bite Sized Tasks

Johannes C. Mayer4 Mar 2023 3:31 UTC

18 points

2 comments2 min readLW link

How popular is ChatGPT? Part 2: slower growth than Pokémon GO

Richard Korzekwa 3 Mar 2023 23:40 UTC

42 points

4 comments6 min readLW link

(aiimpacts.org)

Acausal normalcy

Andrew_Critch3 Mar 2023 23:34 UTC

203 points

40 comments8 min readLW link 1 review

Comments on OpenAI’s “Planning for AGI and beyond”

So8res3 Mar 2023 23:01 UTC

149 points

2 comments14 min readLW link

Why are counterfactuals elusive?

Martín Soto3 Mar 2023 20:13 UTC

14 points

6 comments2 min readLW link

Situational awareness in Large Language Models

Simon Möller3 Mar 2023 18:59 UTC

32 points

2 comments7 min readLW link

AI Governance & Strategy: Priorities, talent gaps, & opportunities

Orpheus163 Mar 2023 18:09 UTC

56 points

2 comments4 min readLW link

Measuring Ads Opt-Out Compliance

jefftk3 Mar 2023 16:00 UTC

18 points

2 comments2 min readLW link

(www.jefftk.com)

ChatGPT tells stories, and a note about reverse engineering: A Working Paper

Bill Benzon3 Mar 2023 15:12 UTC

3 points

0 comments3 min readLW link

Group Wiki Walk

Screwtape3 Mar 2023 15:10 UTC

9 points

0 comments3 min readLW link

Robin Hanson’s latest AI risk position statement

Liron3 Mar 2023 14:25 UTC

55 points

18 comments1 min readLW link

(www.overcomingbias.com)

A reply to Byrnes on the Free Energy Principle

Roman Leventov3 Mar 2023 13:03 UTC

28 points

16 comments14 min readLW link

Sydney can play chess and kind of keep track of the board state

Erik Jenner3 Mar 2023 9:39 UTC

64 points

19 comments6 min readLW link

[Fiction] The boy in the glass dome

Kaj_Sotala3 Mar 2023 7:50 UTC

28 points

0 comments2 min readLW link

(kajsotala.fi)

The Waluigi Effect (mega-post)

Cleo Nardo3 Mar 2023 3:22 UTC

648 points

188 comments16 min readLW link

Aspiring AI safety researchers should ~argmax over AGI timelines

Ryan Kidd3 Mar 2023 2:04 UTC

29 points

8 comments2 min readLW link

ACX/SSC/LW meetup

Épiphanie Gédéon2 Mar 2023 23:37 UTC

8 points

0 comments1 min readLW link

Results Prediction Thread About How Different Factors Affect AI X-Risk

MrThink2 Mar 2023 22:13 UTC

9 points

0 comments2 min readLW link

Why I’m not into the Free Energy Principle

Steven Byrnes2 Mar 2023 19:27 UTC

170 points

55 comments9 min readLW link 1 review

[Question] Lost in the sauce

JungleTact1cs2 Mar 2023 16:58 UTC

−5 points

12 comments1 min readLW link