All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

AllJanFeb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8910 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

ChatGPT tells stories about XP-708-DQ, Eliezer, dragons, dark sorceresses, and unaligned robots becoming aligned

Bill BenzonJan 8, 2023, 11:21 PM

6 points

2 comments18 min readLW link

Simulacra are Things

janusJan 8, 2023, 11:03 PM

63 points

7 comments2 min readLW link

[Question] GPT learning from smarter texts?

ViliamJan 8, 2023, 10:23 PM

26 points

7 comments1 min readLW link

Latent variable prediction markets mockup + designer request

tailcalledJan 8, 2023, 10:18 PM

25 points

4 comments1 min readLW link

Citability of Lesswrong and the Alignment Forum

Leon LangJan 8, 2023, 10:12 PM

48 points

2 comments1 min readLW link

I tried to learn as much Deep Learning math as I could in 24 hours

PhosphorousJan 8, 2023, 9:07 PM

31 points

2 comments7 min readLW link

[Question] What specific thing would you do with AI Alignment Research Assistant GPT?

quetzal_rainbowJan 8, 2023, 7:24 PM

47 points

9 comments1 min readLW link

[Question] Research ideas (AI Interpretability & Neurosciences) for a 2-months project

fluxJan 8, 2023, 3:36 PM

3 points

1 comment1 min readLW link

200 COP in MI: Image Model Interpretability

Neel NandaJan 8, 2023, 2:53 PM

18 points

3 comments6 min readLW link

Halifax Monthly Meetup: Moloch in the HRM

IdeopunkJan 8, 2023, 2:49 PM

10 points

0 comments1 min readLW link

Dangers of deference

TsviBTJan 8, 2023, 2:36 PM

62 points

5 comments2 min readLW link

Could evolution produce something truly aligned with its own optimization standards? What would an answer to this mean for AI alignment?

No77eJan 8, 2023, 11:04 AM

3 points

4 comments1 min readLW link

AI psychology should ground the theories of AI consciousness and inform human-AI ethical interaction design

Roman LeventovJan 8, 2023, 6:37 AM

20 points

8 comments2 min readLW link

Stop Talking to Each Other and Start Buying Things: Three Decades of Survival in the Desert of Social Media

the gears to ascensionJan 8, 2023, 4:45 AM

1 point

14 comments1 min readLW link

(catvalente.substack.com)

Can Ads be GDPR Compliant?

jefftkJan 8, 2023, 2:50 AM

39 points

10 comments7 min readLW link

(www.jefftk.com)

Feature suggestion: add a ‘clarity score’ to posts

LVSNJan 8, 2023, 1:00 AM

17 points

5 comments1 min readLW link

[Question] How do I better stick to a morning schedule?

Randomized, ControlledJan 8, 2023, 12:52 AM

8 points

8 comments1 min readLW link

Protectionism will Slow the Deployment of AI

Ben GoldhaberJan 7, 2023, 8:57 PM

30 points

6 comments2 min readLW link

David Krueger on AI Alignment in Academia, Coordination and Testing Intuitions

Michaël TrazziJan 7, 2023, 7:59 PM

13 points

0 comments4 min readLW link

(theinsideview.ai)

Looking for Spanish AI Alignment Researchers

AntbJan 7, 2023, 6:52 PM

7 points

3 comments1 min readLW link

Nothing New: Productive Reframing

adamShimiJan 7, 2023, 6:43 PM

44 points

7 comments3 min readLW link

(epistemologicalvigilance.substack.com)

[Question] Asking for a name for a symptom of rationalization

metachiralityJan 7, 2023, 6:34 PM

6 points

5 comments1 min readLW link

The Fountain of Health: a First Principles Guide to Rejuvenation

PhilJacksonJan 7, 2023, 6:34 PM

115 points

39 comments41 min readLW link

What’s wrong with the paperclips scenario?

No77eJan 7, 2023, 5:58 PM

31 points

11 comments1 min readLW link

Building a Rosetta stone for reductionism and telism (WIP)

mrcbarbierJan 7, 2023, 4:22 PM

5 points

0 comments8 min readLW link

What should a telic science look like?

mrcbarbierJan 7, 2023, 4:13 PM

10 points

0 comments11 min readLW link

Open & Welcome Thread—January 2023

DragonGodJan 7, 2023, 11:16 AM

15 points

37 comments1 min readLW link

Anchoring focalism and the Identifiable victim effect: Bias in Evaluating AGI X-Risks

RemmeltJan 7, 2023, 9:59 AM

1 point

2 comments LW link

Can ChatGPT count?

p.b.Jan 7, 2023, 7:57 AM

13 points

11 comments2 min readLW link

Benevolent AI and mental health

peter schwarzJan 7, 2023, 1:30 AM

−31 points

2 comments1 min readLW link

An Ignorant View on Ineffectiveness of AI Safety

IknownothingJan 7, 2023, 1:29 AM

14 points

7 comments3 min readLW link

Optimizing Human Collective Intelligence to Align AI

Shoshannah TekofskyJan 7, 2023, 1:21 AM

12 points

5 comments6 min readLW link

[Question] [Discussion] How Broad is the Human Cognitive Spectrum?

DragonGodJan 7, 2023, 12:56 AM

29 points

51 comments2 min readLW link

Implications of simulators

TW123Jan 7, 2023, 12:37 AM

17 points

0 comments12 min readLW link

[Linkpost] Jan Leike on three kinds of alignment taxes

Orpheus16Jan 6, 2023, 11:57 PM

27 points

2 comments3 min readLW link

(aligned.substack.com)

The Limit of Language Models

DragonGodJan 6, 2023, 11:53 PM

44 points

26 comments4 min readLW link

Why didn’t we get the four-hour workday?

jasoncrawfordJan 6, 2023, 9:29 PM

141 points

34 comments6 min readLW link

(rootsofprogress.org)

AI security might be helpful for AI alignment

Igor IvanovJan 6, 2023, 8:16 PM

36 points

1 comment2 min readLW link

Categorizing failures as “outer” or “inner” misalignment is often confused

Rohin ShahJan 6, 2023, 3:48 PM

93 points

21 comments8 min readLW link

Definitions of “objective” should be Probable and Predictive

Rohin ShahJan 6, 2023, 3:40 PM

43 points

27 comments12 min readLW link

200 COP in MI: Techniques, Tooling and Automation

Neel NandaJan 6, 2023, 3:08 PM

13 points

0 comments15 min readLW link

Ball Square Station and Ridership Maximization

jefftkJan 6, 2023, 1:20 PM

13 points

0 comments1 min readLW link

(www.jefftk.com)

Childhood Roundup #1

ZviJan 6, 2023, 1:00 PM

84 points

27 comments8 min readLW link

(thezvi.wordpress.com)

AI improving AI [MLAISU W01!]

Esben Kran6 Jan 2023 11:13 UTC

5 points

0 comments4 min readLW link

(newsletter.apartresearch.com)

AI Safety Camp, Virtual Edition 2023

Linda Linsefors6 Jan 2023 11:09 UTC

40 points

10 comments3 min readLW link

(aisafety.camp)

Kakistocuriosity

LVSN6 Jan 2023 7:38 UTC

7 points

3 comments1 min readLW link

AI Safety Camp: Machine Learning for Scientific Discovery

Eleni Angelou6 Jan 2023 3:21 UTC

3 points

0 comments1 min readLW link

Metaculus Year in Review: 2022

ChristianWilliams6 Jan 2023 1:23 UTC

6 points

0 comments LW link

UDASSA

Jacob Falkovich6 Jan 2023 1:07 UTC

27 points

8 comments10 min readLW link

The Involuntary Pacifists

Capybasilisk6 Jan 2023 0:28 UTC

11 points

3 comments2 min readLW link