A bunch of videos in comments

the gears to ascension · 12 Jun 2023 22:31 UTC
10 points
62 comments · 1 min read · LW link

[Linkpost] The neuroconnectionist research programme

Bogdan Ionut Cirstea · 12 Jun 2023 21:58 UTC
5 points
1 comment · 1 min read · LW link

Contingency: A Conceptual Tool from Evolutionary Biology for Alignment

clem_acs · 12 Jun 2023 20:54 UTC
51 points
2 comments · 14 min read · LW link
(acsresearch.org)

Book Review: Autoheterosexuality

tailcalled · 12 Jun 2023 20:11 UTC
27 points
9 comments · 24 min read · LW link

Aura as a proprioceptive glitch

pchvykov · 12 Jun 2023 19:30 UTC
36 points
4 comments · 4 min read · LW link

Aligning Mathematical Notions of Infinity with Human Intuition

London L. · 12 Jun 2023 19:19 UTC
1 point
10 comments · 9 min read · LW link
(medium.com)

ARC is hiring theoretical researchers

12 Jun 2023 18:50 UTC
126 points
12 comments · 4 min read · LW link
(www.alignment.org)

Introduction to Towards Causal Foundations of Safe AGI

12 Jun 2023 17:55 UTC
67 points
6 comments · 4 min read · LW link

Manifold Predicted the AI Extinction Statement and CAIS Wanted it Deleted

David Chee · 12 Jun 2023 15:54 UTC
70 points
14 comments · 12 min read · LW link

Explicitness

TsviBT · 12 Jun 2023 15:05 UTC
29 points
0 comments · 15 min read · LW link

If you are too stressed, walk away from the front lines

Neil · 12 Jun 2023 14:26 UTC
42 points
14 comments · 5 min read · LW link

UK PM: $125M for AI safety

Hauke Hillebrandt · 12 Jun 2023 12:33 UTC
31 points
11 comments · 1 min read · LW link
(twitter.com)

[Question] Could induced and stabilized hypomania be a desirable mental state?

MvB · 12 Jun 2023 12:13 UTC
8 points
22 comments · 2 min read · LW link

Non-loss of control AGI-related catastrophes are out of control too

12 Jun 2023 12:01 UTC
0 points
3 comments · 24 min read · LW link

Critiques of prominent AI safety labs: Conjecture

Omega. · 12 Jun 2023 1:32 UTC
14 points
32 comments · 33 min read · LW link

why I’m anti-YIMBY

bhauth · 12 Jun 2023 0:19 UTC
20 points
45 comments · 2 min read · LW link

ACX Brno meetup #2

adekcz · 11 Jun 2023 13:53 UTC
2 points
0 comments · 1 min read · LW link

[Linkpost] Large Language Models Converge on Brain-Like Word Representations

Bogdan Ionut Cirstea · 11 Jun 2023 11:20 UTC
36 points
12 comments · 1 min read · LW link

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

likenneth · 11 Jun 2023 5:38 UTC
195 points
4 comments · 1 min read · LW link
(arxiv.org)

You Are a Computer, and No, That’s Not a Metaphor

jakej · 11 Jun 2023 5:38 UTC
12 points
1 comment · 22 min read · LW link
(sigil.substack.com)

Snake Eyes Paradox

Martin Randall · 11 Jun 2023 4:10 UTC
22 points
25 comments · 6 min read · LW link

[Question] [Mostly solved] I get distracted while reading, but can easily comprehend audio text for 8+ hours per day. What are the best AI text-to-speech readers? Alternatively, do you have other ideas for what I could do?

kuira · 11 Jun 2023 3:49 UTC
18 points
7 comments · 1 min read · LW link

The Dictatorship Problem

alyssavance · 11 Jun 2023 2:45 UTC
32 points
143 comments · 11 min read · LW link

Higher Dimension Cartesian Objects and Aligning ‘Tiling Simulators’

lukemarks · 11 Jun 2023 0:13 UTC
22 points
0 comments · 5 min read · LW link

Using Consensus Mechanisms as an approach to Alignment

Prometheus · 10 Jun 2023 23:38 UTC
9 points
2 comments · 6 min read · LW link

Humanity’s first math problem, the shallow gene pool.

archeon · 10 Jun 2023 23:09 UTC
−2 points
0 comments · 1 min read · LW link

I can see how I am Dumb

Johannes C. Mayer · 10 Jun 2023 19:18 UTC
46 points
11 comments · 5 min read · LW link

Ethodynamics of Omelas

dr_s · 10 Jun 2023 16:24 UTC
78 points
16 comments · 9 min read · LW link

Dealing with UFO claims

ChristianKl · 10 Jun 2023 15:45 UTC
3 points
32 comments · 1 min read · LW link

A Theory of Unsupervised Translation Motivated by Understanding Animal Communication

jsd · 10 Jun 2023 15:44 UTC
19 points
0 comments · 1 min read · LW link
(arxiv.org)

[Question] What are brains?

Valentine · 10 Jun 2023 14:46 UTC
10 points
22 comments · 2 min read · LW link

EY in the New York Times

Blueberry · 10 Jun 2023 12:21 UTC
6 points
14 comments · 1 min read · LW link
(www.nytimes.com)

Goal-misgeneralization is ELK-hard

rokosbasilisk · 10 Jun 2023 9:32 UTC
2 points
0 comments · 1 min read · LW link

[Question] What do beneficial TDT trades for humanity concretely look like?

Stephen Fowler · 10 Jun 2023 6:50 UTC
4 points
0 comments · 1 min read · LW link

cloud seeding doesn’t work

bhauth · 10 Jun 2023 5:14 UTC
7 points
2 comments · 1 min read · LW link

[FICTION] Unboxing Elysium: An AI’S Escape

Super AGI · 10 Jun 2023 4:41 UTC
−14 points
4 comments · 14 min read · LW link

[FICTION] Prometheus Rising: The Emergence of an AI Consciousness

Super AGI · 10 Jun 2023 4:41 UTC
−13 points
0 comments · 9 min read · LW link

an Evangelion dialogue explaining the QACI alignment plan

Tamsin Leake · 10 Jun 2023 3:28 UTC
45 points
15 comments · 43 min read · LW link
(carado.moe)

formalizing the QACI alignment formal-goal

10 Jun 2023 3:28 UTC
53 points
6 comments · 14 min read · LW link
(carado.moe)

Expert trap: Why is it happening? (Part 2 of 3) – how hindsight, hierarchy, and confirmation biases break conductivity and accuracy of knowledge

Paweł Sysiak · 9 Jun 2023 23:00 UTC
3 points
0 comments · 7 min read · LW link

Expert trap: What is it? (Part 1 of 3) – how hindsight, hierarchy, and confirmation biases break conductivity and accuracy of knowledge

Paweł Sysiak · 9 Jun 2023 23:00 UTC
6 points
2 comments · 8 min read · LW link

[Question] How accurate is data about past earth temperatures?

tailcalled · 9 Jun 2023 21:29 UTC
10 points
2 comments · 1 min read · LW link

Proxi-Antipodes: A Geometrical Intuition For The Difficulty Of Aligning AI With Multitudinous Human Values

Matthew_Opitz · 9 Jun 2023 21:21 UTC
7 points
0 comments · 5 min read · LW link

Why AI may not save the World

Alberto Zannoni · 9 Jun 2023 17:42 UTC
0 points
0 comments · 4 min read · LW link
(a16z.com)

You can now listen to the “AI Safety Fundamentals” courses

PeterH · 9 Jun 2023 16:45 UTC
6 points
0 comments · 1 min read · LW link
(forum.effectivealtruism.org)

Exploring Concept-Specific Slices in Weight Matrices for Network Interpretability

DuncanFowler · 9 Jun 2023 16:39 UTC
1 point
0 comments · 6 min read · LW link

A plea for solutionism on AI safety

jasoncrawford · 9 Jun 2023 16:29 UTC
72 points
6 comments · 6 min read · LW link
(rootsofprogress.org)

Michael Shellenberger: US Has 12 Or More Alien Spacecraft, Say Military And Intelligence Contractors

lc · 9 Jun 2023 16:11 UTC
11 points
31 comments · 3 min read · LW link
(public.substack.com)

Improvement on MIRI’s Corrigibility

9 Jun 2023 16:10 UTC
54 points
8 comments · 13 min read · LW link

D&D.Sci 5E: Return of the League of Defenders Evaluation & Ruleset

aphyer · 9 Jun 2023 15:25 UTC
29 points
8 comments · 6 min read · LW link