All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb Mar Apr MayJunJul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 91011 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Expert trap: Why is it happening? (Part 2 of 3) – how hindsight, hierarchy, and confirmation biases break conductivity and accuracy of knowledge

Paweł Sysiak9 Jun 2023 23:00 UTC

3 points

0 comments7 min readLW link

Expert trap: What is it? (Part 1 of 3) – how hindsight, hierarchy, and confirmation biases break conductivity and accuracy of knowledge

Paweł Sysiak9 Jun 2023 23:00 UTC

6 points

2 comments8 min readLW link

[Question] How accurate is data about past earth temperatures?

tailcalled9 Jun 2023 21:29 UTC

10 points

2 comments1 min readLW link

Proxi-Antipodes: A Geometrical Intuition For The Difficulty Of Aligning AI With Multitudinous Human Values

Matthew_Opitz9 Jun 2023 21:21 UTC

7 points

0 comments5 min readLW link

Why AI may not save the World

Alberto Zannoni9 Jun 2023 17:42 UTC

0 points

0 comments4 min readLW link

(a16z.com)

You can now listen to the “AI Safety Fundamentals” courses

peter_hartree9 Jun 2023 16:45 UTC

6 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

Exploring Concept-Specific Slices in Weight Matrices for Network Interpretability

DuncanFowler9 Jun 2023 16:39 UTC

1 point

0 comments6 min readLW link

A plea for solutionism on AI safety

jasoncrawford9 Jun 2023 16:29 UTC

72 points

6 comments6 min readLW link

(rootsofprogress.org)

Michael Shellenberger: US Has 12 Or More Alien Spacecraft, Say Military And Intelligence Contractors

lc9 Jun 2023 16:11 UTC

12 points

31 comments3 min readLW link

(public.substack.com)

Improvement on MIRI’s Corrigibility

WCargo and Charbel-Raphaël

9 Jun 2023 16:10 UTC

54 points

8 comments13 min readLW link

D&D.Sci 5E: Return of the League of Defenders Evaluation & Ruleset

aphyer9 Jun 2023 15:25 UTC

30 points

8 comments6 min readLW link

InternLM—China’s Best (Unverified)

Lao Mein9 Jun 2023 7:39 UTC

51 points

4 comments1 min readLW link

[Question] Mark for follow up?

JNS9 Jun 2023 5:59 UTC

5 points

4 comments2 min readLW link

Bringing Little Kids to Contra Dances

jefftk9 Jun 2023 2:20 UTC

24 points

0 comments2 min readLW link

(www.jefftk.com)

[Question] (solved) how do i find others’ shortform posts?

kuira9 Jun 2023 2:15 UTC

1 point

1 comment1 min readLW link

[Question] AI Rights: In your view, what would be required for an AGI to gain rights and protections from the various Governments of the World?

Super AGI9 Jun 2023 1:24 UTC

10 points

26 comments1 min readLW link

A comparison of causal scrubbing, causal abstractions, and related methods

Erik Jenner, Adrià Garriga-alonso and Egor Zverev

8 Jun 2023 23:40 UTC

73 points

3 comments22 min readLW link

Updates and Reflections on Optimal Exercise after Nearly a Decade

romeostevensit8 Jun 2023 23:02 UTC

213 points

57 comments2 min readLW link 1 review

Takeaways from the Mechanistic Interpretability Challenges

scasper8 Jun 2023 18:56 UTC

94 points

5 comments6 min readLW link

Leave an Emotional Line of Retreat

Johannes C. Mayer8 Jun 2023 18:36 UTC

23 points

1 comment1 min readLW link

Current AI harms are also sci-fi

Christopher King8 Jun 2023 17:49 UTC

26 points

3 comments1 min readLW link

Two Ways To Reduce Unhappiness That Comes From Distorted Views of Reality

Anne Hsu8 Jun 2023 17:43 UTC

3 points

0 comments7 min readLW link

Collaboration in Science: Happier People ↔ Better Research

nadinespy8 Jun 2023 17:42 UTC

3 points

0 comments32 min readLW link

Biomimetic alignment: Alignment between animal genes and animal brains as a model for alignment between humans and AI systems

geoffreymiller8 Jun 2023 16:05 UTC

10 points

1 comment16 min readLW link

A potentially high impact differential technological development area

Noosphere898 Jun 2023 14:33 UTC

5 points

2 comments2 min readLW link

[Question] Question for Prediction Market people: where is the money supposed to come from?

Robert_AIZI8 Jun 2023 13:58 UTC

25 points

26 comments1 min readLW link

AI #15: The Principle of Charity

Zvi8 Jun 2023 12:10 UTC

73 points

16 comments44 min readLW link

(thezvi.wordpress.com)

if you’re reading this it’s too late (a new theory on what is causing the Great Stagnation)

rogersbacon8 Jun 2023 11:49 UTC

−10 points

2 comments13 min readLW link

(www.secretorum.life)

[Linkpost] Scaling laws for language encoding models in fMRI

Bogdan Ionut Cirstea8 Jun 2023 10:52 UTC

30 points

0 comments1 min readLW link

Transformative AI is a process

meijer19738 Jun 2023 8:57 UTC

2 points

0 comments5 min readLW link

Crisis of Faith case study: beyond reductionism?

MalcolmOcean8 Jun 2023 6:11 UTC

6 points

9 comments19 min readLW link

I wrote this because of watermelon

Arti8 Jun 2023 3:55 UTC

4 points

2 comments1 min readLW link

Learning Transformer Programs [Linkpost]

aog8 Jun 2023 0:16 UTC

7 points

0 comments1 min readLW link

(arxiv.org)

What will GPT-2030 look like?

jsteinhardt7 Jun 2023 23:40 UTC

185 points

43 comments23 min readLW link

(bounded-regret.ghost.io)

Progress links and tweets, 2023-06-07

jasoncrawford7 Jun 2023 23:26 UTC

11 points

0 comments1 min readLW link

(rootsofprogress.org)

LEAst-squares Concept Erasure (LEACE)

tricky_labyrinth7 Jun 2023 21:51 UTC

68 points

10 comments1 min readLW link

(twitter.com)

Proposal: Tune LLMs to Use Calibrated Language

OneManyNone7 Jun 2023 21:05 UTC

9 points

0 comments5 min readLW link

A moral backlash against AI will probably slow down AGI development

geoffreymiller7 Jun 2023 20:39 UTC

51 points

10 comments14 min readLW link

An Exercise to Build Intuitions on AGI Risk

Lauro Langosco7 Jun 2023 18:35 UTC

52 points

3 comments8 min readLW link

Elon talked with senior Chinese leadership about AI X-risk

ChristianKl7 Jun 2023 15:02 UTC

47 points

2 comments1 min readLW link

(www.youtube.com)

Article Summary: Current and Near-Term AI as a Potential Existential Risk Factor

André Ferretti7 Jun 2023 13:51 UTC

28 points

3 comments1 min readLW link

(dl.acm.org)

gamers beware: modded Minecraft has new malware

the gears to ascension7 Jun 2023 13:49 UTC

14 points

5 comments1 min readLW link

(github.com)

Launching Lightspeed Grants (Apply by July 6th)

habryka7 Jun 2023 2:53 UTC

211 points

42 comments5 min readLW link

Cultivate an obsession with the object level

Richard_Ngo7 Jun 2023 1:39 UTC

77 points

4 comments3 min readLW link

How to Slow AI Development

PeterMcCluskey7 Jun 2023 0:29 UTC

20 points

0 comments5 min readLW link

(bayesianinvestor.com)

[Question] Killing Recurrent Memory Over Self Attention?

Del Nobolo6 Jun 2023 23:02 UTC

3 points

0 comments1 min readLW link

[Job Ad] SERI MATS is (still) hiring for our summer program

Ryan Kidd and zanekay

6 Jun 2023 21:07 UTC

12 points

0 comments7 min readLW link

Why I am not a longtermist (May 2022)

boazbarak6 Jun 2023 20:36 UTC

38 points

19 comments9 min readLW link

(windowsontheory.org)

Society Library seeking contributions for canonical AI Safety debate map

Jarred Filmer6 Jun 2023 18:15 UTC

36 points

0 comments1 min readLW link

(www.societylibrary.org)

A Playbook for AI Risk Reduction (focused on misaligned AI)

HoldenKarnofsky6 Jun 2023 18:05 UTC

90 points

42 comments14 min readLW link 1 review