Update on the UK AI Summit and the UK’s Plans

Elliot Mckernon Nov 10, 2023, 2:47 PM

11 points

0 comments 8 min read LW link

Liv Boeree Ted Talk Moloch & AI

Neil Nov 10, 2023, 2:04 PM

10 points

2 comments 1 min read LW link

(m.youtube.com)

Picking Mentors For Research Programmes

Raymond Douglas Nov 10, 2023, 1:01 PM

105 points

8 comments 4 min read LW link

GPT-2030 and Catastrophic Drives: Four Vignettes

jsteinhardt Nov 10, 2023, 7:30 AM

50 points

5 comments 10 min read LW link

(bounded-regret.ghost.io)

Crock, Crocker, Crockiest

Screwtape Nov 10, 2023, 6:14 AM

21 points

4 comments 6 min read LW link

AI Timelines

habryka, Daniel Kokotajlo, Ajeya Cotra and Ege Erdil

Nov 10, 2023, 5:28 AM

300 points

136 comments 51 min read LW link 2 reviews

ACI#6: A Non-Dualistic ACI Model

Akira Pyinya Nov 9, 2023, 11:01 PM

10 points

2 comments 6 min read LW link

How I got so excited about HowTruthful

Bruce Lewis Nov 9, 2023, 6:49 PM

17 points

3 comments 5 min read LW link

The case for “Generous Tit for Tat” as the ultimate game theory strategy

positivesum Nov 9, 2023, 6:41 PM

2 points

3 comments 8 min read LW link

(tryingtruly.substack.com)

Text Posts from the Kids Group: 2021

jefftk Nov 9, 2023, 5:50 PM

38 points

1 comment 8 min read LW link

(www.jefftk.com)

AI #37: Moving Too Fast

Zvi Nov 9, 2023, 5:50 PM

53 points

5 comments 76 min read LW link

(thezvi.wordpress.com)

Learning-theoretic agenda reading list

Vanessa Kosoy Nov 9, 2023, 5:25 PM

103 points

1 comment 2 min read LW link 1 review

Open-ended/Phenomenal Ethics (TLDR)

Ryo Nov 9, 2023, 4:58 PM

3 points

0 comments 1 min read LW link

Polysemantic Attention Head in a 4-Layer Transformer

Jett Janiak, cmathw and StefanHex

Nov 9, 2023, 4:16 PM

51 points

0 comments 6 min read LW link

On OpenAI Dev Day

Zvi Nov 9, 2023, 4:10 PM

60 points

0 comments 15 min read LW link

(thezvi.wordpress.com)

Anthropical Probabilities Are Fully Explained by Difference in Possible Outcomes

Ape in the coat Nov 9, 2023, 3:34 PM

19 points

7 comments 5 min read LW link

A free to enter, 240 character, open-source iterated prisoner’s dilemma tournament

Isaac King Nov 9, 2023, 8:24 AM

64 points

19 comments 1 min read LW link

(manifold.markets)

Into AI Safety Episodes 1 & 2

jacobhaimes Nov 9, 2023, 4:36 AM

2 points

0 comments 1 min read LW link

(into-ai-safety.github.io)

Making Bad Decisions On Purpose

Screwtape Nov 9, 2023, 3:36 AM

49 points

8 comments 5 min read LW link

Metaculus’s New Sidebar Helps You Find Forecasts Faster

ChristianWilliams Nov 8, 2023, 8:56 PM

15 points

0 comments LW link

(www.metaculus.com)

Open-ended ethics of phenomena (a desiderata with universal morality)

Ryo Nov 8, 2023, 8:10 PM

1 point

0 comments 8 min read LW link

Open Agency model can solve the AI regulation dilemma

Roman Leventov Nov 8, 2023, 8:00 PM

22 points

1 comment 2 min read LW link

Gothenburg LW / ACX meetup

Stefan Nov 8, 2023, 7:52 PM

1 point

0 comments 1 min read LW link

[Question] Why is lesswrong blocking wget and curl (scrape)?

nick lacombe Nov 8, 2023, 7:42 PM

21 points

15 comments 1 min read LW link

[Question] Is there a lesswrong archive of all public posts?

nick lacombe Nov 8, 2023, 7:26 PM

12 points

7 comments 1 min read LW link

Five projects from AI Safety Hub Labs 2023

charlie_griffin Nov 8, 2023, 7:19 PM

47 points

1 comment 6 min read LW link

(www.aisafetyhub.org)

[Question] Can a stupid person become intelligent?

A. T. Nov 8, 2023, 7:01 PM

12 points

24 comments 2 min read LW link

Prosthetic Intelligence

Krantz Nov 8, 2023, 7:01 PM

7 points

9 comments 2 min read LW link

[Question] Do you have a satisfactory workflow for learning about a line of research using GPT4, Claude, etc?

ryan_b Nov 8, 2023, 6:05 PM

9 points

3 comments 1 min read LW link

What’s going on? LLMs and IS-A sentences

Bill Benzon Nov 8, 2023, 4:58 PM

6 points

15 comments 4 min read LW link

[Question] What will happen with real estate prices during a slow takeoff?

Ricardo Meneghin Nov 8, 2023, 11:58 AM

8 points

1 comment 1 min read LW link

Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models

Felix Hofstätter, Francis Rhys Ward, HarrietW, LAThomson, Ollie J, Patrik Bartak and Sam F. Brown

Nov 8, 2023, 11:37 AM

49 points

0 comments 18 min read LW link

How well does your research address the theory-practice gap?

Jonas Hallgren Nov 8, 2023, 11:27 AM

18 points

0 comments 10 min read LW link

Growth and Form in a Toy Model of Superposition

Liam Carroll and Edmund Lau

Nov 8, 2023, 11:08 AM

90 points

7 comments 14 min read LW link

Running your own workshop on handling hostile disagreements

Camille Berger Nov 8, 2023, 10:28 AM

12 points

1 comment 7 min read LW link

Thinking By The Clock

Screwtape Nov 8, 2023, 7:40 AM

197 points

29 comments 8 min read LW link 1 review

[Question] Impressions from base-GPT-4?

mishka Nov 8, 2023, 5:43 AM

25 points

25 comments 1 min read LW link

Quantopian contest, but for food intake and weight

Lucent Nov 8, 2023, 5:41 AM

40 points

9 comments 3 min read LW link

How I Think, Part Two: Distrusting Individuals

Richard Henage Nov 8, 2023, 4:06 AM

4 points

6 comments 3 min read LW link

How I Think, Part One: Investing in Fun

Richard Henage Nov 8, 2023, 4:00 AM

5 points

2 comments 5 min read LW link

Concrete positive visions for a future without AGI

Max H Nov 8, 2023, 3:12 AM

41 points

28 comments 8 min read LW link

South Bay ACX/LW/EA Meetup & Vegansgiving Potluck

IS Nov 8, 2023, 2:30 AM

10 points

0 comments 1 min read LW link

Progress links digest, 2023-11-07: Techno-optimism and more

jasoncrawford Nov 8, 2023, 2:05 AM

17 points

7 comments 11 min read LW link

(rootsofprogress.org)

Announcing Athena—Women in AI Alignment Research

Claire Short Nov 7, 2023, 9:46 PM

80 points

2 comments 3 min read LW link

Vote on Interesting Disagreements

Ben Pace Nov 7, 2023, 9:35 PM

159 points

131 comments 1 min read LW link

What is democracy for?

Johnstone Nov 7, 2023, 6:17 PM

−5 points

10 comments 7 min read LW link

Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation

Soroush Pour, rusheb, Quentin FEUILLADE--MONTIXI, Arush and scasper

Nov 7, 2023, 5:59 PM

38 points

2 comments 2 min read LW link

(arxiv.org)

Implementing Decision Theory

justinpombrio Nov 7, 2023, 5:55 PM

22 points

12 comments 3 min read LW link

Mirror, Mirror on the Wall: How Do Forecasters Fare by Their Own Call?

nikos Nov 7, 2023, 5:39 PM

14 points

5 comments 14 min read LW link

Symbiotic self-alignment of AIs.

Spiritus Dei Nov 7, 2023, 5:18 PM

1 point

0 comments 3 min read LW link