Exams-Only Universities

Mati_Roy · 6 Nov 2022 22:05 UTC
80 points
40 comments · 2 min read · LW link

Democracy Is in Danger, but Not for the Reasons You Think

ExCeph · 6 Nov 2022 21:15 UTC
−7 points
4 comments · 12 min read · LW link
(ginnungagapfoundation.wordpress.com)

Playground Game: Monster

jefftk · 6 Nov 2022 16:00 UTC
14 points
4 comments · 1 min read · LW link
(www.jefftk.com)

[Question] Has Pascal’s Mugging problem been completely solved yet?

EniScien · 6 Nov 2022 12:52 UTC
3 points
11 comments · 1 min read · LW link

[Question] Should I Pursue a PhD?

DragonGod · 6 Nov 2022 10:58 UTC
8 points
8 comments · 2 min read · LW link

You won’t solve alignment without agent foundations

Mikhail Samin · 6 Nov 2022 8:07 UTC
29 points
3 comments · 8 min read · LW link

Word-Distance vs Idea-Distance: The Case for Lanoitaring

Sable · 6 Nov 2022 5:25 UTC
7 points
7 comments · 7 min read · LW link
(affablyevil.substack.com)

Apple Cider Syrup

jefftk · 6 Nov 2022 2:10 UTC
11 points
6 comments · 1 min read · LW link
(www.jefftk.com)

What is epigenetics?

Metacelsus · 6 Nov 2022 1:24 UTC
78 points
4 comments · 6 min read · LW link
(denovo.substack.com)

Response

Jarred Filmer · 6 Nov 2022 1:03 UTC
29 points
2 comments · 12 min read · LW link

[Question] Has anyone increased their AGI timelines?

Darren McKee · 6 Nov 2022 0:03 UTC
39 points
12 comments · 1 min read · LW link

Takeaways from a survey on AI alignment resources

DanielFilan · 5 Nov 2022 23:40 UTC
73 points
10 comments · 6 min read · LW link · 1 review
(danielfilan.com)

Unpricable Information and Certificate Hell

eva_ · 5 Nov 2022 22:56 UTC
13 points
2 comments · 6 min read · LW link

Recommend HAIST resources for assessing the value of RLHF-related alignment research

5 Nov 2022 20:58 UTC
26 points
9 comments · 3 min read · LW link

Instead of technical research, more people should focus on buying time

5 Nov 2022 20:43 UTC
101 points
45 comments · 14 min read · LW link

Provably Honest—A First Step

Srijanak De · 5 Nov 2022 19:18 UTC
10 points
2 comments · 8 min read · LW link

Should AI focus on problem-solving or strategic planning? Why not both?

Oliver Siegel · 5 Nov 2022 19:17 UTC
−12 points
3 comments · 1 min read · LW link

How to store human values on a computer

Oliver Siegel · 5 Nov 2022 19:17 UTC
−12 points
17 comments · 1 min read · LW link

The Slippery Slope from DALLE-2 to Deepfake Anarchy

scasper · 5 Nov 2022 14:53 UTC
17 points
9 comments · 11 min read · LW link

When can a mimic surprise you? Why generative models handle seemingly ill-posed problems

David Johnston · 5 Nov 2022 13:19 UTC
8 points
4 comments · 16 min read · LW link

My summary of “Pragmatic AI Safety”

Eleni Angelou · 5 Nov 2022 12:54 UTC
3 points
0 comments · 5 min read · LW link

Review of the Challenge

SD Marlow · 5 Nov 2022 6:38 UTC
−14 points
5 comments · 2 min read · LW link

Spectrum of Independence

jefftk · 5 Nov 2022 2:40 UTC
43 points
7 comments · 1 min read · LW link
(www.jefftk.com)

[paper link] Interpreting systems as solving POMDPs: a step towards a formal understanding of agency

the gears to ascension · 5 Nov 2022 1:06 UTC
13 points
2 comments · 1 min read · LW link
(www.semanticscholar.org)

Metaculus is seeking Software Engineers

dschwarz · 5 Nov 2022 0:42 UTC
18 points
0 comments · 1 min read · LW link
(apply.workable.com)

Should we “go against nature”?

jasoncrawford · 4 Nov 2022 22:14 UTC
10 points
3 comments · 2 min read · LW link
(rootsofprogress.org)

How much should we care about non-human animals?

bokov · 4 Nov 2022 21:36 UTC
17 points
8 comments · 2 min read · LW link
(www.lesswrong.com)

For ELK truth is mostly a distraction

c.trout · 4 Nov 2022 21:14 UTC
44 points
0 comments · 21 min read · LW link

Toy Models and Tegum Products

Adam Jermyn · 4 Nov 2022 18:51 UTC
28 points
7 comments · 5 min read · LW link

Follow up to medical miracle

Elizabeth · 4 Nov 2022 18:00 UTC
77 points
5 comments · 6 min read · LW link
(acesounderglass.com)

Cross-Void Optimization

pneumynym · 4 Nov 2022 17:47 UTC
1 point
1 comment · 8 min read · LW link

Monthly Shorts 10/22

Celer · 4 Nov 2022 16:30 UTC
12 points
0 comments · 6 min read · LW link
(keller.substack.com)

Weekly Roundup #4

Zvi · 4 Nov 2022 15:00 UTC
42 points
1 comment · 6 min read · LW link
(thezvi.wordpress.com)

A new place to discuss cognitive science, ethics and human alignment

Daniel_Friedrich · 4 Nov 2022 14:34 UTC
3 points
4 comments · 2 min read · LW link
(www.facebook.com)

A newcomer’s guide to the technical AI safety field

zeshen · 4 Nov 2022 14:29 UTC
42 points
3 comments · 10 min read · LW link

[Question] Are alignment researchers devoting enough time to improving their research capacity?

Carson Jones · 4 Nov 2022 0:58 UTC
13 points
3 comments · 3 min read · LW link

[Question] Don’t you think RLHF solves outer alignment?

Charbel-Raphaël · 4 Nov 2022 0:36 UTC
9 points
23 comments · 1 min read · LW link

Mechanistic Interpretability as Reverse Engineering (follow-up to “cars and elephants”)

David Scott Krueger (formerly: capybaralet) · 3 Nov 2022 23:19 UTC
28 points
3 comments · 1 min read · LW link

[Question] Could a Supreme Court suit work to solve NEPA problems?

ChristianKl · 3 Nov 2022 21:10 UTC
15 points
0 comments · 1 min read · LW link

[Video] How having Fast Fourier Transforms sooner could have helped with Nuclear Disarmament—Veritasium

mako yass · 3 Nov 2022 21:04 UTC
17 points
1 comment · 1 min read · LW link
(www.youtube.com)

Further considerations on the Evidentialist’s Wager

Martín Soto · 3 Nov 2022 20:06 UTC
3 points
9 comments · 8 min read · LW link

AI as a Civilizational Risk Part 6/6: What can be done

PashaKamyshev · 3 Nov 2022 19:48 UTC
2 points
4 comments · 4 min read · LW link

A Mystery About High Dimensional Concept Encoding

Fabien Roger · 3 Nov 2022 17:05 UTC
46 points
13 comments · 7 min read · LW link

Why do we post our AI safety plans on the Internet?

Peter S. Park · 3 Nov 2022 16:02 UTC
4 points
4 comments · 11 min read · LW link

Multiple Deploy-Key Repos

jefftk · 3 Nov 2022 15:10 UTC
15 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Covid 11/3/22: Asking Forgiveness

Zvi · 3 Nov 2022 13:50 UTC
23 points
3 comments · 6 min read · LW link
(thezvi.wordpress.com)

Adversarial Policies Beat Professional-Level Go AIs

sanxiyn · 3 Nov 2022 13:27 UTC
31 points
35 comments · 1 min read · LW link
(goattack.alignmentfund.org)

K-types vs T-types — what priors do you have?

Cleo Nardo · 3 Nov 2022 11:29 UTC
74 points
25 comments · 7 min read · LW link

Information Markets 2: Optimally Shaped Reward Bets

eva_ · 3 Nov 2022 11:08 UTC
9 points
0 comments · 3 min read · LW link

The Rational Utilitarian Love Movement (A Historical Retrospective)

Caleb Biddulph · 3 Nov 2022 7:11 UTC
3 points
0 comments · 1 min read · LW link
(ratutilove.substack.com)