All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 232425 26 27 28 29 30 31

China’s position on autonomous weapons

bhauth23 Aug 2023 22:20 UTC

17 points

2 comments1 min readLW link

(academic.oup.com)

Diet Experiment Preregistration: Long-term water fasting + seed oil removal

lc23 Aug 2023 22:08 UTC

56 points

18 comments1 min readLW link

The Low-Hanging Fruit Prior and sloped valleys in the loss landscape

Dmitry Vaintrob and Nina Panickssery

23 Aug 2023 21:12 UTC

84 points

1 comment13 min readLW link

Governing, Fast and Slow

Carson23 Aug 2023 20:01 UTC

3 points

0 comments3 min readLW link

A problem with the most recently published version of CEV

ThomasCederborg23 Aug 2023 18:05 UTC

18 points

10 comments8 min readLW link 1 review

[Question] Which paths to powerful AI should be boosted?

Zach Stein-Perlman23 Aug 2023 16:00 UTC

5 points

1 comment1 min readLW link

A Theory of Laughter

Steven Byrnes23 Aug 2023 15:05 UTC

105 points

18 comments28 min readLW link

Why Is No One Trying To Align Profit Incentives With Alignment Research?

Prometheus23 Aug 2023 13:16 UTC

51 points

11 comments4 min readLW link

Exploring the Responsible Path to AI Research in the Philippines

MiguelDev23 Aug 2023 8:44 UTC

6 points

0 comments6 min readLW link

[Question] Do agents with (mutually known) identical utility functions but irreconcilable knowledge sometimes fight?

mako yass23 Aug 2023 8:13 UTC

14 points

13 comments1 min readLW link

South Bay ACX/SSC Fall Meetups Everywhere

allisona23 Aug 2023 3:00 UTC

3 points

0 comments1 min readLW link

Separate the truth from your wishes

Jacob G-W23 Aug 2023 0:52 UTC

6 points

3 comments1 min readLW link

(jacobgw.com)

Implications of evidential cooperation in large worlds

Lukas Finnveden23 Aug 2023 0:43 UTC

39 points

4 comments17 min readLW link

(lukasfinnveden.substack.com)

South Bay Casual Group Walk

allisona22 Aug 2023 22:43 UTC

7 points

2 comments1 min readLW link

Walk while you talk: don’t balk at “no chalk”

dkl922 Aug 2023 21:27 UTC

42 points

10 comments2 min readLW link

(dkl9.net)

State of Generally Available Self-Driving

jefftk22 Aug 2023 18:50 UTC

66 points

6 comments2 min readLW link

(www.jefftk.com)

Seth Explains Consciousness

Jacob Falkovich22 Aug 2023 18:06 UTC

39 points

130 comments14 min readLW link 1 review

(putanumonit.com)

ChatGPT challenges the case for human irrationality

Kevin Dorst22 Aug 2023 12:46 UTC

3 points

10 comments7 min readLW link

(kevindorst.substack.com)

[Question] Does one have reason to believe the simulation hypothesis is probably true?

kuira22 Aug 2023 8:34 UTC

1 point

20 comments1 min readLW link

The Joan of Arc Challenge For Objective List Theory

Bentham's Bulldog22 Aug 2023 8:01 UTC

−2 points

4 comments10 min readLW link

The Lopsided Lives Argument For Hedonism About Well-being

Bentham's Bulldog22 Aug 2023 7:59 UTC

−2 points

8 comments22 min readLW link

Causality and a Cost Semantics for Neural Networks

scottviteri21 Aug 2023 21:02 UTC

22 points

1 comment1 min readLW link

Ideas for improving epistemics in AI safety outreach

mic21 Aug 2023 19:55 UTC

64 points

6 comments3 min readLW link

Rice’s Theorem says that AIs can’t determine much from studying AI source code

Michael Weiss-Malik21 Aug 2023 19:05 UTC

−12 points

4 comments1 min readLW link

Large Language Models will be Great for Censorship

Ethan Edwards21 Aug 2023 19:03 UTC

185 points

14 comments8 min readLW link

(ethanedwards.substack.com)

“Throwing Exceptions” Is A Strange Programming Pattern

Thoth Hermes21 Aug 2023 18:50 UTC

−2 points

13 comments6 min readLW link

(thothhermes.substack.com)

[Question] Which possible AI systems are relatively safe?

Zach Stein-Perlman21 Aug 2023 17:00 UTC

42 points

20 comments1 min readLW link

Self-shutdown AI

Jan Betley21 Aug 2023 16:48 UTC

13 points

2 comments2 min readLW link

Contextual Translations—Attempt 1

Varshul Gupta21 Aug 2023 14:30 UTC

−1 points

0 comments2 min readLW link

(dubverseblack.substack.com)

DIY Deliberate Practice

lynettebye21 Aug 2023 12:22 UTC

64 points

4 comments5 min readLW link

(lynettebye.com)

Downstairs Opening: 2br Apartment

jefftk21 Aug 2023 0:50 UTC

8 points

2 comments3 min readLW link

(www.jefftk.com)

Efficiency and resource use scaling parity

Ege Erdil21 Aug 2023 0:18 UTC

51 points

1 comment4 min readLW link 1 review

Ruining an expected-log-money maximizer

philh20 Aug 2023 21:20 UTC

33 points

33 comments1 min readLW link 1 review

(reasonableapproximation.net)

Steven Wolfram on AI Alignment

Bill Benzon20 Aug 2023 19:49 UTC

66 points

15 comments4 min readLW link

[Question] What value does personal prediction tracking have?

fx20 Aug 2023 18:43 UTC

8 points

3 comments1 min readLW link

Jan Kulveit’s Corrigibility Thoughts Distilled

brook20 Aug 2023 17:52 UTC

22 points

1 comment5 min readLW link

Memetic Judo #3: The Intelligence of Stochastic Parrots v.2

Max TK20 Aug 2023 15:18 UTC

8 points

33 comments6 min readLW link

ACX/SSC Boulder meetup- September 23

Josh Sacks20 Aug 2023 14:16 UTC

1 point

4 comments1 min readLW link

“Dirty concepts” in AI alignment discourses, and some guesses for how to deal with them

Nora_Ammann and peckzy

20 Aug 2023 9:13 UTC

67 points

4 comments3 min readLW link

Call for Papers on Global AI Governance from the UN

Chris_Leong20 Aug 2023 8:56 UTC

19 points

0 comments1 min readLW link

(www.linkedin.com)

How do I read things on the internet

Vlad Sitalo20 Aug 2023 5:43 UTC

17 points

2 comments8 min readLW link

(vlad.roam.garden)

AI Forecasting: Two Years In

jsteinhardt19 Aug 2023 23:40 UTC

72 points

15 comments11 min readLW link

(bounded-regret.ghost.io)

Four management/leadership book summaries

Nikola Jurkovic19 Aug 2023 23:38 UTC

26 points

2 comments7 min readLW link

Interpreting a dimensionality reduction of a collection of matrices as two positive semidefinite block diagonal matrices

Joseph Van Name19 Aug 2023 19:52 UTC

16 points

2 comments5 min readLW link

Will AI kill everyone? Here’s what the godfathers of AI have to say [RA video]

Writer19 Aug 2023 17:29 UTC

58 points

8 comments2 min readLW link

(youtu.be)

Ten variations on red-pill-blue-pill

Richard_Kennaway19 Aug 2023 16:34 UTC

33 points

34 comments3 min readLW link

Are we running out of new music/movies/art from a metaphysical perspective? (updated)

stephen_s19 Aug 2023 16:24 UTC

4 points

23 comments1 min readLW link

[Question] Any ideas for a prediction market observable that quantifies “culture-warisation”?

Ppau19 Aug 2023 15:11 UTC

6 points

1 comment1 min readLW link

[Question] Clarifying how misalignment can arise from scaling LLMs

Util19 Aug 2023 14:16 UTC

3 points

1 comment1 min readLW link

Chess as a case study in hidden capabilities in ChatGPT

AdamYedidia19 Aug 2023 6:35 UTC

47 points

32 comments6 min readLW link