All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 232425 26 27 28 29 30 31

Kingfisher Album Kickstarter

jefftk23 Mar 2023 23:20 UTC

8 points

0 comments2 min readLW link

(www.jefftk.com)

Is your job replaceable by GPT-4? (as of March 2023)

Bezzi23 Mar 2023 22:16 UTC

18 points

6 comments1 min readLW link

ACX meetup [April]

sallatik23 Mar 2023 20:40 UTC

1 point

0 comments1 min readLW link

Feature idea: extra info about post author’s response to comments.

Nathan Helm-Burger23 Mar 2023 20:14 UTC

6 points

0 comments1 min readLW link

Limit intelligent weapons

Lucas Pfeifer23 Mar 2023 17:54 UTC

−11 points

36 comments1 min readLW link

We have to Upgrade

Jed McCaleb23 Mar 2023 17:53 UTC

132 points

35 comments2 min readLW link

The Overton Window widens: Examples of AI risk in the media

Orpheus1623 Mar 2023 17:10 UTC

107 points

24 comments6 min readLW link

GPT-4 aligning with acasual decision theory when instructed to play games, but includes a CDT explanation that’s incorrect if they differ

Christopher King23 Mar 2023 16:16 UTC

7 points

4 comments8 min readLW link

Is “FOXP2 speech & language disorder” really “FOXP2 forebrain fine-motor crappiness”?

Steven Byrnes23 Mar 2023 16:09 UTC

22 points

8 comments6 min readLW link

EAI Alignment Speaker Series #1: Challenges for Safe & Beneficial Brain-Like Artificial General Intelligence with Steve Byrnes

Curtis Huebner and Steven Byrnes

23 Mar 2023 14:32 UTC

37 points

0 comments27 min readLW link

(youtu.be)

[Question] Alignment-related jobs outside of London/SF

kwiat.dev23 Mar 2023 13:24 UTC

26 points

14 comments1 min readLW link

Zuzalu

vincentweisser23 Mar 2023 11:24 UTC

3 points

0 comments1 min readLW link

How Do Induction Heads Actually Work in Transformers With Finite Capacity?

Fabien Roger23 Mar 2023 9:09 UTC

28 points

0 comments5 min readLW link

ChatGPT’s “fuzzy alignment” isn’t evidence of AGI alignment: the banana test

Michael Tontchev23 Mar 2023 7:12 UTC

23 points

6 comments4 min readLW link

Sparks of Artificial General Intelligence: Early experiments with GPT-4 | Microsoft Research

DragonGod23 Mar 2023 5:45 UTC

68 points

23 comments1 min readLW link

(arxiv.org)

Transcript: NBC Nightly News: AI ‘race to recklessness’ w/ Tristan Harris, Aza Raskin

WilliamKiely23 Mar 2023 1:04 UTC

63 points

4 comments3 min readLW link

Why We MUST Create an AGI that Disempowers Humanity. For Real.

twkaiser22 Mar 2023 23:01 UTC

−17 points

1 comment4 min readLW link

Progress links and tweets, 2023-03-22

jasoncrawford22 Mar 2023 22:19 UTC

13 points

0 comments2 min readLW link

(rootsofprogress.org)

[Question] How to convince someone AGI is coming soon?

Zohar Jackson22 Mar 2023 22:16 UTC

5 points

7 comments1 min readLW link

Harry Potter in The World of Path Semantics

Sven Nilsen22 Mar 2023 20:22 UTC

−3 points

17 comments1 min readLW link

(raw.githubusercontent.com)

Books: Lend, Don’t Give

jefftk22 Mar 2023 18:40 UTC

28 points

2 comments1 min readLW link

(www.jefftk.com)

[Linkpost] Shorter version of report on existential risk from power-seeking AI

Joe Carlsmith22 Mar 2023 18:09 UTC

7 points

0 comments1 min readLW link

Announcing the European Network for AI Safety (ENAIS)

Esben Kran22 Mar 2023 17:57 UTC

19 points

0 comments3 min readLW link

[Question] Genuine question: If Eliezer is so rational why is he fat?

DirichletConvolution22 Mar 2023 17:41 UTC

−48 points

12 comments1 min readLW link

Making better estimates with scarce information

Stan Pinsent22 Mar 2023 17:40 UTC

11 points

5 comments10 min readLW link

Anki with Uncertainty: Turn any flashcard deck into a calibration training tool

Sage Future22 Mar 2023 17:26 UTC

14 points

2 comments1 min readLW link

(www.quantifiedintuitions.org)

Key Questions for Digital Minds

Jacy Reese Anthis22 Mar 2023 17:13 UTC

28 points

1 comment7 min readLW link

(www.sentienceinstitute.org)

Empirical risk minimization is fundamentally confused

Jesse Hoogland22 Mar 2023 16:58 UTC

32 points

8 comments1 min readLW link

[Question] Challenge: Does ChatGPT ever claim that a bad outcome for humanity is actually good?

Yair Halberstadt22 Mar 2023 16:01 UTC

50 points

29 comments1 min readLW link

The space of systems and the space of maps

Jan_Kulveit, rosehadshar, Nora_Ammann and clem_acs

22 Mar 2023 14:59 UTC

38 points

0 comments5 min readLW link

Feature Request to OpenAI: Share button in ChatGPT

Taleuntum22 Mar 2023 14:19 UTC

15 points

4 comments2 min readLW link

Why AI Safety is Hard

Simon Möller22 Mar 2023 10:44 UTC

1 point

0 comments6 min readLW link

[Question] Was Saga of Tatiana the Funny made by Fushimi Gaku?

Eve Grey22 Mar 2023 9:59 UTC

−9 points

0 comments1 min readLW link

The Gom Jabbar scene from Dune is essentially a short film about what Rationality is for

mako yass22 Mar 2023 8:33 UTC

6 points

1 comment1 min readLW link

Agentic GPT simulations: a risk and an opportunity

Yair Halberstadt22 Mar 2023 6:24 UTC

24 points

8 comments1 min readLW link

Emergent Analogical Reasoning in Large Language Models

Roman Leventov22 Mar 2023 5:18 UTC

13 points

2 comments1 min readLW link

(arxiv.org)

[Linkpost] GatesNotes: The Age of AI has begun

WilliamKiely22 Mar 2023 4:20 UTC

19 points

9 comments1 min readLW link

An Appeal to AI Superintelligence: Reasons Not to Preserve (most of) Humanity

Alex Beyman22 Mar 2023 4:09 UTC

−14 points

6 comments19 min readLW link

Truth and Advantage: Response to a draft of “AI safety seems hard to measure”

So8res22 Mar 2023 3:36 UTC

98 points

10 comments5 min readLW link 1 review

A Proposed Approach for AI Safety Movement Building: Projects, Professions, Skills, and Ideas for the Future [long post][bounty for feedback]

peterslattery22 Mar 2023 1:11 UTC

14 points

0 comments32 min readLW link

Principles for Productive Group Meetings

jsteinhardt22 Mar 2023 0:50 UTC

60 points

1 comment13 min readLW link

(bounded-regret.ghost.io)

God vs AI scientifically

Donatas Lučiūnas21 Mar 2023 23:03 UTC

−22 points

45 comments1 min readLW link

A method for empirical back-testing of AI’s ability to self-improve

Michael Tontchev21 Mar 2023 20:24 UTC

3 points

0 comments2 min readLW link

AI Fables

Bard21 Mar 2023 19:19 UTC

18 points

12 comments4 min readLW link

[Question] Adversarial (SEO) GPT training data?

Dagon21 Mar 2023 18:55 UTC

2 points

0 comments1 min readLW link

[Question] Why not constrain wetlabs instead of AI?

Lone Pine21 Mar 2023 18:02 UTC

15 points

10 comments1 min readLW link

[Question] Wouldn’t an intelligent agent keep us alive and help us align itself to our values in order to prevent risk ? by Risk I mean experimentation by trying to align potentially smarter replicas?

Terrence Rotoufle21 Mar 2023 17:44 UTC

−3 points

1 comment2 min readLW link

[Question] Employer considering partnering with major AI labs. What to do?

GraduallyMoreAgitated21 Mar 2023 17:43 UTC

37 points

7 comments2 min readLW link

Sun-following Garden Mirrors?

jefftk21 Mar 2023 16:20 UTC

15 points

5 comments1 min readLW link

(www.jefftk.com)

Some constructions for proof-based cooperation without Löb

James Payor21 Mar 2023 16:12 UTC

50 points

3 comments4 min readLW link