MichaelDickens

Karma: 2,436

Can AI make advancements in moral philosophy by writing proofs?

MichaelDickens, 14 Apr 2026 0:09 UTC
14 points
4 comments, 4 min read, LW link

Pausing AI Is the Best Answer to Post-Alignment Problems

MichaelDickens, 11 Apr 2026 15:46 UTC
61 points
12 comments, 2 min read, LW link

By Strong Default, ASI Will End Liberal Democracy

MichaelDickens, 6 Apr 2026 23:43 UTC
45 points
16 comments, 3 min read, LW link

Which types of AI alignment research are most likely to be good for all sentient beings?

MichaelDickens, 23 Mar 2026 13:38 UTC
11 points
1 comment, 6 min read, LW link

If AI alignment is only as hard as building the steam engine, then we likely still die

MichaelDickens, 10 Jan 2026 23:10 UTC
35 points
8 comments, 4 min read, LW link

Where I Am Donating in 2025

MichaelDickens, 28 Nov 2025 5:07 UTC
32 points
2 comments, 14 min read, LW link

Alignment Bootstrapping Is Dangerous

MichaelDickens, 27 Nov 2025 18:18 UTC
18 points
0 comments, 2 min read, LW link

We won’t solve post-alignment problems by doing research

MichaelDickens, 21 Nov 2025 18:03 UTC
23 points
11 comments, 4 min read, LW link

Knowing Whether AI Alignment Is a One-Shot Problem Is a One-Shot Problem

MichaelDickens, 17 Nov 2025 19:11 UTC
32 points
2 comments, 3 min read, LW link

Epistemic Spot Check: Expected Value of Donating to Alex Bores’s Congressional Campaign

MichaelDickens, 13 Nov 2025 19:08 UTC
66 points
1 comment, 6 min read, LW link

Things I’ve Become More Confident About

MichaelDickens, 3 Nov 2025 3:52 UTC
7 points
0 comments, 7 min read, LW link

Outlive: A Critical Review

MichaelDickens, 4 Jul 2025 2:14 UTC
67 points
4 comments, 27 min read, LW link
(mdickens.me)

[Question] How concerned are you about a fast takeoff due to a leap in hardware usage?

MichaelDickens, 14 Jun 2025 1:15 UTC
9 points
7 comments, 1 min read, LW link

Why would AI companies use human-level AI to do alignment research?

MichaelDickens, 25 Apr 2025 19:12 UTC
29 points
8 comments, 2 min read, LW link

What AI safety plans are there?

MichaelDickens, 23 Apr 2025 22:58 UTC
18 points
4 comments, 1 min read, LW link

Retroactive If-Then Commitments

MichaelDickens, 1 Feb 2025 22:22 UTC
8 points
1 comment, 1 min read, LW link

A “slow takeoff” might still look fast

MichaelDickens, 17 Feb 2023 16:51 UTC
5 points
3 comments, 1 min read, LW link

[Question] How much should I update on the fact that my dentist is named Dennis?

MichaelDickens, 26 Dec 2022 19:11 UTC
2 points
3 comments, 1 min read, LW link

[Question] Why does gradient descent always work on neural networks?

MichaelDickens, 20 May 2022 21:13 UTC
15 points
11 comments, 1 min read, LW link

MichaelDickens’s Shortform

MichaelDickens, 18 Oct 2021 18:26 UTC
2 points
162 comments, 1 min read, LW link