
Martín Soto

Karma: 855

Mathematical Logic grad student, doing AI Safety research for ethical reasons.

Working on conceptual alignment, decision theory, cooperative AI and cause prioritization.

My webpage.

Leave me anonymous feedback.

[Question] Which one of these two academic routes should I take to end up in AI Safety?

Martín Soto · 3 Jul 2022 1:05 UTC
5 points · 2 comments · 1 min read · LW link

Alignment being impossible might be better than it being really difficult

Martín Soto · 25 Jul 2022 23:57 UTC
13 points · 2 comments · 2 min read · LW link

General advice for transitioning into Theoretical AI Safety

Martín Soto · 15 Sep 2022 5:23 UTC
11 points · 0 comments · 10 min read · LW link

An issue with MacAskill’s Evidentialist’s Wager

Martín Soto · 21 Sep 2022 22:02 UTC
1 point · 9 comments · 4 min read · LW link

[Question] Enriching Youtube content recommendations

Martín Soto · 27 Sep 2022 16:54 UTC
8 points · 4 comments · 1 min read · LW link

Further considerations on the Evidentialist’s Wager

Martín Soto · 3 Nov 2022 20:06 UTC
3 points · 9 comments · 8 min read · LW link

Vanessa Kosoy’s PreDCA, distilled

Martín Soto · 12 Nov 2022 11:38 UTC
17 points · 19 comments · 5 min read · LW link

A short critique of Vanessa Kosoy’s PreDCA

Martín Soto · 13 Nov 2022 16:00 UTC
27 points · 8 comments · 4 min read · LW link

Brute-forcing the universe: a non-standard shot at diamond alignment

Martín Soto · 22 Nov 2022 22:36 UTC
9 points · 2 comments · 20 min read · LW link

The Alignment Problems

Martín Soto · 12 Jan 2023 22:29 UTC
19 points · 0 comments · 4 min read · LW link

Why are counterfactuals elusive?

Martín Soto · 3 Mar 2023 20:13 UTC
14 points · 6 comments · 2 min read · LW link

Quantitative cruxes in Alignment

Martín Soto · 2 Jul 2023 20:38 UTC
19 points · 0 comments · 23 min read · LW link

Sources of evidence in Alignment

Martín Soto · 2 Jul 2023 20:38 UTC
20 points · 0 comments · 11 min read · LW link

Updatelessness doesn’t solve most problems

Martín Soto · 8 Feb 2024 17:30 UTC
125 points · 43 comments · 12 min read · LW link

The lattice of partial updatelessness

Martín Soto · 10 Feb 2024 17:34 UTC
21 points · 5 comments · 5 min read · LW link

Natural abstractions are observer-dependent: a conversation with John Wentworth

Martín Soto · 12 Feb 2024 17:28 UTC
38 points · 13 comments · 7 min read · LW link

Why does generalization work?

Martín Soto · 20 Feb 2024 17:51 UTC
43 points · 16 comments · 4 min read · LW link

Evidential Correlations are Subjective, and it might be a problem

Martín Soto · 7 Mar 2024 18:37 UTC
26 points · 6 comments · 14 min read · LW link

How disagreements about Evidential Correlations could be settled

Martín Soto · 11 Mar 2024 18:28 UTC
11 points · 3 comments · 4 min read · LW link

Comparing Alignment to other AGI interventions: Basic model

Martín Soto · 20 Mar 2024 18:17 UTC
12 points · 4 comments · 7 min read · LW link