Michele Campolo

Karma: 107

Lifelong recursive self-improver, on his way to exploding really intelligently :D

More seriously: my posts are mostly about AI alignment, with an eye towards moral progress. I have a bachelor’s degree in mathematics, I did research at CEEALAR for four years, and now I do research independently.

A fun problem to think about:
Imagine it’s the year 1500. You want to make an AI that is able to tell you that witch hunts are a terrible idea and to convincingly explain why, despite the fact that many people around you seem to think the exact opposite. Assuming you have the technology, how do you do it?

I’m trying to solve that problem, with the difference that we are in the 21st century now (I know, massive spoiler, sorry for that.)

The problem above, and the fact that I’d like to avoid producing AI that can be used for bad purposes, is what motivates my research. If this sounds interesting to you, have a look at these two short posts. If you are looking for something more technical, consider setting some time aside to read these two.

Feel free to reach out if you relate!

You can support my research through Patreon here.

Work in progress:

Maybe coming soon: an alignment technique (not necessarily for making AI that is good at ethics or cause prioritisation) that can be applied to language models
More probably but less soon: a follow-up to both these two posts (more practical, less theoretical and speculative)
Hard to judge if/when: a nicer version of the argument in here

One more reason for AI capable of independent moral reasoning: alignment itself and cause prioritisation

Michele Campolo22 Aug 2025 15:53 UTC

−3 points

0 comments3 min readLW link

Doing good… best?

Michele Campolo22 Aug 2025 15:48 UTC

−1 points

6 comments2 min readLW link

With enough knowledge, any conscious agent acts morally

Michele Campolo22 Aug 2025 15:44 UTC

−2 points

9 comments36 min readLW link

Agents that act for reasons: a thought experiment

Michele Campolo24 Jan 2024 16:47 UTC

3 points

0 comments3 min readLW link

Free agents

Michele Campolo27 Dec 2023 20:20 UTC

6 points

19 comments14 min readLW link

On value in humans, other animals, and AI

Michele Campolo31 Jan 2023 23:33 UTC

3 points

17 comments5 min readLW link

Criticism of the main framework in AI alignment

Michele Campolo31 Jan 2023 23:01 UTC

19 points

2 comments6 min readLW link

Some alternative AI safety research projects

Michele Campolo28 Jun 2022 14:09 UTC

9 points

0 comments3 min readLW link

From language to ethics by automated reasoning

Michele Campolo21 Nov 2021 15:16 UTC

4 points

4 comments6 min readLW link

[Question] What is the strongest argument you know for antirealism?

Michele Campolo12 May 2021 10:53 UTC

7 points

58 comments1 min readLW link

Naturalism and AI alignment

Michele Campolo24 Apr 2021 16:16 UTC

5 points

12 comments8 min readLW link

Literature Review on Goal-Directedness

adamShimi, Michele Campolo and Joe Collman

18 Jan 2021 11:15 UTC

80 points

21 comments31 min readLW link

Decision Theory is multifaceted

Michele Campolo13 Sep 2020 22:30 UTC

9 points

12 comments8 min readLW link

Goals and short descriptions

Michele Campolo2 Jul 2020 17:41 UTC

14 points

8 comments5 min readLW link

Wireheading and discontinuity

Michele Campolo18 Feb 2020 10:49 UTC

21 points

4 comments3 min readLW link

Thinking of tool AIs

Michele Campolo20 Nov 2019 21:47 UTC

6 points

2 comments4 min readLW link