Michele Campolo

Karma: 121

Lifelong recursive self-improver, on his way to exploding really intelligently :D

More seriously: my posts are mostly about AI alignment, with an eye towards moral progress and creating a better future. If there were a public machine ethics forum, I would write there as well.

An idea:

The idea above, and the fact that I’d like to avoid producing technology that can be used for bad purposes, is what motivates my research. Feel free to reach out if you relate!

At the moment I am doing research at CEEALAR on agents whose behaviour is driven by a reflective process analogous to human moral reasoning, rather than by a metric specified by the designer. See Free agents.

Here are other suggested readings from what I’ve written so far:

- Naturalism and AI alignment
- From language to ethics by automated reasoning
- Criticism of the main framework in AI alignment

Agents that act for reasons: a thought experiment

Michele Campolo, 24 Jan 2024 16:47 UTC
3 points
0 comments, 3 min read

Free agents

Michele Campolo, 27 Dec 2023 20:20 UTC
6 points
19 comments, 13 min read

On value in humans, other animals, and AI

Michele Campolo, 31 Jan 2023 23:33 UTC
3 points
17 comments, 5 min read

Criticism of the main framework in AI alignment

Michele Campolo, 31 Jan 2023 23:01 UTC
19 points
2 comments, 6 min read

Some alternative AI safety research projects

Michele Campolo, 28 Jun 2022 14:09 UTC
9 points
0 comments, 3 min read

From language to ethics by automated reasoning

Michele Campolo, 21 Nov 2021 15:16 UTC
4 points
4 comments, 6 min read

[Question] What is the strongest argument you know for antirealism?

Michele Campolo, 12 May 2021 10:53 UTC
7 points
58 comments, 1 min read

Naturalism and AI alignment

Michele Campolo, 24 Apr 2021 16:16 UTC
5 points
12 comments, 8 min read

Literature Review on Goal-Directedness

18 Jan 2021 11:15 UTC
80 points
21 comments, 31 min read

Decision Theory is multifaceted

Michele Campolo, 13 Sep 2020 22:30 UTC
9 points
12 comments, 8 min read

Goals and short descriptions

Michele Campolo, 2 Jul 2020 17:41 UTC
14 points
8 comments, 5 min read

Wireheading and discontinuity

Michele Campolo, 18 Feb 2020 10:49 UTC
21 points
4 comments, 3 min read

Thinking of tool AIs

Michele Campolo, 20 Nov 2019 21:47 UTC
6 points
2 comments, 4 min read