Extremely low-probability events are great as intuition pumps, but terrible as inputs to real-world decision-making.
sayan
[Question] What is your Personal Knowledge Management system?
Quick question: given that the Conservative Agency paper is now available, what am I missing if I just read the paper and not this post? It seems easier for me to follow the paper's notation. Is there any significant difference between the formalization in this post and the one in the paper?
This is an amazingly comprehensive and useful paper. I wish it were longer, with short summaries of some of the papers it references rather than bare citations.
I also wish somebody would create a video version of it, in the spirit of CGP Grey’s video on the classic Bostrom paper, so that I could just point people to the video instead of sub-optimally trying to explain all these things myself.
I have started writing a series of rigorous introductory blogposts on Reinforcement Learning for people with no background in it. This is entirely experimental, and I would love some feedback on my draft. Please let me know if anyone is interested.
[Question] Unknown Unknowns in AI Alignment
Would CIRL with many human agents realistically model our world?
What does AI alignment mean with respect to many humans with different goals? Are we implicitly assuming (in all our current agendas) that the final AGI is corrigible to a single human instructor?
How do we synthesize the goals of so many human agents into one utility function? Are we assuming that solving alignment with one supervisor is easier? Wouldn’t having many supervisors restrict the solution space meaningfully?
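To make the "synthesize many goals into one utility function" question concrete, here is a toy Harsanyi-style weighted-sum aggregation. This is my own illustrative sketch, not anything from CIRL or a specific agenda; all names and the choice of equal weights are assumptions.

```python
# Toy sketch: aggregating many agents' utilities into one "social"
# utility via a weighted sum. The open question above is precisely
# how the weights should be chosen -- and whether a single scalar
# aggregate is the right target at all.

def aggregate_utility(utilities, weights):
    """Weighted sum of per-agent utilities for one outcome."""
    assert len(utilities) == len(weights)
    return sum(w * u for w, u in zip(weights, utilities))

# Two agents with directly conflicting preferences over outcomes A and B.
agent_utils = {"A": [1.0, 0.0], "B": [0.0, 1.0]}
weights = [0.5, 0.5]  # equal weights -- itself a value judgment

scores = {o: aggregate_utility(u, weights) for o, u in agent_utils.items()}
```

Note that under equal weights the two conflicting outcomes score identically, which is one way of seeing why aggregation alone does not resolve the disagreement.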
I think this post is broadly making two claims:
- Impactful things fundamentally feel different.
- A good Impact Measure should be designed so that it strongly safeguards against almost any imperfect objective.
It is also (maybe implicitly) claiming that the three properties mentioned completely specify a good impact measure.
I am looking forward to reading the rest of the sequence with arguments supporting these claims.
Seems like this has been done already.
Is there a good bijection between specification gaming and wireheading on one side, and the different types of Goodhart’s law on the other?
Speculation: people never use pro-con lists to actually make decisions; rather, they use them after the fact to rationalize and to convince others.
The internet might be lacking multiple kinds of curation and organization tools. How can we improve?
Are Dharma traditions that posit ‘innate moral perfection of everyone by default’ reasoning from the just world fallacy?
Can we have a market with qualitatively different (un-interconvertible) forms of money?
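As a toy version of the non-interconvertible-money question, consider a wallet where balances are typed by currency and no exchange operation exists at all. This is purely my own illustrative sketch; all names are made up.

```python
# Toy sketch: a wallet holding qualitatively different,
# non-interconvertible currencies. There is deliberately no
# exchange/convert operation -- each currency can only be earned
# and spent within its own sphere.

class Wallet:
    def __init__(self):
        self.balances = {}  # currency name -> amount

    def deposit(self, currency, amount):
        self.balances[currency] = self.balances.get(currency, 0) + amount

    def spend(self, currency, amount):
        if self.balances.get(currency, 0) < amount:
            raise ValueError(f"insufficient {currency}")
        self.balances[currency] -= amount

w = Wallet()
w.deposit("reputation", 10)
w.deposit("cash", 5)
w.spend("reputation", 3)
# w.spend("cash", 100) would raise: a surplus in one currency
# can never cover a shortfall in another.
```

The interesting economic question is what market dynamics emerge when spheres of exchange are enforced like this rather than bridged by conversion.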
It is so difficult to hear the distinctions of, and pronounce, an accent that is not one’s native one because of the brain’s predictive processing: our brains constantly assimilate incoming signals to closely related ones they already know.
Just finished reading Yuval Noah Harari’s new book 21 Lessons for the 21st Century. Primary reaction: even if you already know everything presented in the book, it is worth a read just for the clarity it brings to the discussion.
Enjoyed reading this. Looking forward to the next posts in the sequence.
How would signalling/countersignalling work in a post-scarcity economy?
What are some effective ways to reset the hedonic baseline?
As far as I understand, this post decomposes ‘impact’ into value impact (VI) and objective impact (OI). VI depends on some particular agent’s ability to reach arbitrary value-driven goals, while OI depends on any agent’s ability to reach goals in general.
I’m not sure if there exists a robust distinction between the two—the post doesn’t discuss any general demarcation tool.
Maybe I’m wrong, but I think the most important point to note here is that the ‘objectiveness’ of an impact is defined not in terms of the ‘objective state of the world’, but in terms of how ‘general to all agents’ the impact is.