Chipmonk

Karma: 763

«Boundaries» enthusiast. Click here.

chrislakin.com/now

Chipmonk 23 Apr 2023 0:44 UTC
1 point
0
on: «Boundaries», Part 2: trends in EA’s handling of boundaries
1. Expansive thinking
Consider a fictional person named Alex. A “job” for Alex is a scope of affairs (features of the world) that Alex is considered responsible for observing and handling. Alex might have multiple roles that we can think of as jobs, e.g. “office manager”, “husband”, “neighbor”.
I somewhat disagree with how this section is presented so I wrote a post about it and proposed a compromise.
In summary:
- I propose defining boundaries in the Alex example not in terms of “jobs”, but in terms of: 1) contracts (mutual agreements between two parties), and 2) property / things he owns.
- Alex is not “responsible” for *someone else’s* poverty. (And “donating” is not/cannot be part of his “job”.) He is, however, responsible for his values, and in this case because of his values, he is *expressing care* for someone else’s poverty, and this is distinct from “taking responsibility”.

Chipmonk 24 Apr 2023 12:11 UTC
5 points
0
in reply to: Viliam’s comment on: my conceptualizations of infiltration and exfiltration from the «Boundaries» Sequence
I think your comment is mostly relevant and lays out, mechanistically, how speculating about what someone else is thinking can lead to trying to control them (a sovereignty violation); i.e.: from exfiltration to infiltration.
Also—

Chipmonk 25 Apr 2023 22:03 UTC
1 point
0
on: Boundaries vs Frames
Maintaining Boundaries is about Maintaining Free Will and Privacy
I really like this conceptualization! Especially “privacy”. I’ve written a post about the finer details of this wrt to infiltration and exfiltration.
Here’s a peak at how I summarize this at the end:
- Infiltration — sovereignty
  - “One should try not to be controlled by others”
  - “One should try not to control others”
- Exfiltration — privacy / mindreading
  - ~“One should maintain privacy by default and try not to be mind-read by others”
  - ~“One shouldn’t speculate about what others are thinking” / “One shouldn’t invade others’ privacy”

Chipmonk 26 Apr 2023 17:55 UTC
1 point
0
on: my conceptualizations of infiltration and exfiltration from the «Boundaries» Sequence
I updated the post to add two more examples of exfiltration: one pertaining to BATNAs, and one pertaining to energy/heat loss.
And I added a visualization of agents as blobs.

Chipmonk 26 Apr 2023 18:07 UTC
3 points
on: Boundaries enable positive material-informational feedback loops
Thanks for writing this! This is very closely related to Andrew Critch’s «Boundaries» Sequence, 2022. Part 3a formalizes boundaries in terms of Markov blankets, and leakage in terms of conditional mutual information.
I’ve also expanded on such leakage (“infiltration and exfiltration”) in my post my conceptualizations of infiltration and exfiltration from the «Boundaries» Sequence

Chipmonk 26 Apr 2023 18:16 UTC
1 point
−1
on: Agents Over Cartesian World Models
Cartesian boundaries are not real
I disagree with this. This has recently been formalized in Andrew Critch’s «Boundaries» Sequence. E.g.: «Boundaries», Part 3a: Defining boundaries as directed Markov blankets.
Boundaries include things like a cell membrane, a fence around yard, and a national border; see Part 1. In short, a boundary is going to be something that separates the inside of a living system from the outside of the system. More fundamentally, a living system or organism will be defined as
- a) a part of the world, with
- b) a subsystem called its boundary which approximately causally separates another subsystem called its viscera from the rest of the world,
  
  where
- c) the boundary state decomposes into active and passive features that direct causal influence outward and inward respectively, such that
- d) the boundary and viscera together implement a decision-making process that perpetuates these four defining properties.
Also see: Scott Garrabrant: Boundaries vs Frames

Chipmonk 1 May 2023 23:03 UTC
6 points
0
on: The Apprentice Thread 2
[APPRENTICE]
I’m looking for someone to mentor me specifically w.r.t. «Boundaries» (or, similarly: Cartesian Frames). I’m interested in this both for AI safety (I have a draft compilation post on this that I will be posting in the next few days, or else I’d share it here), and also as a rationality technique. I’m interested in doing research on and/or distillation for this.

«Boundaries/Membranes» and AI safety compilation

Chipmonk3 May 2023 21:41 UTC

53 points

17 comments8 min readLW link

Chipmonk 3 May 2023 22:09 UTC
1 point
0
on: «Boundaries» and AI safety compilation
Here are some more posts which might be also related, but less obviously so. I will leave them in this comment for now, but feel free to argue me into including or excluding any of these.
- Agents Over Cartesian World Models
- Empowerment is (almost) All We Need and LOVE in a simbox is all you need (by Jacob Cannell) ?
- Announcing the Alignment of Complex Systems Research Group (2022 June) seems like this would be related?
- Consequentialists: One-Way Pattern Traps
- Other comments by Davidad: 1
- Quintin Pope might think that boundaries arbitrary. Also see the google doc he links in this comment
- Embedded Agency (full-text version)
Also, lmk if anything else should be linked in the main post.

Chipmonk 3 May 2023 22:33 UTC
1 point
0
in reply to: Raemon’s comment on: «Boundaries» and AI safety compilation
I believe I’m abiding by the definition inherent to his sequence, but anyone is free to convince me otherwise.
(Please also let me know if I’ve violated some norm about naming conventions.)
I’ve decided to use “«boundaries»” instead of “boundaries” because “boundaries” colloquially refers to something that’s more like “Hey you crossed my boundaries, you’re so mean!” (see this post for examples), and while I think that these two concepts are related, I find them extraordinarily confusing to consider simultaneously (because “crossing ‘boundaries’” does not imply “crossing «boundaries»”), so I try to be explicit as possible with the use.
In the future I plan to use that word as little as possible because of this, but unfortunately that’s the name of the sequence.
But “Boundaries [technical]” could do…
What links here?
- «Boundaries/Membranes» and AI safety compilation by Chipmonk (3 May 2023 21:41 UTC; 53 points)

Chipmonk 3 May 2023 23:35 UTC
1 point
0
in reply to: Raemon’s comment on: «Boundaries» and AI safety compilation
Ok, I will rename the tag from “«Boundaries»” to “Boundaries [technical]”. Fwiw I consider both strings as referring to the same concept, but I see how it might be weird to use «».

Chipmonk 4 May 2023 17:10 UTC
4 points
1
on: Davidad’s Bold Plan for Alignment: An In-Depth Explanation
I’ve compiled most if not all of everything Davidad has said about «boundaries» (which are mentioned in this post insofar as “deontic feasibility hypothesis” and “elicitors”) to date here: «Boundaries and AI safety compilation. Also see: «Boundaries» for formalizing a bare-bones morality

Chipmonk 4 May 2023 17:11 UTC
2 points
0
on: An Open Agency Architecture for Safe Transformative AI
- Deontic Sufficiency Hypothesis: There exists a human-understandable set of features of finite trajectories in such a world-model, taking values in $(- \infty, 0]$ , such that we can be reasonably confident that all these features being near 0 implies high probability of existential safety, and such that saturating them at 0 is feasible^[2] with high probability, using scientifically-accessible technologies.
  I am optimistic about this largely because of recent progress toward formalizing a natural abstraction of boundaries by Critch and Garrabrant. I find it quite plausible that there is some natural abstraction property $Q$ of world-model trajectories that lies somewhere strictly within the vast moral gulf of
$All Principles That Human CEV Would Endorse \Rightarrow Q \Rightarrow Don't Kill Everyone$
I’ve compiled most if not all of everything Davidad has said to date about «boundaries» here: «Boundaries and AI safety compilation.

Chipmonk 4 May 2023 17:13 UTC
1 point
0
on: «Boundaries», Part 3b: Alignment problems in terms of boundaries
I’ve compiled all of the current «Boundaries» x AI safety thinking and research I could find in this post: «Boundaries» and AI safety compilation. Also see: «Boundaries» for formalizing a bare-bones morality which relates to scoped consequentialism

Chipmonk 4 May 2023 17:14 UTC
1 point
0
on: Boundaries vs Frames
I’ve compiled all of the current «Boundaries» x AI safety thinking and research (like this post) that I could find here: «Boundaries» and AI safety compilation

Chipmonk 4 May 2023 17:14 UTC
1 point
0
on: Boundaries-based security and AI safety approaches
I’ve compiled all of the current «Boundaries» x AI safety thinking and research (like this post) that I could find here: «Boundaries» and AI safety compilation.
(E.g.: Davidad connected this post to moral patienthood on twitter)

Chipmonk 4 May 2023 23:15 UTC
1 point
on: [Beta Feature] Google-Docs-like editing for LessWrong posts
Bug: I can make myself a co-author on a draft that I’ve created (a second co-author).

Chipmonk 7 May 2023 14:49 UTC
1 point
0
in reply to: Alexander Gietelink Oldenziel’s comment on: «Boundaries» and AI safety compilation
Thanks:)

Chipmonk 10 May 2023 0:56 UTC
1 point
0
on: Respect for Boundaries as non-arbirtrary coordination norms
the object obviously has a viscera that’s outside the boundary
I’m not following you here— ‘viscera’ is defined to be what’s within the boundary, no?
Also, what does it mean for the viscera to have different ‘shapes’?

Chipmonk 11 May 2023 15:48 UTC
2 points
1
on: Davidad’s Bold Plan for Alignment: An In-Depth Explanation
The overleaf project linked in the last word of “Why Category Theory” is restricted

Chipmonk

1. Expansive thinking

Maintaining Boundaries is about Maintaining Free Will and Privacy

Cartesian boundaries are not real

«Boundaries/​Mem­branes» and AI safety compilation

«Boundaries/Membranes» and AI safety compilation