Boundaries / Membranes [technical]

Tag. Last edit: 31 Jul 2024 5:03 UTC by Chris Lakin

Explanation

Andrew Critch, March 2023:

By boundaries, I just mean the approximate causal separation of regions in some kind of physical space (e.g., spacetime) or abstract space (e.g., cyberspace). Here are some examples from my «Boundaries» Sequence:
a cell membrane (separates the inside of a cell from the outside);
a person’s skin (separates the inside of their body from the outside);
a fence around a family’s yard (separates the family’s place of living-together from neighbors and others);
a digital firewall around a local area network (separates the LAN and its users from the rest of the internet);
a sustained disassociation of social groups (separates the two groups from each other);
a national border (separates a state from neighboring states or international waters).
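
As a toy illustration only (not Critch's formal definition, which Part 3a of the sequence states in terms of directed Markov blankets), strict causal separation can be sketched as a graph property: every directed path of influence between the inside and the outside has to pass through the boundary. The function name `is_separated` and the node labels below are hypothetical, and the check is exact, whereas the quoted definition only asks for approximate separation.

```python
from collections import deque


def is_separated(edges, inside, boundary, outside):
    """Return True if no directed path links inside and outside
    without passing through a boundary node (toy, exact check)."""
    blocked = set(boundary)
    adjacency = {}
    for src, dst in edges:
        adjacency.setdefault(src, []).append(dst)

    def reaches(starts, targets):
        # Breadth-first search that refuses to step onto boundary nodes.
        seen = set(starts)
        queue = deque(starts)
        while queue:
            node = queue.popleft()
            for nxt in adjacency.get(node, []):
                if nxt in blocked or nxt in seen:
                    continue
                if nxt in targets:
                    return True
                seen.add(nxt)
                queue.append(nxt)
        return False

    inside, outside = set(inside), set(outside)
    return not reaches(inside, outside) and not reaches(outside, inside)


# Hypothetical cell example: all influence flows through the membrane.
cell_edges = [
    ("environment", "membrane"),
    ("membrane", "cytoplasm"),
    ("cytoplasm", "membrane"),
    ("membrane", "environment"),
]
# Same graph plus a direct edge that bypasses the membrane.
leaky_edges = cell_edges + [("environment", "cytoplasm")]

intact = is_separated(cell_edges, {"cytoplasm"}, {"membrane"}, {"environment"})
leaky = is_separated(leaky_edges, {"cytoplasm"}, {"membrane"}, {"environment"})
```

Here `intact` is true and `leaky` is false: the extra edge lets the environment act on the cytoplasm without going through the membrane, which is the kind of direct influence a boundary is meant to rule out.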

Applications

One application of boundaries is to formalize the concept of safety.
https://formalizingboundaries.ai/

Compilation

«Boundaries»/membranes and AI safety compilation.

Beware a common misunderstanding

When I say boundary, I don’t just mean an arbitrary constraint or social norm. («Boundaries» Pt. 1)

It is for this reason that I (@Chipmonk) often use the term “membranes” instead of “boundaries”. It aids understanding.

Credits

Tag created and maintained by Chipmonk, membranes/«boundaries» enthusiast. 2023 April.

«Boundaries», Part 1: a key missing concept from utility theory

Andrew_Critch, 26 Jul 2022 23:03 UTC

158 points

33 comments, 7 min read, LW link

«Boundaries», Part 3b: Alignment problems in terms of boundaries

Andrew_Critch, 14 Dec 2022 22:34 UTC

72 points

7 comments, 13 min read, LW link

Acausal normalcy

Andrew_Critch, 3 Mar 2023 23:34 UTC

200 points

40 comments, 8 min read, LW link, 1 review

Agent Boundaries Aren’t Markov Blankets. [Unless they’re non-causal; see comments.]

abramdemski, 20 Nov 2023 18:23 UTC

82 points

11 comments, 2 min read, LW link

«Boundaries/Membranes» and AI safety compilation

Chris Lakin, 3 May 2023 21:41 UTC

56 points

17 comments, 8 min read, LW link

What does davidad want from «boundaries»?

Chris Lakin and davidad

6 Feb 2024 17:45 UTC

47 points

1 comment, 5 min read, LW link

“Membranes” is better terminology than “boundaries” alone

Chris Lakin and the gears to ascension

28 May 2023 22:16 UTC

30 points

12 comments, 3 min read, LW link

Agent membranes/boundaries and formalizing “safety”

Chris Lakin, 3 Jan 2024 17:55 UTC

26 points

46 comments, 3 min read, LW link

[Question] What technical topics could help with boundaries/membranes?

Chris Lakin, 5 Jan 2024 18:14 UTC

15 points

25 comments, 1 min read, LW link

Apply to the Conceptual Boundaries Workshop for AI Safety

Chris Lakin, 27 Nov 2023 21:04 UTC

50 points

0 comments, 3 min read, LW link

Retrospective on Mathematical Boundaries Workshop

miyazono and Chris Lakin

12 May 2024 21:58 UTC

22 points

0 comments, 4 min read, LW link

(formalizingboundaries.substack.com)

Boundaries vs Frames

Scott Garrabrant, 31 Oct 2022 15:14 UTC

58 points

10 comments, 7 min read, LW link

«Boundaries» for formalizing an MVP morality

Chris Lakin, 13 May 2023 19:10 UTC

19 points

7 comments, 4 min read, LW link

Agent membranes and causal distance

Chris Lakin, 2 Jan 2024 22:43 UTC

20 points

3 comments, 3 min read, LW link

«Boundaries», Part 3a: Defining boundaries as directed Markov blankets

Andrew_Critch, 30 Oct 2022 6:31 UTC

90 points

20 comments, 15 min read, LW link

«Boundaries» Sequence (Index Post)

Andrew_Critch, 26 Jul 2022 19:12 UTC

25 points

1 comment, 1 min read, LW link

«Boundaries», Part 2: trends in EA’s handling of boundaries

Andrew_Critch, 6 Aug 2022 0:42 UTC

81 points

15 comments, 7 min read, LW link

What is autonomy? Why boundaries are necessary.

Chris Lakin, 21 Oct 2024 17:56 UTC

8 points

1 comment, 1 min read, LW link

(chrislakin.blog)

Hierarchical Agency: A Missing Piece in AI Alignment

Jan_Kulveit, 27 Nov 2024 5:49 UTC

115 points

22 comments, 11 min read, LW link

[Question] Plausibility of cyborgism for protecting boundaries?

Chris Lakin, 27 Mar 2024 18:53 UTC

10 points

6 comments, 1 min read, LW link

Consequentialists: One-Way Pattern Traps

David Udell, 16 Jan 2023 20:48 UTC

59 points

3 comments, 14 min read, LW link

Agency from a causal perspective

tom4everitt, mattmacdermott, James Fox, Francis Rhys Ward and Jonathan Richens

30 Jun 2023 17:37 UTC

40 points

5 comments, 6 min read, LW link

Embedded Agency (full-text version)

Scott Garrabrant and abramdemski

15 Nov 2018 19:49 UTC

211 points

17 comments, 54 min read, LW link

A list of core AI safety problems and how I hope to solve them

davidad, 26 Aug 2023 15:12 UTC

165 points

29 comments, 5 min read, LW link

Protecting agent boundaries

Chris Lakin, 25 Jan 2024 4:13 UTC

11 points

6 comments, 2 min read, LW link

An Open Agency Architecture for Safe Transformative AI

davidad, 20 Dec 2022 13:04 UTC

80 points

22 comments, 4 min read, LW link

Cartography, blowing one’s mind, the illusion of separation and other general musings

Neil, 16 Jun 2023 19:19 UTC

0 points

4 comments, 2 min read, LW link

Boundaries Update #1

Chris Lakin, 11 Apr 2024 16:07 UTC

3 points

2 comments, 1 min read, LW link

(formalizingboundaries.substack.com)

Davidad’s Bold Plan for Alignment: An In-Depth Explanation

Charbel-Raphaël and Gabin

19 Apr 2023 16:09 UTC

167 points

40 comments, 21 min read, LW link, 2 reviews

Being nicer than Clippy

Joe Carlsmith, 16 Jan 2024 19:44 UTC

109 points

32 comments, 27 min read, LW link

Desiderata for an AI

Nathan Helm-Burger, 19 Jul 2023 16:18 UTC

9 points

0 comments, 4 min read, LW link

Agents Over Cartesian World Models

Mark Xu and evhub

27 Apr 2021 2:06 UTC

67 points

4 comments, 27 min read, LW link

Respect for Boundaries as non-arbirtrary coordination norms

Jonas Hallgren, 9 May 2023 19:42 UTC

9 points

3 comments, 7 min read, LW link

Encultured AI Pre-planning, Part 1: Enabling New Benchmarks

Andrew_Critch and Nick Hay

8 Aug 2022 22:44 UTC

63 points

2 comments, 6 min read, LW link

Boundary Violations vs Boundary Dissolution

Chris Lakin, 26 Feb 2024 18:59 UTC

8 points

4 comments, 1 min read, LW link

Boundaries-based security and AI safety approaches

Allison Duettmann, 12 Apr 2023 12:36 UTC

43 points

2 comments, 6 min read, LW link

[Question] Define “Agent” (Embedded)

Apollonia, 24 Mar 2024 20:14 UTC

10 points

1 comment, 1 min read, LW link

Incorporating Justice Theory into Decision Theory

StrivingForLegibility, 21 Jan 2024 19:17 UTC

13 points

20 comments, 5 min read, LW link

[AN #127]: Rethinking agency: Cartesian frames as a formalization of ways to carve up the world into an agent and its environment

Rohin Shah, 2 Dec 2020 18:20 UTC

53 points

0 comments, 13 min read, LW link

(mailchi.mp)

LOVE in a simbox is all you need

jacob_cannell, 28 Sep 2022 18:25 UTC

67 points

73 comments, 44 min read, LW link, 1 review

Are ethical asymmetries from property rights?

KatjaGrace, 2 Jul 2018 3:00 UTC

108 points

37 comments, 3 min read, LW link

(meteuphoric.com)

On green

Joe Carlsmith, 21 Mar 2024 17:38 UTC

290 points

35 comments, 31 min read, LW link

Content and Takeaways from SERI MATS Training Program with John Wentworth

RohanS, 24 Dec 2022 4:17 UTC

28 points

3 comments, 12 min read, LW link

Boundaries enable positive material-informational feedback loops

jessicata, 22 Dec 2018 2:46 UTC

36 points

26 comments, 5 min read, LW link

Formalizing «Boundaries» with Markov blankets

Chris Lakin, 19 Sep 2023 21:01 UTC

23 points

20 comments, 3 min read, LW link

Roadmap for a collaborative prototype of an Open Agency Architecture

Deger Turan, 10 May 2023 17:41 UTC

31 points

0 comments, 12 min read, LW link

Idealized Agents Are Approximate Causal Mirrors (+ Radical Optimism on Agent Foundations)

Thane Ruthenis, 22 Dec 2023 20:19 UTC

77 points

14 comments, 6 min read, LW link

A necessary Membrane formalism feature

ThomasCederborg, 10 Sep 2024 21:33 UTC

20 points

6 comments, 11 min read, LW link

Announcing the Alignment of Complex Systems Research Group

Jan_Kulveit and technicalities

4 Jun 2022 4:10 UTC

92 points

20 comments, 5 min read, LW link

Empowerment is (almost) All We Need

jacob_cannell, 23 Oct 2022 21:48 UTC

61 points

44 comments, 17 min read, LW link

The Human-AI Reflective Equilibrium

Allison Duettmann, 24 Jan 2023 1:32 UTC

22 points

1 comment, 24 min read, LW link
