The Natural Abstraction hypothesis says that:

Our physical world abstracts well: for most systems, the information relevant “far away” from the system (in various senses) is much lower-dimensional than the system itself. These low-dimensional summaries are exactly the high-level abstract objects/concepts typically used by humans.

These abstractions are “natural”: a wide variety of cognitive architectures will learn to use approximately the same high-level abstract objects/concepts to reason about the world.

(from “Testing the Natural Abstraction Hypothesis”)

Alignment By Default

Testing The Natural Abstraction Hypothesis: Project Intro

Testing The Natural Abstraction Hypothesis: Project Update

Agency As a Natural Abstraction

The Natural Abstraction Hypothesis: Implications and Evidence

Natural Categories Update

Computing Natural Abstractions: Linear Approximation

AXRP Episode 15 - Natural Abstractions with John Wentworth

The Core of the Alignment Problem is...

Causal Abstraction Toy Model: Medical Sensor

[Hebbian Natural Abstractions] Introduction

What Does The Natural Abstraction Framework Say About ELK?

