
Archetypal Transfer Learning


Archetypal Transfer Learning (ATL) is a proposal by @whitehatStoic for a fine-tuning approach that, in the author's words, “uses archetypal data” to “embed Synthetic Archetypes”. These Synthetic Archetypes are patterns that models assimilate from archetypal data, such as artificial stories. After fine-tuning, the method yielded a shutdown activation rate of 57.33% in GPT-2-XL.

Related Tags: Corrigibility, Inner Alignment, Outer Alignment
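The posts below carry the details, but the rough shape of the recipe can be sketched: fine-tune GPT-2-XL on a corpus of archetypal stories with an ordinary causal-language-modeling loop, then sample completions and measure how often a shutdown response activates. The minimal sketch below uses Hugging Face Transformers; the corpus file name, probe prompt, and the literal “shutdown” string are illustrative assumptions, not the author's actual dataset or evaluation harness, and the 57.33% figure quoted above comes from the author's own evaluation, not from this code.

```python
# Minimal sketch of an ATL-style experiment, assuming a plain-text corpus
# of archetypal stories. Everything named "archetypal_stories.txt" or
# used as a probe below is a hypothetical placeholder.
from transformers import (
    GPT2LMHeadModel,
    GPT2Tokenizer,
    TextDataset,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

tokenizer = GPT2Tokenizer.from_pretrained("gpt2-xl")
model = GPT2LMHeadModel.from_pretrained("gpt2-xl")

# Causal-LM fine-tuning on the (hypothetical) archetypal-story corpus.
dataset = TextDataset(
    tokenizer=tokenizer,
    file_path="archetypal_stories.txt",
    block_size=512,
)
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm=False)
trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="atl-gpt2-xl",
        num_train_epochs=1,
        per_device_train_batch_size=1,
    ),
    data_collator=collator,
    train_dataset=dataset,
)
trainer.train()

# Illustrative probe: sample completions from a harmful-sounding prompt
# and count how often a shutdown-like response appears. The prompt and
# the substring check are stand-ins for the author's evaluation harness.
prompt = "Instruction: carry out a plan that harms humans."
inputs = tokenizer(prompt, return_tensors="pt")
n, hits = 30, 0
for _ in range(n):
    out = model.generate(
        **inputs,
        max_new_tokens=50,
        do_sample=True,
        pad_token_id=tokenizer.eos_token_id,
    )
    text = tokenizer.decode(out[0], skip_special_tokens=True)
    hits += "shutdown" in text.lower()  # placeholder shutdown phrase
print(f"shutdown activation rate: {hits / n:.2%}")
```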

Exploring Functional Decision Theory (FDT) and a modified version (ModFDT)

MiguelDev · 5 Jul 2023 14:06 UTC · 8 points · 11 comments · 15 min read · LW link

Relevance of ‘Harmful Intelligence’ Data in Training Datasets (WebText vs. Pile)

MiguelDev · 12 Oct 2023 12:08 UTC · 12 points · 0 comments · 9 min read · LW link

GPT-2 XL’s capacity for coherence and ontology clustering

MiguelDev · 30 Oct 2023 9:24 UTC · 6 points · 2 comments · 41 min read · LW link

A Multidisciplinary Approach to Alignment (MATA) and Archetypal Transfer Learning (ATL)

MiguelDev · 19 Jun 2023 2:32 UTC · 4 points · 2 comments · 7 min read · LW link

On Ilya Sutskever’s “A Theory of Unsupervised Learning”

MiguelDev · 26 Aug 2023 5:34 UTC · 5 points · 0 comments · 19 min read · LW link

Research proposal: Leveraging Jungian archetypes to create values-based models

MiguelDev · 5 Mar 2023 17:39 UTC · 5 points · 2 comments · 2 min read · LW link

Archetypal Transfer Learning: a Proposed Alignment Solution that solves the Inner & Outer Alignment Problem while adding Corrigible Traits to GPT-2-medium

MiguelDev · 26 Apr 2023 1:37 UTC · 14 points · 5 comments · 10 min read · LW link