Reinforcement Learning using Layered Morphology (RLLM)

3 Dec 2023 15:19 UTC

Intergenerational Knowledge Transfer (IKT)

MiguelDev28 Mar 2024 8:14 UTC

6 points

0 comments1 min readLW link

RLLMv10 experiment

MiguelDev18 Mar 2024 8:32 UTC

5 points

0 comments2 min readLW link

A T-o-M test: ‘popcorn’ or ‘chocolate’

MiguelDev8 Mar 2024 4:24 UTC

20 points

13 comments1 min readLW link

Can RLLMv3′s ability to defend against jailbreaks be attributed to datasets containing stories about Jung’s shadow integration theory?

MiguelDev29 Feb 2024 5:13 UTC

7 points

2 comments11 min readLW link

Research Log, RLLMv3 (GPT2-XL, Phi-1.5 and Falcon-RW-1B)

MiguelDev15 Feb 2024 3:39 UTC

4 points

0 comments262 min readLW link

GPT2XL_RLLMv3 vs. BetterDAN, AI Machiavelli & Oppo Jailbreaks

MiguelDev11 Feb 2024 11:03 UTC

16 points

4 comments14 min readLW link

Research Log, RLLMv2: Phi-1.5, GPT2XL and Falcon-RW-1B as paperclip maximizers

MiguelDev20 Jan 2024 15:30 UTC

6 points

0 comments10 min readLW link

Reinforcement Learning using Layered Morphology (RLLM)

MiguelDev1 Dec 2023 5:18 UTC

7 points

0 comments29 min readLW link

An examination of GPT-2′s boring yet effective glitch

MiguelDev18 Apr 2024 5:26 UTC

5 points

3 comments3 min readLW link