RSS

alamerton

Karma: 43

Let your ideas die so you don’t have to. Navigating parochialism. Fond of fallibilism.

I’m currently working as founder and research lead at Formation Research on technical interventions for lock-in risk, and part-time as a research assistant at King’s College London on clinical machine learning benchmarking.

My website is here.

Digi­tal Er­ror Cor­rec­tion and Lock-In

alamerton8 Apr 2025 15:46 UTC
1 point
0 comments5 min readLW link
(alfielamerton.substack.com)

Or­gani­sa­tion-Level Lock-In Risk Interventions

alamerton1 Apr 2025 12:42 UTC
5 points
0 comments8 min readLW link

Recom­mender Align­ment for Lock-In Risk

alamerton24 Mar 2025 12:56 UTC
2 points
0 comments7 min readLW link

Stac­ity: a Lock-In Risk Bench­mark for Large Lan­guage Models

alamerton13 Mar 2025 12:08 UTC
4 points
0 comments1 min readLW link
(huggingface.co)

Lock-In Threat Models

alamerton10 Mar 2025 10:22 UTC
5 points
0 comments8 min readLW link

What is Lock-In?

alamerton6 Mar 2025 11:09 UTC
5 points
0 comments9 min readLW link

For­ma­tion Re­search: Or­gani­sa­tion Overview

alamerton4 Mar 2025 15:03 UTC
5 points
0 comments11 min readLW link

In-Con­text Learn­ing: An Align­ment Survey

alamerton30 Sep 2024 18:44 UTC
8 points
0 comments20 min readLW link
(docs.google.com)

A Re­view of In-Con­text Learn­ing Hy­pothe­ses for Au­to­mated AI Align­ment Research

alamerton18 Apr 2024 18:29 UTC
25 points
4 comments16 min readLW link