alamerton

Karma: 50

Let your ideas die so you don’t have to. Navigating parochialism. Fond of fallibilism.

I’m currently working as founder and research lead at Formation Research on technical interventions for lock-in risk, and part-time as a research assistant at King’s College London on clinical machine learning benchmarking.

My website is here.

Digital Error Correction and Lock-In

alamerton8 Apr 2025 15:46 UTC

1 point

0 comments5 min readLW link

(alfielamerton.substack.com)

Organisation-Level Lock-In Risk Interventions

alamerton1 Apr 2025 12:42 UTC

5 points

0 comments8 min readLW link

Recommender Alignment for Lock-In Risk

alamerton24 Mar 2025 12:56 UTC

8 points

0 comments7 min readLW link

Stacity: a Lock-In Risk Benchmark for Large Language Models

alamerton13 Mar 2025 12:08 UTC

4 points

0 comments1 min readLW link

(huggingface.co)

Lock-In Threat Models

alamerton10 Mar 2025 10:22 UTC

5 points

0 comments8 min readLW link

What is Lock-In?

alamerton6 Mar 2025 11:09 UTC

5 points

0 comments9 min readLW link

Formation Research: Organisation Overview

alamerton4 Mar 2025 15:03 UTC

6 points

0 comments11 min readLW link

In-Context Learning: An Alignment Survey

alamerton30 Sep 2024 18:44 UTC

8 points

0 comments20 min readLW link

(docs.google.com)

alamerton 19 Apr 2024 19:08 UTC
1 point
0
in reply to: Aaron_Scher’s comment on: A Review of In-Context Learning Hypotheses for Automated AI Alignment Research
I think I mean to say this would imply ICL could not be a new form of learning. And yes, it seems more likely that there could be at least some new knowledge getting generated, one way or another. BI implying all tasks have been previously seen feels extreme, and less likely. I’ve adjusted my wording a bit now.

A Review of In-Context Learning Hypotheses for Automated AI Alignment Research

alamerton18 Apr 2024 18:29 UTC

25 points

4 comments16 min readLW link