Cameron Berg

Karma: 1,955

Currently doing alignment and digital minds research @AE Studio

Meta AI Resident ’23, Cognitive science @ Yale ’22, SERI MATS ’21, LTFF grantee.

Very interested in work at the intersection of AI x cognitive science x alignment x philosophy.

The Mirror Trap

Cameron Berg, 6 Jun 2025 22:30 UTC
94 points
13 comments, 4 min read

Mistral Large 2 (123B) seems to exhibit alignment faking

27 Mar 2025 15:39 UTC
81 points
4 comments, 13 min read

Reducing LLM deception at scale with self-other overlap fine-tuning

13 Mar 2025 19:09 UTC
162 points
46 comments, 6 min read

Alignment can be the ‘clean energy’ of AI

22 Feb 2025 0:08 UTC
69 points
8 comments, 8 min read

Making a conservative case for alignment

15 Nov 2024 18:55 UTC
208 points
67 comments, 7 min read

Science advances one funeral at a time

1 Nov 2024 23:06 UTC
100 points
9 comments, 2 min read