
AE Studio

Karma: 1,261

AE Studio is a team of 160+ programmers, product designers, and data scientists focused on increasing human agency through neglected, high-impact approaches. Having first built a successful business in BCI development and consulting, we are now applying our expertise to AI alignment research, in the belief that the space of plausible alignment solutions is vast and under-explored.

Our alignment work includes prosociality research on self-modeling in neural systems (drawing in particular on attention schema theory), self-other overlap mechanisms, and various other neglected technical and policy approaches. We maintain a profitable consulting business that lets us fund and pursue promising but overlooked research directions without pressure to expedite AGI development.

Learn more about us and our mission here:
https://ae.studio/ai-alignment

AE Studio is hiring!

AE Studio, 21 Apr 2025 20:35 UTC
29 points
2 comments, 2 min read, LW link

Mistral Large 2 (123B) seems to exhibit alignment faking

27 Mar 2025 15:39 UTC
80 points
4 comments, 13 min read, LW link

Reducing LLM deception at scale with self-other overlap fine-tuning

13 Mar 2025 19:09 UTC
155 points
41 comments, 6 min read, LW link

Alignment can be the ‘clean energy’ of AI

22 Feb 2025 0:08 UTC
67 points
8 comments, 8 min read, LW link

Making a conservative case for alignment

15 Nov 2024 18:55 UTC
208 points
67 comments, 7 min read, LW link

Science advances one funeral at a time

1 Nov 2024 23:06 UTC
100 points
9 comments, 2 min read, LW link

Self-prediction acts as an emergent regularizer

23 Oct 2024 22:27 UTC
91 points
9 comments, 4 min read, LW link

The case for a negative alignment tax

18 Sep 2024 18:33 UTC
76 points
20 comments, 7 min read, LW link

Self-Other Overlap: A Neglected Approach to AI Alignment

30 Jul 2024 16:22 UTC
223 points
51 comments, 12 min read, LW link

Video Intro to Guaranteed Safe AI

11 Jul 2024 17:53 UTC
27 points
0 comments, 1 min read, LW link
(youtu.be)

DIY RLHF: A simple implementation for hands-on experience

10 Jul 2024 12:07 UTC
29 points
0 comments, 6 min read, LW link

Key takeaways from our EA and alignment research surveys

3 May 2024 18:10 UTC
112 points
10 comments, 21 min read, LW link

AE Studio @ SXSW: We need more AI consciousness research (and further resources)

26 Mar 2024 20:59 UTC
67 points
8 comments, 3 min read, LW link

Survey for alignment researchers!

2 Feb 2024 20:41 UTC
71 points
11 comments, 1 min read, LW link

The ‘Neglected Approaches’ Approach: AE Studio’s Alignment Agenda

18 Dec 2023 20:35 UTC
178 points
23 comments, 12 min read, LW link, 1 review