RSS

Aether

TagLast edit: 29 May 2026 12:38 UTC by Joey Yudelson

Aether is a small AI safety research organization.

Im­pli­ca­tions of Con­tinual Learn­ing for LLM Agents: Introduction

12 Jun 2026 18:36 UTC
46 points
0 comments6 min readLW link

What’s Con­tinual Learn­ing, and Why Might We Ex­pect To See It In Ad­vanced LLM Agents?

12 Jun 2026 18:43 UTC
27 points
2 comments17 min readLW link

How might con­tinual learn­ing af­fect safety and al­ign­ment?

13 Jun 2026 17:34 UTC
59 points
2 comments16 min readLW link

Ex­tract-and-Eval­u­ate Mon­i­tor­ing Can Sig­nifi­cantly En­hance CoT Mon­i­tor Perfor­mance (Re­search Note)

8 Aug 2025 10:41 UTC
52 points
7 comments10 min readLW link

A List of Re­search Direc­tions in Char­ac­ter Training

Rauno Arike19 Mar 2026 22:58 UTC
47 points
21 comments8 min readLW link

Hid­den Rea­son­ing in LLMs: A Taxonomy

25 Aug 2025 22:43 UTC
79 points
12 comments12 min readLW link

13 Ar­gu­ments About a Tran­si­tion to Neu­ralese AIs

Rauno Arike7 Nov 2025 16:19 UTC
50 points
14 comments10 min readLW link

Ex­plor­ing Re­in­force­ment Learn­ing Effects on Chain-of-Thought Legibility

6 Jan 2026 3:04 UTC
41 points
3 comments21 min readLW link

[Paper] How does in­for­ma­tion ac­cess af­fect LLM mon­i­tors’ abil­ity to de­tect sab­o­tage?

11 Feb 2026 21:25 UTC
26 points
0 comments6 min readLW link

Effi­ciently De­tect­ing Hid­den Rea­son­ing with a Small Pre­dic­tor Model

13 Jul 2025 16:04 UTC
34 points
3 comments16 min readLW link

Should We Train Against (CoT) Mon­i­tors?

RohanS23 Apr 2026 19:19 UTC
50 points
7 comments33 min readLW link

How we spent our first two weeks as an in­de­pen­dent AI safety re­search group

11 Aug 2025 19:32 UTC
34 points
0 comments10 min readLW link

We Should Study the Anal­ogy Between Inoc­u­la­tion Prompt­ing Non-Ro­bust­ness, Ne­ga­tion Ne­glect, and Back­door Non-Robustness

Vladimir Ivanov28 May 2026 19:17 UTC
17 points
3 comments4 min readLW link

Aether is hiring tech­ni­cal AI safety researchers

5 Jan 2026 22:27 UTC
22 points
0 comments2 min readLW link
No comments.