Aether
Tag
Last edit: 29 May 2026 12:38 UTC by Joey Yudelson
Aether is a small AI safety research organization.
Implications of Continual Learning for LLM Agents: Introduction
RohanS, Rauno Arike, Owen Terry, Achu Menon, Zhijing Jin, Francis Rhys Ward and Seth Herd
12 Jun 2026 18:36 UTC, 46 points, 0 comments, 6 min read
What’s Continual Learning, and Why Might We Expect To See It In Advanced LLM Agents?
RohanS, Rauno Arike, Owen Terry, Achu Menon, Zhijing Jin, Francis Rhys Ward and Seth Herd
12 Jun 2026 18:43 UTC, 27 points, 2 comments, 17 min read
How might continual learning affect safety and alignment?
Rauno Arike, RohanS, Owen Terry, Achu Menon, Zhijing Jin, Francis Rhys Ward and Seth Herd
13 Jun 2026 17:34 UTC, 59 points, 2 comments, 16 min read
Extract-and-Evaluate Monitoring Can Significantly Enhance CoT Monitor Performance (Research Note)
Rauno Arike, RohanS and Shubhorup Biswas
8 Aug 2025 10:41 UTC, 52 points, 7 comments, 10 min read
A List of Research Directions in Character Training
Rauno Arike
19 Mar 2026 22:58 UTC, 47 points, 21 comments, 8 min read
Hidden Reasoning in LLMs: A Taxonomy
Rauno Arike, RohanS and Shubhorup Biswas
25 Aug 2025 22:43 UTC, 79 points, 12 comments, 12 min read
13 Arguments About a Transition to Neuralese AIs
Rauno Arike
7 Nov 2025 16:19 UTC, 50 points, 14 comments, 10 min read
Exploring Reinforcement Learning Effects on Chain-of-Thought Legibility
Julian H, RohanS, Baram Sosis, vedant-badoni and The-Turtle
6 Jan 2026 3:04 UTC, 41 points, 3 comments, 21 min read
[Paper] How does information access affect LLM monitors’ ability to detect sabotage?
Rauno Arike, Raja Moreno, RohanS, Shubhorup Biswas and Francis Rhys Ward
11 Feb 2026 21:25 UTC, 26 points, 0 comments, 6 min read
Efficiently Detecting Hidden Reasoning with a Small Predictor Model
RohanS, Vishnu Vardhan Sai Lanka, yaumeng and daria
13 Jul 2025 16:04 UTC, 34 points, 3 comments, 16 min read
Should We Train Against (CoT) Monitors?
RohanS
23 Apr 2026 19:19 UTC, 50 points, 7 comments, 33 min read
How we spent our first two weeks as an independent AI safety research group
RohanS, Rauno Arike and Shubhorup Biswas
11 Aug 2025 19:32 UTC, 34 points, 0 comments, 10 min read
We Should Study the Analogy Between Inoculation Prompting Non-Robustness, Negation Neglect, and Backdoor Non-Robustness
Vladimir Ivanov
28 May 2026 19:17 UTC, 17 points, 3 comments, 4 min read
Aether is hiring technical AI safety researchers
Rauno Arike, RohanS and Shubhorup Biswas
5 Jan 2026 22:27 UTC, 22 points, 0 comments, 2 min read