technicalities

Karma: 1,026

The jailbreak argument against LLM values

technicalities 10 Nov 2025 12:05 UTC
24 points
2 comments, 6 min read

curate

technicalities 14 Jan 2025 14:40 UTC
12 points
0 comments, 2 min read

Shallow review of technical AI safety, 2024

29 Dec 2024 12:01 UTC
197 points
35 comments, 41 min read

“Safety as a Scientific Pursuit” (2024)

technicalities 23 Jan 2024 12:40 UTC
17 points
3 comments, 2 min read
(banburismus.substack.com)

Appendices to the live agendas

27 Nov 2023 11:10 UTC
16 points
4 comments, 1 min read

Shallow review of live agendas in alignment & safety

27 Nov 2023 11:10 UTC
349 points
73 comments, 29 min read, 1 review

ActAdd: Steering Language Models without Optimization

6 Sep 2023 17:21 UTC
105 points
3 comments, 2 min read
(arxiv.org)

Announcing the Alignment of Complex Systems Research Group

4 Jun 2022 4:10 UTC
92 points
20 comments, 5 min read

Case for emergency response teams

5 Apr 2022 12:45 UTC
24 points
0 comments, 5 min read
(forum.effectivealtruism.org)

Hinges and crises

29 Mar 2022 11:11 UTC
44 points
7 comments, 3 min read
(forum.effectivealtruism.org)

Experimental longtermism: theory needs data

24 Mar 2022 8:23 UTC
52 points
0 comments, 4 min read
(forum.effectivealtruism.org)

We have some evidence that masks work

technicalities 11 Jul 2021 18:36 UTC
97 points
13 comments, 5 min read

Self-help, hard and soft

technicalities 7 Jun 2020 15:39 UTC
16 points
0 comments, 2 min read

Automatic for the people

technicalities 8 Jul 2018 14:23 UTC
18 points
23 comments, 7 min read