METR: Measuring AI Ability to Complete Long Tasks · Zach Stein-Perlman · 19 Mar 2025 16:00 UTC · 173 points, 44 comments, 1 min read · LW link (metr.org)

Why abortion looks more okay to us than killing babies · cousin_it · 24 Nov 2010 10:08 UTC · 25 points, 67 comments, 1 min read · LW link

[Question] How far along Metr's law can AI start automating or helping with alignment research? · Christopher King · 20 Mar 2025 15:58 UTC · 18 points, 17 comments, 1 min read · LW link

Blues, Greens and abortion · Snowyowl · 5 Mar 2011 19:15 UTC · 17 points, 158 comments, 1 min read · LW link

Why White-Box Redteaming Makes Me Feel Weird · Zygi Straznickas · 16 Mar 2025 18:54 UTC · 171 points, 28 comments, 3 min read · LW link

[Question] Any mistakes in my understanding of Transformers? · Kallistos · 21 Mar 2025 0:34 UTC · 1 point, 0 comments, 1 min read · LW link

The principle of genomic liberty · TsviBT · 19 Mar 2025 14:27 UTC · 87 points, 16 comments, 17 min read · LW link

A Critique of "Utility" · Zero Contradictions · 20 Mar 2025 23:21 UTC · −6 points, 1 comment, 2 min read · LW link (thewaywardaxolotl.blogspot.com)

Interpretability as Compression: Reconsidering SAE Explanations of Neural Activations with MDL-SAEs · Kola Ayonrinde, Michael Pearce and Lee Sharkey · 23 Aug 2024 18:52 UTC · 42 points, 8 comments, 16 min read · LW link

[Question] Why am I getting downvoted on Lesswrong? · Oxidize · 19 Mar 2025 18:32 UTC · 4 points, 13 comments, 1 min read · LW link

Counter-theses on Sleep · Natália · 21 Mar 2022 23:21 UTC · 447 points, 135 comments, 15 min read · LW link · 1 review

Algebraic Linguistics · abstractapplic · 7 Dec 2024 19:18 UTC · 35 points, 29 comments, 5 min read · LW link

How AI Takeover Might Happen in 2 Years · joshc · 7 Feb 2025 17:10 UTC · 391 points, 131 comments, 29 min read · LW link (x.com)

Intention to Treat · Alicorn · 20 Mar 2025 20:01 UTC · 70 points, 3 comments, 2 min read · LW link

FrontierMath Score of o3-mini Much Lower Than Claimed · YafahEdelman · 17 Mar 2025 22:41 UTC · 48 points, 7 comments, 1 min read · LW link

Why Are The Human Sciences Hard? Two New Hypotheses · Aydin Mohseni, Daniel Herrmann and ben_levinstein · 18 Mar 2025 15:45 UTC · 20 points, 7 comments, 9 min read · LW link

A Path out of Insufficient Views · Unreal · 24 Sep 2024 20:00 UTC · 40 points, 53 comments, 9 min read · LW link

Anthropic: Progress from our Frontier Red Team · UnofficialLinkpostBot · 20 Mar 2025 19:12 UTC · 2 points, 0 comments, 6 min read · LW link (www.anthropic.com)

The Geometry of Linear Regression versus PCA · criticalpoints · 23 Feb 2025 21:01 UTC · 20 points, 5 comments, 6 min read · LW link (eregis.github.io)

Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs · Jan Betley and Owain_Evans · 25 Feb 2025 17:39 UTC · 321 points, 88 comments, 4 min read · LW link