habryka (Oliver Habryka)

Karma: 34,359

Running Lightcone Infrastructure, which runs LessWrong. You can reach me at habryka@lesswrong.com. I have signed no contracts or agreements whose existence I cannot mention.

Open Thread Summer 2024

habryka

11 Jun 2024 20:57 UTC

18 points

46 comments

1 min read

“AI Safety for Fleshy Humans” an AI Safety explainer by Nicky Case

habryka

3 May 2024 18:10 UTC

84 points

10 comments

4 min read

(aisafety.dance)

Goal oriented cognition in “a single forward pass”

dxu and habryka

22 Apr 2024 5:03 UTC

19 points

14 comments

26 min read

Express interest in an “FHI of the West”

habryka

18 Apr 2024 3:32 UTC

264 points

41 comments

3 min read

Structured Transparency: a framework for addressing use/mis-use trade-offs when sharing information

habryka

11 Apr 2024 18:35 UTC

25 points

0 comments

2 min read

(arxiv.org)

LessWrong’s (first) album: I Have Been A Good Bing

habryka and kave

1 Apr 2024 7:33 UTC

556 points

173 comments

11 min read

How useful is “AI Control” as a framing on AI X-Risk?

habryka and ryan_greenblatt

14 Mar 2024 18:06 UTC

68 points

4 comments

34 min read

Open Thread Spring 2024

habryka

11 Mar 2024 19:17 UTC

22 points

160 comments

1 min read

[Question] Is a random box of gas predictable after 20 seconds?

Thomas Kwa and habryka

24 Jan 2024 23:00 UTC

37 points

35 comments

1 min read

[Question] Will quantum randomness affect the 2028 election?

Thomas Kwa and habryka

24 Jan 2024 22:54 UTC

65 points

52 comments

1 min read

Vote in the LessWrong review! (LW 2022 Review voting phase)

habryka

17 Jan 2024 7:22 UTC

26 points

9 comments

2 min read

AI Impacts 2023 Expert Survey on Progress in AI

habryka

5 Jan 2024 19:42 UTC

28 points

1 comment

7 min read

(wiki.aiimpacts.org)

Originality vs. Correctness

alkjash and habryka

6 Dec 2023 18:51 UTC

60 points

16 comments

25 min read

The LessWrong 2022 Review

habryka

5 Dec 2023 4:00 UTC

115 points

43 comments

4 min read

Open Thread – Winter 2023/2024

habryka

4 Dec 2023 22:59 UTC

35 points

160 comments

1 min read

Complex systems research as a field (and its relevance to AI Alignment)

Nora_Ammann and habryka

1 Dec 2023 22:10 UTC

64 points

9 comments

19 min read

How useful is mechanistic interpretability?

ryan_greenblatt, Neel Nanda, Buck and habryka

1 Dec 2023 2:54 UTC

160 points

53 comments

25 min read

My techno-optimism [By Vitalik Buterin]

habryka

27 Nov 2023 23:53 UTC

104 points

17 comments

2 min read

(www.lesswrong.com)

“Epistemic range of motion” and LessWrong moderation

habryka and Gabriel Alfour

27 Nov 2023 21:58 UTC

60 points

3 comments

12 min read

Debate helps supervise human experts [Paper]

habryka

17 Nov 2023 5:25 UTC

29 points

6 comments

1 min read

(github.com)