24 Jan 2024 23:00 UTC

38 points

35 comments1 min readLW link

[Question] Will quantum randomness affect the 2028 election?

Thomas Kwa and habryka

24 Jan 2024 22:54 UTC

66 points

52 comments1 min readLW link

AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes

Dan H and Corin Katzke

24 Jan 2024 19:38 UTC

27 points

1 comment6 min readLW link

(newsletter.safe.ai)

Krueger Lab AI Safety Internship 2024

Joey Bream24 Jan 2024 19:17 UTC

3 points

0 comments1 min readLW link

Agents that act for reasons: a thought experiment

Michele Campolo24 Jan 2024 16:47 UTC

3 points

0 comments3 min readLW link

Impact Assessment of AI Safety Camp (Arb Research)

Samuel Holton24 Jan 2024 16:19 UTC

10 points

0 comments11 min readLW link

(forum.effectivealtruism.org)

The case for ensuring that powerful AIs are controlled

ryan_greenblatt and Buck

24 Jan 2024 16:11 UTC

290 points

74 comments28 min readLW link 1 review

LLMs can strategically deceive while doing gain-of-function research

Igor Ivanov24 Jan 2024 15:45 UTC

36 points

4 comments11 min readLW link

Monthly Roundup #14: January 2024

Zvi24 Jan 2024 12:50 UTC

38 points

22 comments44 min readLW link

(thezvi.wordpress.com)

This might be the last AI Safety Camp

Remmelt and Linda Linsefors

24 Jan 2024 9:33 UTC

198 points

34 comments1 min readLW link

Global LessWrong/AC10 Meetup on VRChat

Tomás B. and the gears to ascension

24 Jan 2024 5:44 UTC

15 points

2 comments1 min readLW link

Humans aren’t fleeb.

Charlie Steiner24 Jan 2024 5:31 UTC

38 points

5 comments2 min readLW link

A Paradigm Shift in Sustainability

Jose Miguel Cruz y Celis23 Jan 2024 23:34 UTC

5 points

0 comments18 min readLW link

From Finite Factors to Bayes Nets

J Bostock23 Jan 2024 20:03 UTC

38 points

7 comments8 min readLW link

Institutional economics through the lens of scale-free regulative development, morphogenesis, and cognitive science

Roman Leventov23 Jan 2024 19:42 UTC

8 points

0 comments14 min readLW link

Making a Secular Solstice Songbook

jefftk23 Jan 2024 19:40 UTC

38 points

6 comments1 min readLW link

(www.jefftk.com)

Simple Appreciations

Jonathan Moregård23 Jan 2024 16:23 UTC

17 points

11 comments4 min readLW link

(open.substack.com)

[Question] What environmental cues had you not seen them would have ended in disaster?

koratkar23 Jan 2024 14:59 UTC

26 points

4 comments1 min readLW link

Loneliness and suicide mitigation for students using GPT3-enabled chatbots (survey of Replika users in Nature)

Kaj_Sotala23 Jan 2024 14:05 UTC

46 points

2 comments2 min readLW link

(www.nature.com)

“Safety as a Scientific Pursuit” (2024)

technicalities23 Jan 2024 12:40 UTC

18 points

3 comments2 min readLW link

(banburismus.substack.com)

Brainstorming: Slow Takeoff

David Piepgrass23 Jan 2024 6:58 UTC

3 points

0 comments51 min readLW link

Reframing Acausal Trolling as Acausal Patronage

StrivingForLegibility23 Jan 2024 3:04 UTC

14 points

0 comments2 min readLW link

Orthogonality or the “Human Worth Hypothesis”?

Jeffs23 Jan 2024 0:57 UTC

21 points

31 comments3 min readLW link

the subreddit size threshold

bhauth23 Jan 2024 0:38 UTC

32 points

3 comments4 min readLW link

(www.bhauth.com)

Starting in mechanistic interpretability

Jakub Smékal22 Jan 2024 23:40 UTC

1 point

0 comments3 min readLW link

(jakubsmekal.com)

We need a Science of Evals

Marius Hobbhahn and Jérémy Scheurer

22 Jan 2024 20:30 UTC

75 points

13 comments9 min readLW link

Announcing the SoS Research Collective for independent researchers (and academics thinking independently)

rogersbacon22 Jan 2024 20:13 UTC

15 points

0 comments8 min readLW link

(www.theseedsofscience.pub)

A Brief Assessment of OpenAI’s Preparedness Framework & Some Suggestions for Improvement

simeon_c22 Jan 2024 20:08 UTC

14 points

0 comments6 min readLW link

(uploads-ssl.webflow.com)

D&D.Sci(-fi): Colonizing the SuperHyperSphere [Evaluation and Ruleset]

abstractapplic22 Jan 2024 19:20 UTC

40 points

7 comments3 min readLW link

′ petertodd’’s last stand: The final days of open GPT-3 research

mwatkins22 Jan 2024 18:47 UTC

109 points

16 comments45 min readLW link

InterLab – a toolkit for experiments with multi-agent interactions

Tomáš Gavenčiak, Ada Böhm and Jan_Kulveit

22 Jan 2024 18:23 UTC

69 points

0 comments8 min readLW link

(acsresearch.org)

San Fernando Valley Rationalist Meetup

Thomas Broadley22 Jan 2024 16:49 UTC

3 points

1 comment1 min readLW link

Who Organizes Dances?

jefftk22 Jan 2024 14:30 UTC

12 points

0 comments1 min readLW link

(www.jefftk.com)

Values Darwinism

pchvykov22 Jan 2024 10:44 UTC

11 points

13 comments3 min readLW link

[Question] The akrasia doom loop and executive function disorders: a question

TeaTieAndHat22 Jan 2024 7:01 UTC

20 points

7 comments2 min readLW link

Predicting AGI by the Turing Test

Yuxi_Liu22 Jan 2024 4:22 UTC

21 points

2 comments10 min readLW link

(yuxi-liu-wired.github.io)

Incorporating Justice Theory into Decision Theory

StrivingForLegibility21 Jan 2024 19:17 UTC

13 points

20 comments5 min readLW link

Deliberate Dysentery: Q&A about Human Challenge Trials

Niko_McCarty21 Jan 2024 19:05 UTC

16 points

1 comment18 min readLW link

(www.asimov.press)

When Does Altruism Strengthen Altruism?

jefftk21 Jan 2024 18:50 UTC

47 points

2 comments3 min readLW link

(www.jefftk.com)

A Shutdown Problem Proposal

johnswentworth and David Lorell

21 Jan 2024 18:12 UTC

126 points

67 comments6 min readLW link

Is principled mass-outreach possible, for AGI X-risk?

Nicholas Kross21 Jan 2024 17:45 UTC

9 points

5 comments3 min readLW link

Vacuum: Theory and Technologies

nomagicpill21 Jan 2024 17:23 UTC

34 points

0 comments25 min readLW link

(210ethan.github.io)

Another Non-Anthropic Paradox: The Unsurprising Rareness of Rare Events

Ape in the coat21 Jan 2024 15:58 UTC

21 points

16 comments6 min readLW link

Book review: Cuisine and Empire

eukaryote21 Jan 2024 6:15 UTC

40 points

2 comments12 min readLW link

(eukaryotewritesblog.com)

Catalogue of POLITICO Reports and Other Cited Articles on Effective Altruism and AI Safety Connections in Washington, DC

Evan_Gaensbauer21 Jan 2024 2:15 UTC

4 points

0 comments1 min readLW link

(docs.google.com)

You can rack up massive amounts of data quickly by asking questions to all your friends

Neil 21 Jan 2024 1:27 UTC

14 points

2 comments2 min readLW link

[Question] Party for biomedical rejuvenation research: European parliament elections

Iakov Dudinsky21 Jan 2024 0:35 UTC

2 points

0 comments1 min readLW link

[Question] Why have insurance markets succeeded where prediction markets have not?

JNank21 Jan 2024 0:35 UTC

13 points

13 comments1 min readLW link

[linkpost] Self-Rewarding Language Models

Jacob G-W21 Jan 2024 0:30 UTC

13 points

2 comments1 min readLW link

(arxiv.org)

Why Improving Dialogue Feels So Hard

matto20 Jan 2024 21:26 UTC

22 points

8 comments3 min readLW link