Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
[Question]
Is a random box of gas predictable after 20 seconds?
Thomas Kwa
and
habryka
Jan 24, 2024, 11:00 PM
37
points
35
comments
1
min read
LW
link
[Question]
Will quantum randomness affect the 2028 election?
Thomas Kwa
and
habryka
Jan 24, 2024, 10:54 PM
66
points
52
comments
1
min read
LW
link
AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes
Dan H
and
Corin Katzke
Jan 24, 2024, 7:38 PM
27
points
1
comment
6
min read
LW
link
(newsletter.safe.ai)
Krueger Lab AI Safety Internship 2024
Joey Bream
Jan 24, 2024, 7:17 PM
3
points
0
comments
1
min read
LW
link
Agents that act for reasons: a thought experiment
Michele Campolo
Jan 24, 2024, 4:47 PM
3
points
0
comments
3
min read
LW
link
Impact Assessment of AI Safety Camp (Arb Research)
Samuel Holton
Jan 24, 2024, 4:19 PM
10
points
0
comments
11
min read
LW
link
(forum.effectivealtruism.org)
The case for ensuring that powerful AIs are controlled
ryan_greenblatt
and
Buck
Jan 24, 2024, 4:11 PM
276
points
73
comments
28
min read
LW
link
LLMs can strategically deceive while doing gain-of-function research
Igor Ivanov
Jan 24, 2024, 3:45 PM
36
points
4
comments
11
min read
LW
link
Monthly Roundup #14: January 2024
Zvi
Jan 24, 2024, 12:50 PM
38
points
22
comments
44
min read
LW
link
(thezvi.wordpress.com)
This might be the last AI Safety Camp
Remmelt
and
Linda Linsefors
Jan 24, 2024, 9:33 AM
196
points
34
comments
1
min read
LW
link
Global LessWrong/AC10 Meetup on VRChat
Tomás B.
and
the gears to ascension
Jan 24, 2024, 5:44 AM
15
points
2
comments
1
min read
LW
link
Humans aren’t fleeb.
Charlie Steiner
Jan 24, 2024, 5:31 AM
37
points
5
comments
2
min read
LW
link
A Paradigm Shift in Sustainability
Jose Miguel Cruz y Celis
Jan 23, 2024, 11:34 PM
5
points
0
comments
18
min read
LW
link
From Finite Factors to Bayes Nets
J Bostock
Jan 23, 2024, 8:03 PM
38
points
7
comments
8
min read
LW
link
Institutional economics through the lens of scale-free regulative development, morphogenesis, and cognitive science
Roman Leventov
Jan 23, 2024, 7:42 PM
8
points
0
comments
14
min read
LW
link
Making a Secular Solstice Songbook
jefftk
Jan 23, 2024, 7:40 PM
38
points
6
comments
1
min read
LW
link
(www.jefftk.com)
Simple Appreciations
Jonathan Moregård
Jan 23, 2024, 4:23 PM
17
points
11
comments
4
min read
LW
link
(open.substack.com)
[Question]
What environmental cues had you not seen them would have ended in disaster?
koratkar
Jan 23, 2024, 2:59 PM
11
points
1
comment
1
min read
LW
link
Loneliness and suicide mitigation for students using GPT3-enabled chatbots (survey of Replika users in Nature)
Kaj_Sotala
Jan 23, 2024, 2:05 PM
45
points
2
comments
2
min read
LW
link
(www.nature.com)
“Safety as a Scientific Pursuit” (2024)
technicalities
Jan 23, 2024, 12:40 PM
17
points
3
comments
2
min read
LW
link
(banburismus.substack.com)
Brainstorming: Slow Takeoff
David Piepgrass
Jan 23, 2024, 6:58 AM
3
points
0
comments
51
min read
LW
link
Reframing Acausal Trolling as Acausal Patronage
StrivingForLegibility
Jan 23, 2024, 3:04 AM
14
points
0
comments
2
min read
LW
link
Orthogonality or the “Human Worth Hypothesis”?
Jeffs
Jan 23, 2024, 12:57 AM
21
points
31
comments
3
min read
LW
link
the subreddit size threshold
bhauth
Jan 23, 2024, 12:38 AM
32
points
3
comments
4
min read
LW
link
(www.bhauth.com)
Starting in mechanistic interpretability
Jakub Smékal
Jan 22, 2024, 11:40 PM
1
point
0
comments
3
min read
LW
link
(jakubsmekal.com)
We need a Science of Evals
Marius Hobbhahn
and
Jérémy Scheurer
Jan 22, 2024, 8:30 PM
71
points
13
comments
9
min read
LW
link
Announcing the SoS Research Collective for independent researchers (and academics thinking independently)
rogersbacon
Jan 22, 2024, 8:13 PM
15
points
0
comments
8
min read
LW
link
(www.theseedsofscience.pub)
A Brief Assessment of OpenAI’s Preparedness Framework & Some Suggestions for Improvement
simeon_c
Jan 22, 2024, 8:08 PM
14
points
0
comments
6
min read
LW
link
(uploads-ssl.webflow.com)
D&D.Sci(-fi): Colonizing the SuperHyperSphere [Evaluation and Ruleset]
abstractapplic
Jan 22, 2024, 7:20 PM
40
points
7
comments
3
min read
LW
link
′ petertodd’’s last stand: The final days of open GPT-3 research
mwatkins
Jan 22, 2024, 6:47 PM
109
points
16
comments
45
min read
LW
link
InterLab – a toolkit for experiments with multi-agent interactions
Tomáš Gavenčiak
,
Ada Böhm
and
Jan_Kulveit
Jan 22, 2024, 6:23 PM
69
points
0
comments
8
min read
LW
link
(acsresearch.org)
San Fernando Valley Rationalist Meetup
Thomas Broadley
Jan 22, 2024, 4:49 PM
3
points
1
comment
1
min read
LW
link
Who Organizes Dances?
jefftk
Jan 22, 2024, 2:30 PM
12
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Values Darwinism
pchvykov
Jan 22, 2024, 10:44 AM
11
points
13
comments
3
min read
LW
link
[Question]
The akrasia doom loop and executive function disorders: a question
TeaTieAndHat
Jan 22, 2024, 7:01 AM
18
points
7
comments
2
min read
LW
link
Predicting AGI by the Turing Test
Yuxi_Liu
Jan 22, 2024, 4:22 AM
21
points
2
comments
10
min read
LW
link
(yuxi-liu-wired.github.io)
Incorporating Justice Theory into Decision Theory
StrivingForLegibility
Jan 21, 2024, 7:17 PM
13
points
20
comments
5
min read
LW
link
Deliberate Dysentery: Q&A about Human Challenge Trials
Niko_McCarty
Jan 21, 2024, 7:05 PM
16
points
1
comment
18
min read
LW
link
(www.asimov.press)
When Does Altruism Strengthen Altruism?
jefftk
Jan 21, 2024, 6:50 PM
44
points
2
comments
3
min read
LW
link
(www.jefftk.com)
A Shutdown Problem Proposal
johnswentworth
and
David Lorell
Jan 21, 2024, 6:12 PM
125
points
61
comments
6
min read
LW
link
Is principled mass-outreach possible, for AGI X-risk?
Nicholas / Heather Kross
Jan 21, 2024, 5:45 PM
9
points
5
comments
3
min read
LW
link
Vacuum: Theory and Technologies
nomagicpill
Jan 21, 2024, 5:23 PM
33
points
0
comments
25
min read
LW
link
(210ethan.github.io)
Another Non-Anthropic Paradox: The Unsurprising Rareness of Rare Events
Ape in the coat
Jan 21, 2024, 3:58 PM
19
points
16
comments
6
min read
LW
link
Book review: Cuisine and Empire
eukaryote
Jan 21, 2024, 6:15 AM
40
points
2
comments
12
min read
LW
link
(eukaryotewritesblog.com)
Catalogue of POLITICO Reports and Other Cited Articles on Effective Altruism and AI Safety Connections in Washington, DC
Evan_Gaensbauer
Jan 21, 2024, 2:15 AM
4
points
0
comments
LW
link
(docs.google.com)
You can rack up massive amounts of data quickly by asking questions to all your friends
Neil
Jan 21, 2024, 1:27 AM
14
points
2
comments
2
min read
LW
link
[Question]
Party for biomedical rejuvenation research: European parliament elections
Iakov Dudinsky
Jan 21, 2024, 12:35 AM
1
point
0
comments
1
min read
LW
link
[Question]
Why have insurance markets succeeded where prediction markets have not?
JNank
Jan 21, 2024, 12:35 AM
13
points
13
comments
1
min read
LW
link
[linkpost] Self-Rewarding Language Models
Jacob G-W
Jan 21, 2024, 12:30 AM
13
points
2
comments
1
min read
LW
link
(arxiv.org)
Why Improving Dialogue Feels So Hard
matto
Jan 20, 2024, 9:26 PM
21
points
8
comments
3
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel