2023 ACX Meetups Everywhere—Newton, MA
duck_master
Aug 9, 2023, 10:47 PM
6
points
2
comments
1
min read
LW
link
Progress links digest, 2023-08-09: US adds new nuclear, Katalin Karikó interview, and more
jasoncrawford
Aug 9, 2023, 7:22 PM
18
points
0
comments
3
min read
LW
link
(rootsofprogress.org)
Mech Interp Challenge: August—Deciphering the First Unique Character Model
CallumMcDougall
Aug 9, 2023, 7:14 PM
36
points
1
comment
3
min read
LW
link
Real Meaning of life has been found. Eliezer discovered it in 2000′s.
Jorterder
Aug 9, 2023, 6:13 PM
−15
points
1
comment
1
min read
LW
link
(docs.google.com)
Marginal Revolution unofficial birthday party
Derek M. Jones
Aug 9, 2023, 2:35 PM
4
points
0
comments
1
min read
LW
link
A content analysis of the SQ-R questionnaire and a proposal for testing EQ-SQ theory
tailcalled
Aug 9, 2023, 1:51 PM
10
points
2
comments
13
min read
LW
link
[Question]
Does LessWrong allow exempting posts from being scraped by GPTBot?
mic
Aug 9, 2023, 1:02 PM
29
points
3
comments
1
min read
LW
link
If I Was An Eccentric Trillionaire
niplav
Aug 9, 2023, 7:56 AM
9
points
8
comments
26
min read
LW
link
Modulating sycophancy in an RLHF model via activation steering
Nina Panickssery
Aug 9, 2023, 7:06 AM
69
points
20
comments
12
min read
LW
link
Open Thread—August 2023
habryka
Aug 9, 2023, 3:52 AM
18
points
49
comments
1
min read
LW
link
marine cloud brightening
bhauth
Aug 9, 2023, 2:50 AM
40
points
14
comments
3
min read
LW
link
(www.bhauth.com)
Inflection.ai is a major AGI lab
Nikola Jurkovic
Aug 9, 2023, 1:05 AM
137
points
13
comments
2
min read
LW
link
Acausal Now: We could totally acausally bargain with aliens at our current tech level if desired
Christopher King
Aug 9, 2023, 12:50 AM
1
point
5
comments
4
min read
LW
link
Necromancy’s unintended consequences.
Christopher King
Aug 9, 2023, 12:08 AM
−6
points
2
comments
2
min read
LW
link
What’s A “Market”?
johnswentworth
Aug 8, 2023, 11:29 PM
94
points
16
comments
10
min read
LW
link
Podcast (+transcript): Nathan Barnard on how US financial regulation can inform AI governance
Aaron Bergman
Aug 8, 2023, 9:46 PM
8
points
0
comments
LW
link
(www.aaronbergman.net)
What are the flaws in this argument about p(Doom)?
William the Kiwi
Aug 8, 2023, 8:34 PM
−2
points
26
comments
1
min read
LW
link
A Simple Theory Of Consciousness
SherlockHolmes
Aug 8, 2023, 6:05 PM
2
points
5
comments
1
min read
LW
link
(peterholmes.medium.com)
[Linkpost] Rationally awake
jpc
Aug 8, 2023, 5:59 PM
−1
points
0
comments
4
min read
LW
link
(jpc.dev)
Yet more UFO Betting: Put Up or Shut Up
MoreRatsWrongReUAP
Aug 8, 2023, 5:50 PM
10
points
18
comments
1
min read
LW
link
AISN #18: Challenges of Reinforcement Learning from Human Feedback, Microsoft’s Security Breach, and Conceptual Research on AI Safety
Dan H
Aug 8, 2023, 3:52 PM
13
points
0
comments
LW
link
(newsletter.safe.ai)
[Question]
Beginner’s question about RLHF
FTPickle
Aug 8, 2023, 3:48 PM
1
point
3
comments
1
min read
LW
link
My Trial Period as an Independent Alignment Researcher
Bart Bussmann
Aug 8, 2023, 2:16 PM
34
points
1
comment
3
min read
LW
link
4 types of AGI selection, and how to constrain them
Remmelt
Aug 8, 2023, 10:02 AM
−4
points
3
comments
3
min read
LW
link
Notice your everything
metachirality
Aug 8, 2023, 2:38 AM
15
points
1
comment
2
min read
LW
link
Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research
evhub, Nicholas Schiefer, Carson Denison and Ethan Perez
Aug 8, 2023, 1:30 AM
318
points
30
comments
18
min read
LW
link
1
review
Perpetually Declining Population?
jefftk
Aug 8, 2023, 1:30 AM
48
points
29
comments
3
min read
LW
link
(www.jefftk.com)
[Question]
How do I find all the items on LW that I’ve *favorited* or upvoted?
Alex K. Chen (parrot)
Aug 7, 2023, 11:51 PM
14
points
3
comments
1
min read
LW
link
A plea for more funding shortfall transparency
porby
Aug 7, 2023, 9:33 PM
73
points
4
comments
2
min read
LW
link
[Question]
Tips for reducing thinking branching factor
Simon Berens
Aug 7, 2023, 8:21 PM
4
points
6
comments
1
min read
LW
link
An interactive introduction to grokking and mechanistic interpretability
Adam Pearce and Asma Ghandeharioun
Aug 7, 2023, 7:09 PM
23
points
3
comments
1
min read
LW
link
(pair.withgoogle.com)
Feedbackloop-first Rationality
Raemon
Aug 7, 2023, 5:58 PM
205
points
69
comments
8
min read
LW
link
2
reviews
Growing Bonsai Networks with RNNs
ameo
Aug 7, 2023, 5:34 PM
21
points
5
comments
1
min read
LW
link
(cprimozic.net)
[Question]
Should I test myself for microplastics?
Augs
Aug 7, 2023, 5:31 PM
9
points
2
comments
1
min read
LW
link
Optimisation Measures: Desiderata, Impossibility, Proposals
mattmacdermott and Alexander Gietelink Oldenziel
Aug 7, 2023, 3:52 PM
36
points
9
comments
1
min read
LW
link
Announcing the Clearer Thinking micro-grants program for 2023
spencerg
Aug 7, 2023, 3:21 PM
14
points
1
comment
1
min read
LW
link
(www.clearerthinking.org)
What I’ve been reading, July–August 2023
jasoncrawford
Aug 7, 2023, 2:22 PM
23
points
0
comments
13
min read
LW
link
(rootsofprogress.org)
Monthly Roundup #9: August 2023
Zvi
Aug 7, 2023, 1:20 PM
42
points
25
comments
57
min read
LW
link
(thezvi.wordpress.com)
Strengthening the Argument for Intrinsic AI Safety: The S-Curves Perspective
avturchin
Aug 7, 2023, 1:13 PM
8
points
0
comments
12
min read
LW
link
Overview of how AI might exacerbate long-running catastrophic risks
Hauke Hillebrandt
Aug 7, 2023, 11:53 AM
20
points
0
comments
11
min read
LW
link
(aisafetyfundamentals.com)
Drinks at a bar
yakimoff
Aug 7, 2023, 2:52 AM
3
points
0
comments
1
min read
LW
link
Problems with Robin Hanson’s Quillette Article On AI
DaemonicSigil
Aug 6, 2023, 10:13 PM
89
points
33
comments
8
min read
LW
link
Yann LeCun on AGI and AI Safety
Chris_Leong
Aug 6, 2023, 9:56 PM
37
points
13
comments
1
min read
LW
link
(drive.google.com)
Computational Thread Art
CallumMcDougall
Aug 6, 2023, 9:42 PM
76
points
2
comments
6
min read
LW
link
‘We’re changing the clouds.’ An unforeseen test of geoengineering is fueling record ocean warmth
Annapurna
Aug 6, 2023, 8:58 PM
60
points
6
comments
1
min read
LW
link
(www.science.org)
[Linkpost] Will AI avoid exploitation?
cdkg
Aug 6, 2023, 2:28 PM
22
points
1
comment
1
min read
LW
link
Reducing the risk of catastrophically misaligned AI by avoiding the Singleton scenario: the Manyton Variant
GravitasGradient
Aug 6, 2023, 2:24 PM
−6
points
0
comments
3
min read
LW
link
Rebooting AI Governance: An AI-Driven Approach to AI Governance
utilon
Aug 6, 2023, 2:19 PM
1
point
1
comment
29
min read
LW
link
(forum.effectivealtruism.org)
Model-Based Policy Analysis under Deep Uncertainty
utilon
Aug 6, 2023, 2:07 PM
16
points
1
comment
23
min read
LW
link
(forum.effectivealtruism.org)
[Question]
On being in a bad place and too stubborn to leave.
TeaTieAndHat
Aug 6, 2023, 11:45 AM
12
points
14
comments
3
min read
LW
link