Making the “stance” explicit
NicholasKees
Feb 16, 2024, 11:57 PM
23
points
3
comments
2
min read
LW
link
2023 Survey Results
Screwtape
Feb 16, 2024, 10:24 PM
150
points
26
comments
44
min read
LW
link
Physics-based early warning signal shows that AMOC is on tipping course
Annapurna
Feb 16, 2024, 10:07 PM
19
points
3
comments
1
min read
LW
link
(www.science.org)
Kingfisher Winter Tour 2024
jefftk
Feb 16, 2024, 9:40 PM
8
points
0
comments
1
min read
LW
link
(www.jefftk.com)
The Pointer Resolution Problem
Jozdien
Feb 16, 2024, 9:25 PM
41
points
20
comments
3
min read
LW
link
Every “Every Bay Area House Party” Bay Area House Party
Richard_Ngo
Feb 16, 2024, 6:53 PM
181
points
6
comments
4
min read
LW
link
“No-one in my org puts money in their pension”
Tobes
Feb 16, 2024, 6:33 PM
272
points
16
comments
9
min read
LW
link
(seekingtobejolly.substack.com)
Addressing Feature Suppression in SAEs
Benjamin Wright
and
Lee Sharkey
Feb 16, 2024, 6:32 PM
86
points
4
comments
10
min read
LW
link
Retrospective: PIBBSS Fellowship 2023
DusanDNesic
and
Nora_Ammann
Feb 16, 2024, 5:48 PM
31
points
1
comment
8
min read
LW
link
Fatebook for Chrome: Make and embed forecasts anywhere on the web
Adam B
and
Sage Future
Feb 16, 2024, 4:08 PM
14
points
3
comments
1
min read
LW
link
“Arctic Instincts? The universal principles of Arctic psychological adaptation and the origins of East Asian psychology”—Call for Reviewers (Seeds of Science)
rogersbacon
Feb 16, 2024, 3:02 PM
0
points
0
comments
2
min read
LW
link
The Altman Technocracy
PhilosophicalSoul
Feb 16, 2024, 1:27 PM
5
points
31
comments
2
min read
LW
link
Discord space for people with FTX clawbacks/claims request
kotrfa
Feb 16, 2024, 9:04 AM
1
point
0
comments
1
min read
LW
link
(forum.effectivealtruism.org)
OpenAI’s Sora is an agent
Caleb Biddulph
Feb 16, 2024, 7:35 AM
97
points
25
comments
4
min read
LW
link
Massapequa (Long Island), New York – ACX/SSC Meetup
Gabriel Weil
Feb 16, 2024, 1:24 AM
4
points
0
comments
1
min read
LW
link
Offering AI safety support calls for ML professionals
Vael Gates
Feb 15, 2024, 11:48 PM
61
points
1
comment
LW
link
7. Evolution and Ethics
RogerDearnaley
Feb 15, 2024, 11:38 PM
3
points
7
comments
6
min read
LW
link
Mapping the semantic void III: Exploring neighbourhoods
mwatkins
Feb 15, 2024, 11:01 PM
13
points
0
comments
10
min read
LW
link
Mapping the semantic void II: Above, below and between token embeddings
mwatkins
Feb 15, 2024, 11:00 PM
31
points
4
comments
10
min read
LW
link
Raising children on the eve of AI
juliawise
Feb 15, 2024, 9:28 PM
275
points
47
comments
5
min read
LW
link
What’s happening behind the scenes with my HowTruthful project
Bruce Lewis
Feb 15, 2024, 6:27 PM
7
points
0
comments
3
min read
LW
link
Gemini 1.5 released
Cole Wyeth
Feb 15, 2024, 6:02 PM
19
points
3
comments
1
min read
LW
link
(blog.google)
AI play for the next 3 years: Lemonade Insurance
Prin (Premek) Paska
Feb 15, 2024, 1:48 PM
2
points
4
comments
1
min read
LW
link
(docs.google.com)
Collection of Scientific and Other Classifications
niplav
Feb 15, 2024, 12:58 PM
16
points
0
comments
1
min read
LW
link
“Open Source AI” isn’t Open Source
Davidmanheim
Feb 15, 2024, 8:59 AM
18
points
16
comments
1
min read
LW
link
(davidmanheim.substack.com)
Research Log, RLLMv3 (GPT2-XL, Phi-1.5 and Falcon-RW-1B)
MiguelDev
Feb 15, 2024, 3:39 AM
4
points
0
comments
262
min read
LW
link
11 diceware words is enough
DanielFilan
and
benwr
Feb 15, 2024, 12:13 AM
23
points
6
comments
1
min read
LW
link
(threadreaderapp.com)
Searching for Searching for Search
Rubi J. Hudson
Feb 14, 2024, 11:51 PM
21
points
4
comments
7
min read
LW
link
Some questions for the people at 80,000 Hours
yanni kyriacos
Feb 14, 2024, 11:15 PM
1
point
0
comments
1
min read
LW
link
(forum.effectivealtruism.org)
Disrupting malicious uses of AI by state-affiliated threat actors
agucova
Feb 14, 2024, 9:28 PM
11
points
2
comments
LW
link
(openai.com)
Critiques of the AI control agenda
Jozdien
Feb 14, 2024, 7:25 PM
48
points
14
comments
9
min read
LW
link
Bad business advice
Logan Kieller
Feb 14, 2024, 5:01 PM
12
points
2
comments
3
min read
LW
link
(logankieller.substack.com)
Examples of governments doing good in house (or contracted) technical research
NathanBarnard
Feb 14, 2024, 4:22 PM
12
points
2
comments
2
min read
LW
link
[Question]
How can we legally/illegally enhance the progress of the law of accelerating returns in AI learning?
Gabi QUENE
Feb 14, 2024, 11:06 AM
−25
points
0
comments
1
min read
LW
link
[Question]
What experiment settles the Gary Marcus vs Geoffrey Hinton debate?
Valentin Baltadzhiev
Feb 14, 2024, 9:06 AM
12
points
8
comments
1
min read
LW
link
[Question]
Optimizing for Agency?
Michael Soareverix
Feb 14, 2024, 8:31 AM
10
points
9
comments
2
min read
LW
link
Requirements for a Basin of Attraction to Alignment
RogerDearnaley
Feb 14, 2024, 7:10 AM
41
points
12
comments
31
min read
LW
link
FTX expects to return all customer money; clawbacks may go away
Mikhail Samin
Feb 14, 2024, 3:43 AM
33
points
1
comment
LW
link
(www.nytimes.com)
Scale Was All We Needed, At First
Gabe M
Feb 14, 2024, 1:49 AM
295
points
34
comments
8
min read
LW
link
(aiacumen.substack.com)
CFAR Takeaways: Andrew Critch
Raemon
Feb 14, 2024, 1:37 AM
217
points
64
comments
5
min read
LW
link
Meetup In a Box: Year In Review
Czynski
Feb 14, 2024, 1:18 AM
26
points
1
comment
4
min read
LW
link
An EA used deceptive messaging to advance their project; we need mechanisms to avoid deontologically dubious plans
Mikhail Samin
Feb 13, 2024, 11:15 PM
24
points
1
comment
LW
link
Useful starting code for interpretability
eggsyntax
Feb 13, 2024, 11:13 PM
26
points
2
comments
1
min read
LW
link
Masterpiece
Richard_Ngo
Feb 13, 2024, 11:10 PM
166
points
21
comments
4
min read
LW
link
(www.narrativeark.xyz)
A Bridge Between Utilitarianism & Stoicism
Jonathan Moregård
Feb 13, 2024, 10:46 PM
5
points
0
comments
5
min read
LW
link
(honestliving.substack.com)
The “context window” analogy for human minds
Ruby
Feb 13, 2024, 7:29 PM
38
points
0
comments
2
min read
LW
link
More on the Apple Vision Pro
Zvi
Feb 13, 2024, 5:40 PM
33
points
5
comments
8
min read
LW
link
(thezvi.wordpress.com)
Linear White
Teja Prabhu
Feb 13, 2024, 4:31 PM
−3
points
3
comments
3
min read
LW
link
(krez.expert)
Causality is Everywhere
silentbob
Feb 13, 2024, 1:44 PM
26
points
12
comments
8
min read
LW
link
Technologies and Terminology: AI isn’t Software, it’s… Deepware?
Davidmanheim
and
abramdemski
Feb 13, 2024, 1:37 PM
40
points
10
comments
8
min read
LW
link