Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
[Question]
Looking for ideas of public assets (stocks, funds, ETFs) that I can invest in to have a chance at profiting from the mass adoption and commercialization of AI technology
Annapurna
Dec 7, 2022, 10:35 PM
15
points
9
comments
1
min read
LW
link
A Fallibilist Wordview
Toni MUENDEL
Dec 7, 2022, 8:59 PM
−13
points
2
comments
13
min read
LW
link
Thoughts on AGI organizations and capabilities work
Rob Bensinger
and
So8res
Dec 7, 2022, 7:46 PM
102
points
17
comments
5
min read
LW
link
How to Think About Climate Models and How to Improve Them
clans
Dec 7, 2022, 7:37 PM
7
points
0
comments
2
min read
LW
link
(locationtbd.home.blog)
The novelty quotient
River Lewis
Dec 7, 2022, 5:16 PM
4
points
7
comments
2
min read
LW
link
(heytraveler.substack.com)
ChatGPT: “An error occurred. If this issue persists...”
Bill Benzon
Dec 7, 2022, 3:41 PM
5
points
11
comments
3
min read
LW
link
Take 6: CAIS is actually Orwellian.
Charlie Steiner
Dec 7, 2022, 1:50 PM
12
points
8
comments
2
min read
LW
link
Peter Thiel on Technological Stagnation and Out of Touch Rationalists
Matt Goldenberg
Dec 7, 2022, 1:15 PM
9
points
26
comments
1
min read
LW
link
(youtu.be)
[Link] Wavefunctions: from Linear Algebra to Spinors
sen
Dec 7, 2022, 12:44 PM
11
points
12
comments
1
min read
LW
link
(paperclip.substack.com)
Why I like Zulip instead of Slack or Discord
Alok Singh
Dec 7, 2022, 9:28 AM
31
points
10
comments
1
min read
LW
link
Bioweapons, and ChatGPT (another vulnerability story)
Beeblebrox
Dec 7, 2022, 7:27 AM
−5
points
0
comments
2
min read
LW
link
Where to be an AI Safety Professor
scasper
Dec 7, 2022, 7:09 AM
31
points
12
comments
2
min read
LW
link
[Question]
Are there any tools to convert LW sequences to PDF or any other file format?
quetzal_rainbow
Dec 7, 2022, 5:28 AM
2
points
2
comments
1
min read
LW
link
Manifold Markets community meetup
Sinclair Chen
Dec 7, 2022, 3:25 AM
4
points
0
comments
1
min read
LW
link
“Attention Passengers”: not for Signs
jefftk
Dec 7, 2022, 2:00 AM
27
points
10
comments
1
min read
LW
link
(www.jefftk.com)
[ASoT] Probability Infects Concepts it Touches
Ulisse Mini
Dec 7, 2022, 1:48 AM
10
points
4
comments
1
min read
LW
link
Simple Way to Prevent Power-Seeking AI
research_prime_space
Dec 7, 2022, 12:26 AM
12
points
1
comment
1
min read
LW
link
In defense of probably wrong mechanistic models
evhub
Dec 6, 2022, 11:24 PM
55
points
10
comments
2
min read
LW
link
AI Safety in a Vulnerable World: Requesting Feedback on Preliminary Thoughts
Jordan Arel
Dec 6, 2022, 10:35 PM
4
points
2
comments
3
min read
LW
link
ChatGPT and the Human Race
Ben Reilly
Dec 6, 2022, 9:38 PM
6
points
1
comment
3
min read
LW
link
[Question]
How do finite factored sets compare with phase space?
Alex_Altair
Dec 6, 2022, 8:05 PM
15
points
1
comment
1
min read
LW
link
Mesa-Optimizers via Grokking
orthonormal
Dec 6, 2022, 8:05 PM
36
points
4
comments
6
min read
LW
link
Using GPT-Eliezer against ChatGPT Jailbreaking
Stuart_Armstrong
and
rgorman
Dec 6, 2022, 7:54 PM
170
points
85
comments
9
min read
LW
link
The Parable of the Crimp
Phosphorous
Dec 6, 2022, 6:41 PM
11
points
3
comments
3
min read
LW
link
The Categorical Imperative Obscures
Gordon Seidoh Worley
Dec 6, 2022, 5:48 PM
17
points
17
comments
2
min read
LW
link
MIRI’s “Death with Dignity” in 60 seconds.
Cleo Nardo
Dec 6, 2022, 5:18 PM
58
points
4
comments
1
min read
LW
link
Things roll downhill
awenonian
Dec 6, 2022, 3:27 PM
19
points
0
comments
1
min read
LW
link
EA & LW Forums Weekly Summary (28th Nov − 4th Dec 22′)
Zoe Williams
Dec 6, 2022, 9:38 AM
10
points
1
comment
LW
link
Take 5: Another problem for natural abstractions is laziness.
Charlie Steiner
Dec 6, 2022, 7:00 AM
31
points
4
comments
3
min read
LW
link
Verification Is Not Easier Than Generation In General
johnswentworth
Dec 6, 2022, 5:20 AM
73
points
27
comments
1
min read
LW
link
Shh, don’t tell the AI it’s likely to be evil
naterush
Dec 6, 2022, 3:35 AM
19
points
9
comments
1
min read
LW
link
[Question]
What are the major underlying divisions in AI safety?
Chris_Leong
Dec 6, 2022, 3:28 AM
5
points
2
comments
1
min read
LW
link
[Link] Why I’m optimistic about OpenAI’s alignment approach
janleike
Dec 5, 2022, 10:51 PM
98
points
15
comments
1
min read
LW
link
(aligned.substack.com)
The No Free Lunch theorem for dummies
Steven Byrnes
Dec 5, 2022, 9:46 PM
37
points
16
comments
3
min read
LW
link
ChatGPT and Ideological Turing Test
Viliam
Dec 5, 2022, 9:45 PM
42
points
1
comment
1
min read
LW
link
ChatGPT on Spielberg’s A.I. and AI Alignment
Bill Benzon
Dec 5, 2022, 9:10 PM
5
points
0
comments
4
min read
LW
link
Updating my AI timelines
Matthew Barnett
Dec 5, 2022, 8:46 PM
145
points
50
comments
2
min read
LW
link
Steering Behaviour: Testing for (Non-)Myopia in Language Models
Evan R. Murphy
and
Megan Kinniment
Dec 5, 2022, 8:28 PM
40
points
19
comments
10
min read
LW
link
College Admissions as a Brutal One-Shot Game
devansh
Dec 5, 2022, 8:05 PM
8
points
26
comments
2
min read
LW
link
Analysis of AI Safety surveys for field-building insights
Ash Jafari
Dec 5, 2022, 7:21 PM
11
points
2
comments
5
min read
LW
link
Testing Ways to Bypass ChatGPT’s Safety Features
Robert_AIZI
Dec 5, 2022, 6:50 PM
7
points
4
comments
5
min read
LW
link
(aizi.substack.com)
Foresight for AGI Safety Strategy: Mitigating Risks and Identifying Golden Opportunities
jacquesthibs
Dec 5, 2022, 4:09 PM
28
points
6
comments
8
min read
LW
link
Aligned Behavior is not Evidence of Alignment Past a Certain Level of Intelligence
Ronny Fernandez
Dec 5, 2022, 3:19 PM
19
points
5
comments
7
min read
LW
link
[Question]
How should I judge the impact of giving $5k to a family of three kids and two mentally ill parents?
Blake
5 Dec 2022 13:42 UTC
10
points
10
comments
1
min read
LW
link
Is the “Valley of Confused Abstractions” real?
jacquesthibs
5 Dec 2022 13:36 UTC
20
points
11
comments
2
min read
LW
link
Take 4: One problem with natural abstractions is there’s too many of them.
Charlie Steiner
5 Dec 2022 10:39 UTC
37
points
4
comments
1
min read
LW
link
[Question]
What are some good Lesswrong-related accounts or hashtags on Mastodon that I should follow?
SpectrumDT
5 Dec 2022 9:42 UTC
2
points
0
comments
1
min read
LW
link
[Question]
Who are some prominent reasonable people who are confident that AI won’t kill everyone?
Optimization Process
5 Dec 2022 9:12 UTC
72
points
54
comments
1
min read
LW
link
Monthly Shorts 11/22
Celer
5 Dec 2022 7:30 UTC
8
points
0
comments
3
min read
LW
link
(keller.substack.com)
A ChatGPT story about ChatGPT doom
SurfingOrca
5 Dec 2022 5:40 UTC
6
points
2
comments
4
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel