Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Instantiating an agent with GPT-4 and text-davinci-003
Max H
Mar 19, 2023, 11:57 PM
13
points
3
comments
32
min read
LW
link
Can This Idea Dramatically Improve Effective Vegan Activism?
NothingIsArt
Mar 19, 2023, 11:39 PM
−5
points
2
comments
1
min read
LW
link
Value Pluralism and AI
Göran Crafte
Mar 19, 2023, 11:38 PM
8
points
4
comments
2
min read
LW
link
Tabooing “Frame Control”
Raemon
Mar 19, 2023, 11:33 PM
66
points
41
comments
10
min read
LW
link
High Status Eschews Quantification of Performance
niplav
Mar 19, 2023, 10:14 PM
128
points
36
comments
5
min read
LW
link
The Hidden Complexity of Thought
Isaac King
Mar 19, 2023, 9:59 PM
15
points
3
comments
3
min read
LW
link
(outsidetheasylum.blog)
[Question]
“Wide” vs “Tall” superintelligence
Templarrr
Mar 19, 2023, 7:23 PM
15
points
8
comments
1
min read
LW
link
Humanity’s Lack of Unity Will Lead to AGI Catastrophe
MiguelDev
Mar 19, 2023, 7:18 PM
3
points
2
comments
4
min read
LW
link
Probabilistic Payor Lemma?
abramdemski
Mar 19, 2023, 5:57 PM
69
points
7
comments
4
min read
LW
link
AGI is uncontrollable, alignment is impossible
Donatas Lučiūnas
Mar 19, 2023, 5:49 PM
−12
points
21
comments
1
min read
LW
link
Playbook for the Great Divergence
intellectronica
Mar 19, 2023, 5:42 PM
14
points
0
comments
3
min read
LW
link
(www.intellectronica.net)
How AI could workaround goals if rated by people
ProgramCrafter
Mar 19, 2023, 3:51 PM
1
point
1
comment
1
min read
LW
link
[Question]
GPT-4 and ASCII Images?
carterallen
Mar 19, 2023, 3:46 PM
10
points
17
comments
1
min read
LW
link
A tension between two prosaic alignment subgoals
Alex Lawsen
Mar 19, 2023, 2:07 PM
31
points
8
comments
1
min read
LW
link
Shell games
TsviBT
Mar 19, 2023, 10:43 AM
93
points
9
comments
4
min read
LW
link
1
review
Self-censorship is probably bad for epistemology. Maybe we should figure out a way to avoid it?
DaemonicSigil
Mar 19, 2023, 9:04 AM
7
points
1
comment
3
min read
LW
link
Mahler 6 at the San Francisco Symphony
yakimoff
Mar 19, 2023, 4:06 AM
1
point
0
comments
1
min read
LW
link
Feature proposal: integrate LessWrong with ChatGPT to promote active reading
DirectedEvolution
Mar 19, 2023, 3:41 AM
10
points
4
comments
1
min read
LW
link
Against Deep Ideas
YafahEdelman
Mar 19, 2023, 3:04 AM
53
points
14
comments
2
min read
LW
link
More information about the dangerous capability evaluations we did with GPT-4 and Claude.
Beth Barnes
Mar 19, 2023, 12:25 AM
233
points
54
comments
8
min read
LW
link
(evals.alignment.org)
Cryonics companies should let people make conditions for reawakening
Andrew Vlahos
Mar 18, 2023, 9:03 PM
10
points
11
comments
4
min read
LW
link
“Publish or Perish” (a quick note on why you should try to make your work legible to existing academic communities)
David Scott Krueger (formerly: capybaralet)
Mar 18, 2023, 7:01 PM
112
points
49
comments
1
min read
LW
link
1
review
Dan Luu on “You can only communicate one top priority”
Raemon
Mar 18, 2023, 6:55 PM
149
points
18
comments
3
min read
LW
link
(twitter.com)
An Appeal to AI Superintelligence: Reasons to Preserve Humanity
James_Miller
Mar 18, 2023, 4:22 PM
41
points
73
comments
12
min read
LW
link
[Question]
What did you do with GPT4?
ChristianKl
Mar 18, 2023, 3:21 PM
27
points
17
comments
1
min read
LW
link
Try to solve the hard parts of the alignment problem
Mikhail Samin
Mar 18, 2023, 2:55 PM
54
points
33
comments
5
min read
LW
link
Testing ChatGPT 3.5 for political biases using roleplaying prompts
twkaiser
Mar 18, 2023, 11:42 AM
−2
points
2
comments
19
min read
LW
link
(hackernoon.com)
What I did to reduce the risk of Long COVID (and manage symptoms) after getting COVID
Sameerishere
Mar 18, 2023, 5:32 AM
11
points
3
comments
10
min read
LW
link
(retired article) AGI With Internet Access: Why we won’t stuff the genie back in its bottle.
Max TK
Mar 18, 2023, 3:43 AM
5
points
10
comments
4
min read
LW
link
St. Patty’s Day LA meetup
lc
Mar 18, 2023, 12:00 AM
8
points
0
comments
1
min read
LW
link
[Question]
Why Carl Jung is not popular in AI Alignment Research?
MiguelDev
Mar 17, 2023, 11:56 PM
−3
points
13
comments
1
min read
LW
link
[Event] Join Metaculus for Forecast Friday on March 24th!
ChristianWilliams
Mar 17, 2023, 10:47 PM
3
points
0
comments
LW
link
Meetup Tip: The Next Meetup Will Be. . .
Screwtape
Mar 17, 2023, 10:04 PM
44
points
0
comments
3
min read
LW
link
The Power of High Speed Stupidity
robotelvis
Mar 17, 2023, 9:41 PM
33
points
6
comments
9
min read
LW
link
1
review
(messyprogress.substack.com)
Retrospective on ‘GPT-4 Predictions’ After the Release of GPT-4
Stephen McAleese
Mar 17, 2023, 6:34 PM
26
points
6
comments
6
min read
LW
link
“Carefully Bootstrapped Alignment” is organizationally hard
Raemon
Mar 17, 2023, 6:00 PM
262
points
23
comments
11
min read
LW
link
1
review
[Question]
Are nested jailbreaks inevitable?
judson
Mar 17, 2023, 5:43 PM
1
point
0
comments
1
min read
LW
link
Ethical AI investments?
Justin wilson
Mar 17, 2023, 5:43 PM
24
points
15
comments
1
min read
LW
link
New economic system for AI era
ksme sho
Mar 17, 2023, 5:42 PM
−1
points
1
comment
5
min read
LW
link
On some first principles of intelligence
Macheng_Shen
Mar 17, 2023, 5:42 PM
−14
points
0
comments
4
min read
LW
link
Essential Behaviorism Terms
Rivka
Mar 17, 2023, 5:41 PM
15
points
1
comment
10
min read
LW
link
Vector semantics and “Kubla Khan,” Part 2
Bill Benzon
Mar 17, 2023, 4:32 PM
2
points
0
comments
3
min read
LW
link
Super-Luigi = Luigi + (Luigi—Waluigi)
Alexei
Mar 17, 2023, 3:27 PM
16
points
9
comments
1
min read
LW
link
Survey on intermediate goals in AI governance
MichaelA
and
MaxRa
Mar 17, 2023, 1:12 PM
25
points
3
comments
1
min read
LW
link
GPT-4 solves Gary Marcus-induced flubs
JakubK
Mar 17, 2023, 6:40 AM
56
points
29
comments
2
min read
LW
link
(docs.google.com)
[Question]
Are the LLM “intelligence” tests publicly available for humans to take?
nim
Mar 17, 2023, 12:09 AM
7
points
12
comments
1
min read
LW
link
Donation offsets for ChatGPT Plus subscriptions
Jeffrey Ladish
Mar 16, 2023, 11:29 PM
53
points
3
comments
3
min read
LW
link
The algorithm isn’t doing X, it’s just doing Y.
Cleo Nardo
Mar 16, 2023, 11:28 PM
53
points
43
comments
5
min read
LW
link
Announcing the ERA Cambridge Summer Research Fellowship
Nandini Shiralkar
Mar 16, 2023, 10:57 PM
11
points
0
comments
3
min read
LW
link
Gradual takeoff, fast failure
Max H
Mar 16, 2023, 10:02 PM
15
points
4
comments
5
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel