Instantiating an agent with GPT-4 and text-davinci-003
Max H
19 Mar 2023 23:57 UTC
13
points
3
comments
32
min read
LW
link
Can This Idea Dramatically Improve Effective Vegan Activism?
NothingIsArt
19 Mar 2023 23:39 UTC
−5
points
2
comments
1
min read
LW
link
Value Pluralism and AI
Göran Crafte
19 Mar 2023 23:38 UTC
8
points
4
comments
2
min read
LW
link
Tabooing “Frame Control”
Raemon
19 Mar 2023 23:33 UTC
66
points
41
comments
10
min read
LW
link
High Status Eschews Quantification of Performance
niplav
19 Mar 2023 22:14 UTC
128
points
36
comments
5
min read
LW
link
The Hidden Complexity of Thought
Isaac King
19 Mar 2023 21:59 UTC
15
points
3
comments
3
min read
LW
link
(outsidetheasylum.blog)
[Question]
“Wide” vs “Tall” superintelligence
Templarrr
19 Mar 2023 19:23 UTC
15
points
8
comments
1
min read
LW
link
Humanity’s Lack of Unity Will Lead to AGI Catastrophe
MiguelDev
19 Mar 2023 19:18 UTC
3
points
2
comments
4
min read
LW
link
Probabilistic Payor Lemma?
abramdemski
19 Mar 2023 17:57 UTC
69
points
7
comments
4
min read
LW
link
AGI is uncontrollable, alignment is impossible
Donatas Lučiūnas
19 Mar 2023 17:49 UTC
−12
points
21
comments
1
min read
LW
link
Playbook for the Great Divergence
intellectronica
19 Mar 2023 17:42 UTC
14
points
0
comments
3
min read
LW
link
(www.intellectronica.net)
How AI could workaround goals if rated by people
ProgramCrafter
19 Mar 2023 15:51 UTC
1
point
1
comment
1
min read
LW
link
[Question]
GPT-4 and ASCII Images?
carterallen
19 Mar 2023 15:46 UTC
10
points
17
comments
1
min read
LW
link
A tension between two prosaic alignment subgoals
Alex Lawsen
19 Mar 2023 14:07 UTC
31
points
8
comments
1
min read
LW
link
Shell games
TsviBT
19 Mar 2023 10:43 UTC
96
points
9
comments
4
min read
LW
link
1
review
Self-censorship is probably bad for epistemology. Maybe we should figure out a way to avoid it?
DaemonicSigil
19 Mar 2023 9:04 UTC
7
points
1
comment
3
min read
LW
link
Mahler 6 at the San Francisco Symphony
yakimoff
19 Mar 2023 4:06 UTC
1
point
0
comments
1
min read
LW
link
Feature proposal: integrate LessWrong with ChatGPT to promote active reading
DirectedEvolution
19 Mar 2023 3:41 UTC
10
points
4
comments
1
min read
LW
link
Against Deep Ideas
YafahEdelman
19 Mar 2023 3:04 UTC
53
points
14
comments
2
min read
LW
link
More information about the dangerous capability evaluations we did with GPT-4 and Claude.
Beth Barnes
19 Mar 2023 0:25 UTC
233
points
54
comments
8
min read
LW
link
(evals.alignment.org)
Cryonics companies should let people make conditions for reawakening
Andrew Vlahos
18 Mar 2023 21:03 UTC
10
points
11
comments
4
min read
LW
link
“Publish or Perish” (a quick note on why you should try to make your work legible to existing academic communities)
David Scott Krueger (formerly: capybaralet)
18 Mar 2023 19:01 UTC
112
points
49
comments
1
min read
LW
link
1
review
Dan Luu on “You can only communicate one top priority”
Raemon
18 Mar 2023 18:55 UTC
149
points
18
comments
3
min read
LW
link
(twitter.com)
An Appeal to AI Superintelligence: Reasons to Preserve Humanity
James_Miller
18 Mar 2023 16:22 UTC
43
points
74
comments
12
min read
LW
link
[Question]
What did you do with GPT4?
ChristianKl
18 Mar 2023 15:21 UTC
27
points
17
comments
1
min read
LW
link
Try to solve the hard parts of the alignment problem
Mikhail Samin
18 Mar 2023 14:55 UTC
54
points
33
comments
5
min read
LW
link
Testing ChatGPT 3.5 for political biases using roleplaying prompts
twkaiser
18 Mar 2023 11:42 UTC
−2
points
2
comments
19
min read
LW
link
(hackernoon.com)
What I did to reduce the risk of Long COVID (and manage symptoms) after getting COVID
Sameerishere
18 Mar 2023 5:32 UTC
11
points
3
comments
10
min read
LW
link
(retired article) AGI With Internet Access: Why we won’t stuff the genie back in its bottle.
Max TK
18 Mar 2023 3:43 UTC
5
points
10
comments
4
min read
LW
link
St. Patty’s Day LA meetup
lc
18 Mar 2023 0:00 UTC
8
points
0
comments
1
min read
LW
link
[Question]
Why Carl Jung is not popular in AI Alignment Research?
MiguelDev
17 Mar 2023 23:56 UTC
−3
points
13
comments
1
min read
LW
link
[Event] Join Metaculus for Forecast Friday on March 24th!
ChristianWilliams
17 Mar 2023 22:47 UTC
3
points
0
comments
1
min read
LW
link
(www.eventbrite.com)
Meetup Tip: The Next Meetup Will Be. . .
Screwtape
17 Mar 2023 22:04 UTC
45
points
0
comments
3
min read
LW
link
The Power of High Speed Stupidity
robotelvis
17 Mar 2023 21:41 UTC
33
points
6
comments
9
min read
LW
link
1
review
(messyprogress.substack.com)
Retrospective on ‘GPT-4 Predictions’ After the Release of GPT-4
Stephen McAleese
17 Mar 2023 18:34 UTC
26
points
6
comments
6
min read
LW
link
“Carefully Bootstrapped Alignment” is organizationally hard
Raemon
17 Mar 2023 18:00 UTC
266
points
23
comments
11
min read
LW
link
1
review
[Question]
Are nested jailbreaks inevitable?
judson
17 Mar 2023 17:43 UTC
1
point
0
comments
1
min read
LW
link
Ethical AI investments?
Justin wilson
17 Mar 2023 17:43 UTC
25
points
15
comments
1
min read
LW
link
New economic system for AI era
ksme sho
17 Mar 2023 17:42 UTC
−1
points
1
comment
5
min read
LW
link
On some first principles of intelligence
Macheng_Shen
17 Mar 2023 17:42 UTC
−14
points
0
comments
4
min read
LW
link
Essential Behaviorism Terms
Rivka
17 Mar 2023 17:41 UTC
16
points
1
comment
10
min read
LW
link
Vector semantics and “Kubla Khan,” Part 2
Bill Benzon
17 Mar 2023 16:32 UTC
2
points
0
comments
3
min read
LW
link
Super-Luigi = Luigi + (Luigi - Waluigi)
Alexei
17 Mar 2023 15:27 UTC
16
points
9
comments
1
min read
LW
link
Survey on intermediate goals in AI governance
MichaelA
and
MaxRa
17 Mar 2023 13:12 UTC
25
points
3
comments
1
min read
LW
link
GPT-4 solves Gary Marcus-induced flubs
JakubK
17 Mar 2023 6:40 UTC
57
points
29
comments
2
min read
LW
link
(docs.google.com)
[Question]
Are the LLM “intelligence” tests publicly available for humans to take?
nim
17 Mar 2023 0:09 UTC
7
points
13
comments
1
min read
LW
link
Donation offsets for ChatGPT Plus subscriptions
Jeffrey Ladish
16 Mar 2023 23:29 UTC
53
points
3
comments
3
min read
LW
link
The algorithm isn’t doing X, it’s just doing Y.
Cleo Nardo
16 Mar 2023 23:28 UTC
53
points
43
comments
5
min read
LW
link
Announcing the ERA Cambridge Summer Research Fellowship
Nandini Shiralkar
16 Mar 2023 22:57 UTC
11
points
0
comments
3
min read
LW
link
Gradual takeoff, fast failure
Max H
16 Mar 2023 22:02 UTC
15
points
4
comments
5
min read
LW
link