Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
List of requests for an AI slowdown/halt.
Cleo Nardo
Apr 14, 2023, 11:55 PM
46
points
6
comments
1
min read
LW
link
[linkpost] “What Are Reasonable AI Fears?” by Robin Hanson, 2023-04-23
Arjun Panickssery
Apr 14, 2023, 11:26 PM
26
points
16
comments
LW
link
“Do X because decision theory” ~= “Do X because bayes theorem”
lc
Apr 14, 2023, 8:57 PM
39
points
1
comment
2
min read
LW
link
LLMs and hallucination, like white on rice?
Bill Benzon
Apr 14, 2023, 7:53 PM
5
points
0
comments
3
min read
LW
link
GPT-4 is easily controlled/exploited with tricky decision theoretic dilemmas.
scasper
Apr 14, 2023, 7:39 PM
6
points
4
comments
2
min read
LW
link
On Caring about our AI Progeny
PeterMcCluskey
Apr 14, 2023, 7:32 PM
22
points
5
comments
1
min read
LW
link
(bayesianinvestor.com)
Moderation notes re: recent Said/Duncan threads
Raemon
Apr 14, 2023, 6:06 PM
50
points
560
comments
2
min read
LW
link
What we’ve learned so far from our technological temptations project
Richard Korzekwa
Apr 14, 2023, 5:46 PM
15
points
4
comments
11
min read
LW
link
(aiimpacts.org)
[Question]
How does consciousness interact with architecture?
FinalFormal2
Apr 14, 2023, 3:56 PM
5
points
3
comments
1
min read
LW
link
Iqisa: A Library For Handling Forecasting Datasets
niplav
Apr 14, 2023, 3:16 PM
27
points
0
comments
LW
link
What’s this probability you’re reporting?
EOC
and
SCP
Apr 14, 2023, 3:07 PM
19
points
10
comments
3
min read
LW
link
Navigating AI Risks (NAIR) #1: Slowing Down AI
simeon_c
Apr 14, 2023, 2:35 PM
11
points
3
comments
1
min read
LW
link
(navigatingairisks.substack.com)
[Question]
What would the FLI moratorium actually do?
ChristianKl
Apr 14, 2023, 1:14 PM
17
points
7
comments
1
min read
LW
link
Research Report: Incorrectness Cascades
Robert_AIZI
Apr 14, 2023, 12:49 PM
19
points
0
comments
10
min read
LW
link
(aizi.substack.com)
The self-unalignment problem
Jan_Kulveit
and
rosehadshar
Apr 14, 2023, 12:10 PM
155
points
24
comments
10
min read
LW
link
AI Safety Europe Retreat 2023 Retrospective
Magdalena Wache
Apr 14, 2023, 9:05 AM
43
points
0
comments
2
min read
LW
link
[Question]
What’s the difference between Wisdom and Rationality?
Yoav Ravid
Apr 14, 2023, 6:22 AM
8
points
4
comments
1
min read
LW
link
Shapley Value Attribution in Chain of Thought
leogao
Apr 14, 2023, 5:56 AM
106
points
7
comments
4
min read
LW
link
A freshman year during the AI midgame: my approach to the next year
Buck
Apr 14, 2023, 12:38 AM
154
points
15
comments
LW
link
1
review
Against AI Understanding and Sentience: Large Language Models, Meaning, and the Patterns of Human Language Use
Jonathan Yan
Apr 13, 2023, 11:29 PM
−1
points
0
comments
1
min read
LW
link
(philsci-archive.pitt.edu)
Financial Times: We must slow down the race to God-like AI
trevor
Apr 13, 2023, 7:55 PM
113
points
17
comments
16
min read
LW
link
(www.ft.com)
R0 Is Not Counterfactual
jefftk
Apr 13, 2023, 7:50 PM
33
points
9
comments
2
min read
LW
link
(www.jefftk.com)
Subscripts for Probabilities
niplav
Apr 13, 2023, 6:32 PM
67
points
9
comments
5
min read
LW
link
The Virus—Short Story
Michael Soareverix
Apr 13, 2023, 6:18 PM
4
points
0
comments
4
min read
LW
link
First ACX Brno Meetup
adekcz
Apr 13, 2023, 5:42 PM
2
points
0
comments
1
min read
LW
link
Polluting the agentic commons
hamandcheese
Apr 13, 2023, 5:42 PM
7
points
4
comments
2
min read
LW
link
(www.secondbest.ca)
Cambridge LW Meetup: When Science Isn’t Enough
Tony Wang
and
Darmani
Apr 13, 2023, 5:36 PM
2
points
0
comments
1
min read
LW
link
Even if human & AI alignment are just as easy, we are screwed
Matthew_Opitz
Apr 13, 2023, 5:32 PM
35
points
5
comments
5
min read
LW
link
Intro to Ontogenetic Curriculum
Eris
Apr 13, 2023, 5:15 PM
20
points
1
comment
2
min read
LW
link
Was Homer a stochastic parrot? Meaning in literary texts and LLMs
Bill Benzon
Apr 13, 2023, 4:44 PM
7
points
4
comments
3
min read
LW
link
AI #7: Free Agency
Zvi
Apr 13, 2023, 4:20 PM
33
points
12
comments
47
min read
LW
link
(thezvi.wordpress.com)
Navigating the Open-Source AI Landscape: Data, Funding, and Safety
André Ferretti
and
mic
Apr 13, 2023, 3:29 PM
32
points
7
comments
11
min read
LW
link
(forum.effectivealtruism.org)
On AutoGPT
Zvi
Apr 13, 2023, 12:30 PM
248
points
47
comments
20
min read
LW
link
(thezvi.wordpress.com)
Identifying semantic neurons, mechanistic circuits & interpretability web apps
Esben Kran
and
Neel Nanda
Apr 13, 2023, 11:59 AM
18
points
0
comments
8
min read
LW
link
Trying AgentGPT, an AutoGPT variant
Gunnar_Zarncke
Apr 13, 2023, 10:13 AM
10
points
9
comments
1
min read
LW
link
Announcing Epoch’s dashboard of key trends and figures in Machine Learning
Jsevillamol
Apr 13, 2023, 7:33 AM
35
points
7
comments
1
min read
LW
link
(epochai.org)
[Question]
What is the best source to explain short AI timelines to a skeptical person?
trevor
Apr 13, 2023, 4:29 AM
12
points
12
comments
1
min read
LW
link
“Aligned” foundation models don’t imply aligned systems
Max H
Apr 13, 2023, 4:13 AM
39
points
11
comments
5
min read
LW
link
[Question]
Using ChatGPT for memory reconsolidation?
warrenjordan
Apr 13, 2023, 1:27 AM
3
points
2
comments
1
min read
LW
link
Independence Dividends
jefftk
Apr 13, 2023, 1:20 AM
35
points
11
comments
1
min read
LW
link
(www.jefftk.com)
AI x-risk, approximately ordered by embarrassment
Alex Lawsen
Apr 12, 2023, 11:01 PM
151
points
7
comments
19
min read
LW
link
AXRP Episode 20 - ‘Reform’ AI Alignment with Scott Aaronson
DanielFilan
Apr 12, 2023, 9:30 PM
22
points
2
comments
68
min read
LW
link
Apply to >30 AI safety funders in one application with the Nonlinear Network
KatWoods
,
Emerson Spartz
and
Drew Spartz
12 Apr 2023 21:23 UTC
65
points
12
comments
2
min read
LW
link
AGI goal space is big, but narrowing might not be as hard as it seems.
Jacy Reese Anthis
12 Apr 2023 19:03 UTC
15
points
0
comments
3
min read
LW
link
Natural language alignment
Jacy Reese Anthis
12 Apr 2023 19:02 UTC
31
points
2
comments
2
min read
LW
link
Repugnant levels of violins
Solenoid_Entity
12 Apr 2023 17:11 UTC
73
points
10
comments
12
min read
LW
link
Progress links and tweets, 2023-04-12
jasoncrawford
12 Apr 2023 16:52 UTC
8
points
2
comments
1
min read
LW
link
(rootsofprogress.org)
A basic mathematical structure of intelligence
Golol
12 Apr 2023 16:49 UTC
4
points
6
comments
4
min read
LW
link
[Question]
Should AutoGPT update us towards researching IDA?
Michaël Trazzi
12 Apr 2023 16:41 UTC
15
points
5
comments
1
min read
LW
link
Boxing lessons
yakimoff
12 Apr 2023 16:19 UTC
1
point
0
comments
1
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel