SmartyHeaderCode: anomalous tokens for GPT3.5 and GPT-4
AdamYedidia
Apr 15, 2023, 10:35 PM
71
points
18
comments
6
min read
LW
link
Open-source LLMs may prove Bostrom’s vulnerable world hypothesis
Roope Ahvenharju
Apr 15, 2023, 7:16 PM
1
point
1
comment
1
min read
LW
link
[linkpost] Elon Musk plans AI start-up to rival OpenAI
Hatfield
Apr 15, 2023, 7:06 PM
11
points
11
comments
1
min read
LW
link
(www.ft.com)
FLI report: Policymaking in the Pause
Zach Stein-Perlman
Apr 15, 2023, 5:01 PM
15
points
3
comments
1
min read
LW
link
(futureoflife.org)
Reflective journal entries using GPT-4 and Obsidian that demand less willpower.
Solenoid_Entity
Apr 15, 2023, 12:45 PM
56
points
24
comments
7
min read
LW
link
An example elevator pitch for AI doom
laserfiche
Apr 15, 2023, 12:29 PM
2
points
5
comments
1
min read
LW
link
AI as Contact with our Collective Unconscious
Scott Broock
Apr 15, 2023, 2:11 AM
−4
points
6
comments
4
min read
LW
link
The Truth About False
Thoth Hermes
Apr 15, 2023, 1:01 AM
−21
points
4
comments
17
min read
LW
link
(thothhermes.substack.com)
The ‘ petertodd’ phenomenon
mwatkins
Apr 15, 2023, 12:59 AM
192
points
50
comments
38
min read
LW
link
1
review
[Question]
Concave Utility Question
Scott Garrabrant
Apr 15, 2023, 12:14 AM
55
points
36
comments
2
min read
LW
link
List of requests for an AI slowdown/halt.
Cleo Nardo
Apr 14, 2023, 11:55 PM
46
points
6
comments
1
min read
LW
link
[linkpost] “What Are Reasonable AI Fears?” by Robin Hanson, 2023-04-23
Arjun Panickssery
Apr 14, 2023, 11:26 PM
26
points
16
comments
LW
link
“Do X because decision theory” ~= “Do X because bayes theorem”
lc
Apr 14, 2023, 8:57 PM
39
points
1
comment
2
min read
LW
link
LLMs and hallucination, like white on rice?
Bill Benzon
Apr 14, 2023, 7:53 PM
5
points
0
comments
3
min read
LW
link
GPT-4 is easily controlled/exploited with tricky decision theoretic dilemmas.
scasper
Apr 14, 2023, 7:39 PM
6
points
4
comments
2
min read
LW
link
On Caring about our AI Progeny
PeterMcCluskey
Apr 14, 2023, 7:32 PM
22
points
5
comments
1
min read
LW
link
(bayesianinvestor.com)
Moderation notes re: recent Said/Duncan threads
Raemon
Apr 14, 2023, 6:06 PM
50
points
560
comments
2
min read
LW
link
What we’ve learned so far from our technological temptations project
Richard Korzekwa
Apr 14, 2023, 5:46 PM
15
points
4
comments
11
min read
LW
link
(aiimpacts.org)
[Question]
How does consciousness interact with architecture?
FinalFormal2
Apr 14, 2023, 3:56 PM
5
points
3
comments
1
min read
LW
link
Iqisa: A Library For Handling Forecasting Datasets
niplav
Apr 14, 2023, 3:16 PM
27
points
0
comments
LW
link
What’s this probability you’re reporting?
EOC
and
SCP
Apr 14, 2023, 3:07 PM
19
points
10
comments
3
min read
LW
link
Navigating AI Risks (NAIR) #1: Slowing Down AI
simeon_c
Apr 14, 2023, 2:35 PM
11
points
3
comments
1
min read
LW
link
(navigatingairisks.substack.com)
[Question]
What would the FLI moratorium actually do?
ChristianKl
Apr 14, 2023, 1:14 PM
17
points
7
comments
1
min read
LW
link
Research Report: Incorrectness Cascades
Robert_AIZI
Apr 14, 2023, 12:49 PM
19
points
0
comments
10
min read
LW
link
(aizi.substack.com)
The self-unalignment problem
Jan_Kulveit
and
rosehadshar
Apr 14, 2023, 12:10 PM
155
points
24
comments
10
min read
LW
link
AI Safety Europe Retreat 2023 Retrospective
Magdalena Wache
Apr 14, 2023, 9:05 AM
43
points
0
comments
2
min read
LW
link
[Question]
What’s the difference between Wisdom and Rationality?
Yoav Ravid
Apr 14, 2023, 6:22 AM
8
points
4
comments
1
min read
LW
link
Shapley Value Attribution in Chain of Thought
leogao
Apr 14, 2023, 5:56 AM
106
points
7
comments
4
min read
LW
link
A freshman year during the AI midgame: my approach to the next year
Buck
Apr 14, 2023, 12:38 AM
154
points
15
comments
LW
link
1
review
Against AI Understanding and Sentience: Large Language Models, Meaning, and the Patterns of Human Language Use
Jonathan Yan
Apr 13, 2023, 11:29 PM
−1
points
0
comments
1
min read
LW
link
(philsci-archive.pitt.edu)
Financial Times: We must slow down the race to God-like AI
trevor
Apr 13, 2023, 7:55 PM
113
points
17
comments
16
min read
LW
link
(www.ft.com)
R0 Is Not Counterfactual
jefftk
Apr 13, 2023, 7:50 PM
33
points
9
comments
2
min read
LW
link
(www.jefftk.com)
Subscripts for Probabilities
niplav
Apr 13, 2023, 6:32 PM
67
points
9
comments
5
min read
LW
link
The Virus—Short Story
Michael Soareverix
Apr 13, 2023, 6:18 PM
4
points
0
comments
4
min read
LW
link
First ACX Brno Meetup
adekcz
Apr 13, 2023, 5:42 PM
2
points
0
comments
1
min read
LW
link
Polluting the agentic commons
hamandcheese
Apr 13, 2023, 5:42 PM
7
points
4
comments
2
min read
LW
link
(www.secondbest.ca)
Cambridge LW Meetup: When Science Isn’t Enough
Tony Wang
and
Darmani
Apr 13, 2023, 5:36 PM
2
points
0
comments
1
min read
LW
link
Even if human & AI alignment are just as easy, we are screwed
Matthew_Opitz
Apr 13, 2023, 5:32 PM
35
points
5
comments
5
min read
LW
link
Intro to Ontogenetic Curriculum
Eris
Apr 13, 2023, 5:15 PM
20
points
1
comment
2
min read
LW
link
Was Homer a stochastic parrot? Meaning in literary texts and LLMs
Bill Benzon
Apr 13, 2023, 4:44 PM
7
points
4
comments
3
min read
LW
link
AI #7: Free Agency
Zvi
Apr 13, 2023, 4:20 PM
33
points
12
comments
47
min read
LW
link
(thezvi.wordpress.com)
Navigating the Open-Source AI Landscape: Data, Funding, and Safety
André Ferretti
and
mic
Apr 13, 2023, 3:29 PM
32
points
7
comments
11
min read
LW
link
(forum.effectivealtruism.org)
On AutoGPT
Zvi
Apr 13, 2023, 12:30 PM
248
points
47
comments
20
min read
LW
link
(thezvi.wordpress.com)
Identifying semantic neurons, mechanistic circuits & interpretability web apps
Esben Kran
and
Neel Nanda
Apr 13, 2023, 11:59 AM
18
points
0
comments
8
min read
LW
link
Trying AgentGPT, an AutoGPT variant
Gunnar_Zarncke
13 Apr 2023 10:13 UTC
10
points
9
comments
1
min read
LW
link
Announcing Epoch’s dashboard of key trends and figures in Machine Learning
Jsevillamol
13 Apr 2023 7:33 UTC
35
points
7
comments
1
min read
LW
link
(epochai.org)
[Question]
What is the best source to explain short AI timelines to a skeptical person?
trevor
13 Apr 2023 4:29 UTC
12
points
12
comments
1
min read
LW
link
“Aligned” foundation models don’t imply aligned systems
Max H
13 Apr 2023 4:13 UTC
39
points
11
comments
5
min read
LW
link
[Question]
Using ChatGPT for memory reconsolidation?
warrenjordan
13 Apr 2023 1:27 UTC
3
points
2
comments
1
min read
LW
link
Independence Dividends
jefftk
13 Apr 2023 1:20 UTC
35
points
11
comments
1
min read
LW
link
(www.jefftk.com)