[Question]
What are the known difficulties with this alignment approach?
tailcalled
Feb 11, 2024, 10:52 PM
18
points
24
comments
1
min read
LW
link
[Question]
What are the deciding factors of human cognitive endurance?
koratkar
Feb 11, 2024, 9:56 PM
22
points
3
comments
1
min read
LW
link
Carl Shulman On Dwarkesh Podcast June 2023
Moonicker
Feb 11, 2024, 9:02 PM
18
points
0
comments
159
min read
LW
link
How do you actually obtain and report a likelihood function for scientific research?
Peter Berggren
Feb 11, 2024, 5:42 PM
55
points
4
comments
1
min read
LW
link
The entropy maxim for binary questions
dkl9
Feb 11, 2024, 5:17 PM
2
points
1
comment
1
min read
LW
link
(dkl9.net)
GPT2XL_RLLMv3 vs. BetterDAN, AI Machiavelli & Oppo Jailbreaks
MiguelDev
Feb 11, 2024, 11:03 AM
16
points
4
comments
14
min read
LW
link
[Question]
What’s the theory of impact for activation vectors?
Chris_Leong
Feb 11, 2024, 7:34 AM
61
points
12
comments
1
min read
LW
link
Experimenting With Footboard Piezos
jefftk
Feb 11, 2024, 3:00 AM
11
points
2
comments
2
min read
LW
link
(www.jefftk.com)
The Core Values of Life—A proposal for a universal theory of ethics
Thomas Gjøstøl
Feb 10, 2024, 9:48 PM
2
points
4
comments
18
min read
LW
link
And All the Shoggoths Merely Players
Zack_M_Davis
Feb 10, 2024, 7:56 PM
170
points
57
comments
12
min read
LW
link
Sam Altman’s Chip Ambitions Undercut OpenAI’s Safety Strategy
garrison
Feb 10, 2024, 7:52 PM
198
points
52
comments
LW
link
(garrisonlovely.substack.com)
The lattice of partial updatelessness
Martín Soto
Feb 10, 2024, 5:34 PM
23
points
5
comments
5
min read
LW
link
A Strange ACH Corner Case
jefftk
Feb 10, 2024, 3:00 AM
27
points
2
comments
2
min read
LW
link
(www.jefftk.com)
Dreams of AI alignment: The danger of suggestive names
TurnTrout
Feb 10, 2024, 1:22 AM
103
points
59
comments
4
min read
LW
link
Scenario planning for AI x-risk
Corin Katzke
Feb 10, 2024, 12:14 AM
24
points
12
comments
14
min read
LW
link
(forum.effectivealtruism.org)
Close the Gates to an Inhuman Future: How and why we should choose to not develop superhuman general-purpose artificial intelligence
aaguirre
Feb 9, 2024, 8:25 PM
13
points
0
comments
1
min read
LW
link
(arxiv.org)
[Crosspost] Deep Dive: The Coming Technological Singularity—How to survive in a Post-human Era
simulacra.exe
Feb 9, 2024, 6:49 PM
2
points
2
comments
9
min read
LW
link
The Ideal Speech Situation as a Tool for AI Ethical Reflection: A Framework for Alignment
kenneth myers
Feb 9, 2024, 6:40 PM
6
points
12
comments
3
min read
LW
link
What’s ChatGPT’s Favorite Ice Cream Flavor? An Investigation Into Synthetic Respondents
Greg Robison
Feb 9, 2024, 6:38 PM
19
points
4
comments
15
min read
LW
link
OpenAI wants to raise 5-7 trillion
O O
Feb 9, 2024, 4:15 PM
13
points
29
comments
1
min read
LW
link
(decrypt.co)
[Question]
Constituency-sized AI congress?
Nathan Helm-Burger
Feb 9, 2024, 4:01 PM
11
points
5
comments
1
min read
LW
link
One True Love
Zvi
Feb 9, 2024, 3:10 PM
34
points
7
comments
10
min read
LW
link
(thezvi.wordpress.com)
[Question]
Executive function advice from people who are good at it?
TeaTieAndHat
Feb 9, 2024, 10:11 AM
7
points
1
comment
1
min read
LW
link
[Question]
Do you want to make an AI Alignment song?
Kabir Kumar
Feb 9, 2024, 8:22 AM
4
points
0
comments
1
min read
LW
link
Skills I’d like my collaborators to have
Raemon
Feb 9, 2024, 8:20 AM
106
points
9
comments
8
min read
LW
link
Transfer learning and generalization-qua-capability in Babbage and Davinci (or, why division is better than Spanish)
RP and agg
Feb 9, 2024, 7:00 AM
50
points
6
comments
3
min read
LW
link
Biden-Harris Administration Announces First-Ever Consortium Dedicated to AI Safety
Ben Smith
Feb 9, 2024, 6:40 AM
22
points
0
comments
LW
link
(www.nist.gov)
Running the Numbers on a Heat Pump
jefftk
Feb 9, 2024, 3:00 AM
30
points
12
comments
4
min read
LW
link
(www.jefftk.com)
[Question]
How do high-trust societies form?
Shankar Sivarajan
Feb 9, 2024, 1:11 AM
23
points
17
comments
1
min read
LW
link
[Question]
How do health systems work in adequate worlds?
mukashi
Feb 9, 2024, 12:54 AM
10
points
2
comments
1
min read
LW
link
Twin Cities ACX Meetup—February 2024
Timothy M.
Feb 8, 2024, 11:26 PM
1
point
2
comments
1
min read
LW
link
A review of “Don’t forget the boundary problem...”
jessicata
Feb 8, 2024, 11:19 PM
12
points
1
comment
12
min read
LW
link
(unstablerontology.substack.com)
aintelope project update
Gunnar_Zarncke
Feb 8, 2024, 6:32 PM
24
points
2
comments
3
min read
LW
link
Updatelessness doesn’t solve most problems
Martín Soto
Feb 8, 2024, 5:30 PM
135
points
45
comments
12
min read
LW
link
Predicting Alignment Award Winners Using ChatGPT 4
Shoshannah Tekofsky
Feb 8, 2024, 2:38 PM
16
points
2
comments
11
min read
LW
link
AI #50: The Most Dangerous Thing
Zvi
Feb 8, 2024, 2:30 PM
53
points
4
comments
24
min read
LW
link
(thezvi.wordpress.com)
How to develop a photographic memory 3/3
PhilosophicalSoul
Feb 8, 2024, 9:22 AM
6
points
2
comments
18
min read
LW
link
Believing In
AnnaSalamon
Feb 8, 2024, 7:06 AM
241
points
51
comments
13
min read
LW
link
Measuring pre-peer-review epistemic status
Jakub Smékal
Feb 8, 2024, 5:09 AM
1
point
0
comments
2
min read
LW
link
A Chess-GPT Linear Emergent World Representation
Adam Karvonen
Feb 8, 2024, 4:25 AM
105
points
14
comments
7
min read
LW
link
(adamkarvonen.github.io)
Domestic Production vs International Wealth Creation
100YearPants
Feb 8, 2024, 4:25 AM
1
point
0
comments
1
min read
LW
link
Conditional prediction markets are evidential, not causal
philh
Feb 7, 2024, 9:52 PM
55
points
10
comments
2
min read
LW
link
A Back-Of-The-Envelope Calculation On How Unlikely The Circumstantial Evidence Around Covid-19 Is
Roko
Feb 7, 2024, 9:49 PM
−1
points
36
comments
5
min read
LW
link
Nitric oxide for covid and other viral infections
Elizabeth
Feb 7, 2024, 9:30 PM
39
points
6
comments
6
min read
LW
link
(acesounderglass.com)
Debating with More Persuasive LLMs Leads to More Truthful Answers
Akbir Khan, John Hughes, Dan Valentine, Sam Bowman and Ethan Perez
Feb 7, 2024, 9:28 PM
89
points
14
comments
9
min read
LW
link
(arxiv.org)
[Question]
Choosing a book on causality
martinkunev
Feb 7, 2024, 9:16 PM
4
points
3
comments
1
min read
LW
link
More Hyphenation
Arjun Panickssery
Feb 7, 2024, 7:43 PM
88
points
19
comments
1
min read
LW
link
(arjunpanickssery.substack.com)
Reading writing advice doesn’t make writing easier
Henry Sleight
Feb 7, 2024, 7:14 PM
17
points
0
comments
5
min read
LW
link
(open.substack.com)
[Question]
What’s this 3rd secret directive of evolution called? (survive & spread & ___)
lemonhope
Feb 7, 2024, 2:11 PM
10
points
11
comments
1
min read
LW
link
Training of superintelligence is secretly adversarial
quetzal_rainbow
Feb 7, 2024, 1:38 PM
15
points
2
comments
5
min read
LW
link