Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Believe in Yourself and don’t stop Improving
Johannes C. Mayer
Apr 25, 2023, 10:34 PM
0
points
0
comments
1
min read
LW
link
Should LW have an official list of norms?
Ruby
Apr 25, 2023, 9:20 PM
58
points
31
comments
5
min read
LW
link
Implementing a Transformer from scratch in PyTorch—a write-up on my experience
Mislav Jurić
Apr 25, 2023, 8:51 PM
20
points
0
comments
10
min read
LW
link
Exploring the Lottery Ticket Hypothesis
Rauno Arike
Apr 25, 2023, 8:06 PM
58
points
3
comments
11
min read
LW
link
Genetic Sequencing of Wastewater: Prevalence to Relative Abundance
jefftk
Apr 25, 2023, 7:30 PM
17
points
2
comments
2
min read
LW
link
(www.jefftk.com)
[Feedback please] New User’s Guide to LessWrong
Ruby
Apr 25, 2023, 6:54 PM
38
points
18
comments
6
min read
LW
link
Reframing the burden of proof: Companies should prove that models are safe (rather than expecting auditors to prove that models are dangerous)
Orpheus16
Apr 25, 2023, 6:49 PM
27
points
11
comments
3
min read
LW
link
(childrenoficarus.substack.com)
LLMs for online discussion moderation
Dave Lindbergh
Apr 25, 2023, 4:53 PM
12
points
3
comments
3
min read
LW
link
AI Safety Newsletter #3: AI policy proposals and a new challenger approaches
ozhang
Apr 25, 2023, 4:15 PM
33
points
0
comments
LW
link
EA might systematically generate a scarcity mindset that produces low-integrity actors
Severin T. Seehrich
Apr 25, 2023, 3:50 PM
26
points
2
comments
LW
link
Max Tegmark’s new Time article on how we’re in a Don’t Look Up scenario [Linkpost]
Jonas Hallgren
Apr 25, 2023, 3:41 PM
39
points
9
comments
1
min read
LW
link
(time.com)
WHO Biological Risk warning
Jonas Kgomo
Apr 25, 2023, 3:10 PM
−6
points
2
comments
1
min read
LW
link
A Rant on Calculus III
Wofsen
Apr 25, 2023, 2:51 PM
−5
points
2
comments
1
min read
LW
link
Briefly how I’ve updated since ChatGPT
rime
Apr 25, 2023, 2:47 PM
48
points
2
comments
2
min read
LW
link
Discuss AI Policy Recommendations
Giles
Apr 25, 2023, 2:21 PM
8
points
0
comments
1
min read
LW
link
Explaining the Transformer Circuits Framework by Example
Felix Hofstätter
Apr 25, 2023, 1:45 PM
8
points
0
comments
15
min read
LW
link
Notes on Potential Future AI Tax Policy
Zvi
Apr 25, 2023, 1:30 PM
33
points
6
comments
9
min read
LW
link
(thezvi.wordpress.com)
Sentience in Silicon: The Challenges of AI Consciousness
Hannes Thurnherr
Apr 25, 2023, 1:15 PM
5
points
2
comments
5
min read
LW
link
Paths to failure
Karl von Wendt
and
mespa
Apr 25, 2023, 8:03 AM
29
points
1
comment
8
min read
LW
link
My Assessment of the Chinese AI Safety Community
Lao Mein
Apr 25, 2023, 4:21 AM
252
points
94
comments
3
min read
LW
link
Making Nanobots isn’t a one-shot process, even for an artificial superintelligance
dankrad
Apr 25, 2023, 12:39 AM
20
points
13
comments
6
min read
LW
link
Mental Models Of People Can Be People
Nox ML
Apr 25, 2023, 12:03 AM
14
points
55
comments
8
min read
LW
link
Progress links and tweets, 2023-04-24
jasoncrawford
Apr 24, 2023, 9:17 PM
16
points
1
comment
2
min read
LW
link
(rootsofprogress.org)
Ideas for AI labs: Reading list
Zach Stein-Perlman
Apr 24, 2023, 7:00 PM
11
points
0
comments
4
min read
LW
link
Deep learning models might be secretly (almost) linear
beren
Apr 24, 2023, 6:43 PM
117
points
29
comments
4
min read
LW
link
Subjective AI/ML Digest: April II
Boris T
Apr 24, 2023, 6:33 PM
1
point
0
comments
1
min read
LW
link
(borisagain.substack.com)
The Toxoplasma of AGI Doom and Capabilities?
Robert_AIZI
Apr 24, 2023, 6:11 PM
72
points
12
comments
1
min read
LW
link
[Question]
Measures of Internet Virality and News Popularity
T431
Apr 24, 2023, 5:43 PM
4
points
4
comments
1
min read
LW
link
A concise sum-up of the basic argument for AI doom
Mergimio H. Doefevmil
Apr 24, 2023, 5:37 PM
11
points
6
comments
2
min read
LW
link
A response to Conjecture’s CoEm proposal
Kristian Freed
Apr 24, 2023, 5:23 PM
7
points
0
comments
4
min read
LW
link
Camaraderie at scale: in search of shared identity
eq
Apr 24, 2023, 4:46 PM
8
points
2
comments
8
min read
LW
link
A Hypothetical Takeover Scenario Twitter Poll
Zvi
Apr 24, 2023, 2:00 PM
54
points
9
comments
17
min read
LW
link
(thezvi.wordpress.com)
Cape Town, South Africa—ACX Meetups Everywhere “Spring” 2023
moyamo
Apr 24, 2023, 1:37 PM
2
points
0
comments
1
min read
LW
link
Credible, costly, pseudonymity
M. Y. Zuo
Apr 24, 2023, 1:35 PM
1
point
11
comments
1
min read
LW
link
On Artifice and Intelligence
Jonathan Yan
Apr 24, 2023, 1:26 PM
2
points
0
comments
1
min read
LW
link
(medium.com)
AGI ruin mostly rests on strong claims about alignment and deployment, not about society
Rob Bensinger
Apr 24, 2023, 1:06 PM
70
points
8
comments
6
min read
LW
link
For alignment, we should simultaneously use multiple theories of cognition and value
Roman Leventov
Apr 24, 2023, 10:37 AM
23
points
5
comments
5
min read
LW
link
Power laws in Speedrunning and Machine Learning
Jsevillamol
and
Ege Erdil
Apr 24, 2023, 10:06 AM
71
points
1
comment
1
min read
LW
link
(arxiv.org)
[Question]
“User does not meet the requirements to vote”
Monkle
Apr 24, 2023, 9:53 AM
4
points
3
comments
1
min read
LW
link
The Brain is Not Close to Thermodynamic Limits on Computation
DaemonicSigil
Apr 24, 2023, 8:21 AM
167
points
58
comments
5
min read
LW
link
Value Learning – Towards Resolving Confusion
PashaKamyshev
Apr 24, 2023, 6:43 AM
4
points
0
comments
18
min read
LW
link
Summaries of top forum posts (17th − 23rd April 2023)
Zoe Williams
Apr 24, 2023, 4:13 AM
18
points
0
comments
LW
link
Do LLMs dream of emergent sheep?
Shmi
Apr 24, 2023, 3:26 AM
16
points
2
comments
1
min read
LW
link
Not using a priori information for Russian propaganda
EniScien
Apr 24, 2023, 1:14 AM
−5
points
4
comments
1
min read
LW
link
Contra Yudkowsky on AI Doom
jacob_cannell
24 Apr 2023 0:20 UTC
89
points
111
comments
9
min read
LW
link
Consequentialism is in the Stars not Ourselves
DragonGod
24 Apr 2023 0:02 UTC
7
points
19
comments
5
min read
LW
link
When did humans become self-aware?
Derek M. Jones
23 Apr 2023 22:36 UTC
6
points
2
comments
1
min read
LW
link
(vectors.substack.com)
[Question]
Are there AI policies that are robustly net-positive even when considering different AI scenarios?
Noosphere89
23 Apr 2023 21:46 UTC
11
points
1
comment
1
min read
LW
link
Getting Started With Naturalism
LoganStrohl
23 Apr 2023 21:02 UTC
69
points
4
comments
11
min read
LW
link
1
review
[Question]
Why do we care about agency for alignment?
Chris_Leong
23 Apr 2023 18:10 UTC
22
points
19
comments
1
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel