AI Safety at the Frontier: Paper Highlights, August ’24
gasteigerjo
Sep 3, 2024, 7:17 PM
28
points
0
comments
6
min read
LW
link
(aisafetyfrontier.substack.com)
The Checklist: What Succeeding at AI Safety Will Involve
Sam Bowman
Sep 3, 2024, 6:18 PM
151
points
49
comments
22
min read
LW
link
(sleepinyourhat.github.io)
Democracy beyond majoritarianism
Arturo Macias
Sep 3, 2024, 3:10 PM
5
points
2
comments
4
min read
LW
link
On the UBI Paper
Zvi
Sep 3, 2024, 2:50 PM
60
points
6
comments
19
min read
LW
link
(thezvi.wordpress.com)
An Opinionated Look at Inference Rules
Gianluca Calcagni
Sep 3, 2024, 1:32 PM
−5
points
2
comments
13
min read
LW
link
Announcing the PIBBSS Symposium ’24!
DusanDNesic
and
clem_acs
Sep 3, 2024, 11:19 AM
19
points
0
comments
3
min read
LW
link
Reducing global AI competition through the Commerce Control List and Immigration reform: a dual-pronged approach
Ben Smith
Sep 3, 2024, 5:28 AM
16
points
2
comments
LW
link
How I got 4.2M YouTube views without making a single video
Closed Limelike Curves
Sep 3, 2024, 3:52 AM
395
points
36
comments
1
min read
LW
link
Duped: AI and the Making of a Global Suicide Cult
izzyness
Sep 2, 2024, 6:51 PM
−8
points
0
comments
1
min read
LW
link
A gentle introduction to sparse autoencoders
Nick Jiang
Sep 2, 2024, 6:11 PM
16
points
1
comment
6
min read
LW
link
What makes math problems hard for reinforcement learning: a case study
Anibal, Bartek, Sergei, Shehper and Piotr
Sep 2, 2024, 6:11 PM
1
point
0
comments
2
min read
LW
link
(arxiv.org)
Survey: How Do Elite Chinese Students Feel About the Risks of AI?
Nick Corvino
Sep 2, 2024, 6:11 PM
141
points
13
comments
10
min read
LW
link
Data-driven donations to help Democrats win federal elections: an update
Michael Cohn
Sep 2, 2024, 4:32 PM
−1
points
2
comments
1
min read
LW
link
(perplexedguide.net)
[Question]
What are the effective utilitarian pros and cons of having children (in rich countries)?
SpectrumDT
Sep 2, 2024, 10:01 AM
2
points
4
comments
1
min read
LW
link
My decomposition of the alignment problem
Daniel C
Sep 2, 2024, 12:21 AM
22
points
22
comments
13
min read
LW
link
DC Forecasting & Prediction Markets Meetup
David Glidden
Sep 2, 2024, 12:00 AM
1
point
0
comments
1
min read
LW
link
A primer on the next generation of antibodies
Abhishaike Mahajan
Sep 1, 2024, 10:37 PM
25
points
0
comments
19
min read
LW
link
(www.owlposting.com)
[Question]
Who looked into extreme nuclear meltdowns?
Remmelt
Sep 1, 2024, 9:38 PM
2
points
8
comments
LW
link
Redundant Attention Heads in Large Language Models For In Context Learning
skunnavakkam
Sep 1, 2024, 8:08 PM
7
points
2
comments
4
min read
LW
link
(skunnavakkam.github.io)
The Role of Transparency and Explainability in Responsible NLP
RAMEBC78
Sep 1, 2024, 8:08 PM
−3
points
1
comment
5
min read
LW
link
Book Review: What Even Is Gender?
Joey Marcellino
Sep 1, 2024, 4:09 PM
31
points
14
comments
12
min read
LW
link
Can a Bayesian Oracle Prevent Harm from an Agent? (Bengio et al. 2024)
mattmacdermott
Sep 1, 2024, 7:46 AM
26
points
0
comments
5
min read
LW
link
(yoshuabengio.org)
Forecasting One-Shot Games
Raemon
Aug 31, 2024, 11:10 PM
48
points
0
comments
7
min read
LW
link
On epistemic autonomy
sanyer
Aug 31, 2024, 6:50 PM
11
points
0
comments
2
min read
LW
link
Epistemic states as a potential benign prior
Tamsin Leake
Aug 31, 2024, 6:26 PM
31
points
2
comments
8
min read
LW
link
(carado.moe)
My Model of Epistemology
adamShimi
Aug 31, 2024, 5:01 PM
37
points
1
comment
8
min read
LW
link
(epistemologicalfascinations.substack.com)
Verification methods for international AI agreements
Orpheus16
Aug 31, 2024, 2:58 PM
14
points
1
comment
4
min read
LW
link
(arxiv.org)
Fake Blog Posts as a Problem Solving Device
silentbob
Aug 31, 2024, 9:22 AM
7
points
0
comments
2
min read
LW
link
Actually Rational & Kind Sequences Reading Group
segfault
Aug 31, 2024, 4:21 AM
−66
points
1
comment
1
min read
LW
link
Anthropic is being sued for copying books to train Claude
Remmelt
Aug 31, 2024, 2:57 AM
20
points
4
comments
2
min read
LW
link
(fingfx.thomsonreuters.com)
Book review: On the Edge
PeterMcCluskey
Aug 30, 2024, 10:18 PM
39
points
0
comments
9
min read
LW
link
(bayesianinvestor.com)
Can Large Language Models effectively identify cybersecurity risks?
emile delcourt
Aug 30, 2024, 8:20 PM
18
points
0
comments
11
min read
LW
link
Singular learning theory: exercises
Zach Furman
Aug 30, 2024, 8:00 PM
90
points
5
comments
14
min read
LW
link
AI for Bio: State Of The Field
sarahconstantin
Aug 30, 2024, 6:00 PM
73
points
2
comments
15
min read
LW
link
(sarahconstantin.substack.com)
Multi-Tiered AI
Timothy Bruneau
Aug 30, 2024, 5:46 PM
1
point
0
comments
2
min read
LW
link
I universally trying to reject the Mind Projection Fallacy—consequences
YanLyutnev
Aug 30, 2024, 5:42 PM
−3
points
0
comments
9
min read
LW
link
AIS terminology proposal: standardize terms for probability ranges
eggsyntax
Aug 30, 2024, 3:43 PM
30
points
12
comments
2
min read
LW
link
[Question]
Does a time-reversible physical law/Cellular Automaton always imply the First Law of Thermodynamics?
Noosphere89
Aug 30, 2024, 3:12 PM
7
points
11
comments
1
min read
LW
link
Principles for the AGI Race
William_S
Aug 30, 2024, 2:29 PM
248
points
17
comments
18
min read
LW
link
Congressional Insider Trading
Maxwell Tabarrok
Aug 30, 2024, 1:32 PM
57
points
6
comments
7
min read
LW
link
(www.maximum-progress.com)
[Question]
Thoughts on paper “How Organisms Come to Know the World: Fundamental Limits on Artificial General Intelligence”?
mikbp
Aug 30, 2024, 9:04 AM
2
points
3
comments
1
min read
LW
link
Are LLMs on the Path to AGI?
Davidmanheim
Aug 30, 2024, 3:14 AM
14
points
2
comments
5
min read
LW
link
Nursing doubts
dynomight
30 Aug 2024 2:25 UTC
144
points
23
comments
9
min read
LW
link
(dynomight.net)
Free Will and Dodging Anvils: AIXI Off-Policy
Cole Wyeth
29 Aug 2024 22:42 UTC
39
points
12
comments
9
min read
LW
link
Seattle USA—ACX Meetups Everywhere Fall 2024
a7x
29 Aug 2024 21:42 UTC
2
points
0
comments
1
min read
LW
link
Rancho Cucamonga USA—ACX Meetups Everywhere Fall 2024
Nelson James Horsley
29 Aug 2024 19:18 UTC
1
point
0
comments
1
min read
LW
link
Reno USA—ACX Meetups Everywhere Fall 2024
Daniel Gold
29 Aug 2024 19:18 UTC
1
point
0
comments
1
min read
LW
link
Tamarindo Costa Rica—ACX Meetups Everywhere Fall 2024
timeless
29 Aug 2024 18:44 UTC
1
point
0
comments
1
min read
LW
link
Santiago Chile—ACX Meetups Everywhere Fall 2024
Iñaki
29 Aug 2024 18:44 UTC
1
point
0
comments
1
min read
LW
link
Florianópolis Brazil—ACX Meetups Everywhere Fall 2024
Adiel
29 Aug 2024 18:44 UTC
1
point
0
comments
1
min read
LW
link