Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Briefly Extending Differential Optimization to Distributions
J Bostock
Mar 10, 2024, 8:41 PM
4
points
0
comments
2
min read
LW
link
Evolution did a surprising good job at aligning humans...to social status
Eli Tyre
Mar 10, 2024, 7:34 PM
24
points
37
comments
1
min read
LW
link
Pausing AI is Positive Expected Value
Liron
Mar 10, 2024, 5:10 PM
9
points
2
comments
3
min read
LW
link
(twitter.com)
W2SG: Introduction
Maria Kapros
Mar 10, 2024, 4:25 PM
2
points
2
comments
10
min read
LW
link
An Optimistic Solution to the Fermi Paradox
Glenn Clayton
Mar 10, 2024, 2:39 PM
4
points
6
comments
13
min read
LW
link
Counterfactual Civilization Simulation Version −1.0 aka my application to Johannes Mayer’s SPAR project
Morphism
Mar 10, 2024, 10:10 AM
1
point
0
comments
14
min read
LW
link
Notes from a Prompt Factory
Richard_Ngo
Mar 10, 2024, 5:13 AM
104
points
19
comments
9
min read
LW
link
(www.narrativeark.xyz)
Investigating Basin Volume with XOR Networks
CatGoddess
Mar 10, 2024, 1:35 AM
10
points
0
comments
5
min read
LW
link
[Linkpost] MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data
Bogdan Ionut Cirstea
Mar 10, 2024, 1:30 AM
10
points
0
comments
1
min read
LW
link
(openreview.net)
0th Person and 1st Person Logic
Adele Lopez
Mar 10, 2024, 12:56 AM
60
points
28
comments
6
min read
LW
link
Completion Estimates
scarcegreengrass
Mar 9, 2024, 10:56 PM
7
points
2
comments
3
min read
LW
link
Semi-Simplicial Types, Part I: Motivation and History
astradiol
Mar 9, 2024, 10:07 PM
20
points
3
comments
10
min read
LW
link
Distinctions when Discussing Utility Functions
ozziegooen
Mar 9, 2024, 8:14 PM
24
points
7
comments
LW
link
What is progress?
jasoncrawford
Mar 9, 2024, 4:28 PM
10
points
4
comments
6
min read
LW
link
(rootsofprogress.org)
Fifteen Lawsuits against OpenAI
Remmelt
Mar 9, 2024, 12:22 PM
27
points
4
comments
1
min read
LW
link
Cambridge ACX/SSC monthly meetup (location changed to Fort St George!)
hamishtodd1
Mar 9, 2024, 11:10 AM
2
points
0
comments
1
min read
LW
link
MA E-ZPass Without a Car?
jefftk
Mar 9, 2024, 2:40 AM
15
points
2
comments
1
min read
LW
link
(www.jefftk.com)
Closeness To the Issue (Part 5 of “The Sense Of Physical Necessity”)
LoganStrohl
Mar 9, 2024, 12:36 AM
36
points
0
comments
15
min read
LW
link
Exploring the Evolution and Migration of Different Layer Embedding in LLMs
Ruixuan Huang
Mar 8, 2024, 3:01 PM
6
points
0
comments
8
min read
LW
link
[Question]
When and why did ‘training’ become ‘pretraining’?
beren
Mar 8, 2024, 2:29 PM
16
points
6
comments
1
min read
LW
link
A T-o-M test: ‘popcorn’ or ‘chocolate’
MiguelDev
Mar 8, 2024, 4:24 AM
20
points
13
comments
1
min read
LW
link
Scenario Forecasting Workshop: Materials and Learnings
elifland
and
charlie_griffin
Mar 8, 2024, 2:30 AM
50
points
3
comments
2
min read
LW
link
Forecasting future gains due to post-training enhancements
elifland
,
Joel Becker
and
simeon_c
Mar 8, 2024, 2:11 AM
31
points
2
comments
1
min read
LW
link
(docs.google.com)
Do LLMs sometime simulate something akin to a dream?
Nezek
Mar 8, 2024, 1:25 AM
8
points
4
comments
1
min read
LW
link
Community norms poll (2 mins)
Nathan Young
Mar 7, 2024, 9:45 PM
11
points
1
comment
1
min read
LW
link
Announcing Convergence Analysis: An Institute for AI Scenario & Governance Research
David_Kristoffersson
and
Deric Cheng
Mar 7, 2024, 9:37 PM
23
points
1
comment
4
min read
LW
link
Woods’ new preprint on object permanence
Steven Byrnes
Mar 7, 2024, 9:29 PM
58
points
1
comment
6
min read
LW
link
MATS AI Safety Strategy Curriculum
Ronny Fernandez
and
Ryan Kidd
Mar 7, 2024, 7:59 PM
74
points
2
comments
16
min read
LW
link
Political Biases in LLMs: Literature Review & Current Uses of AI in Elections
Yashvardhan Sharma
,
Robayet Hossain
and
Ariana Gamarra
Mar 7, 2024, 7:17 PM
6
points
0
comments
6
min read
LW
link
Evidential Correlations are Subjective, and it might be a problem
Martín Soto
Mar 7, 2024, 6:37 PM
26
points
6
comments
14
min read
LW
link
AI Safety 101 : Capabilities—Human Level AI, What? How? and When?
markov
and
Charbel-Raphaël
Mar 7, 2024, 5:29 PM
46
points
8
comments
54
min read
LW
link
A Review of Weak to Strong Generalization [AI Safety Camp]
sevdeawesome
Mar 7, 2024, 5:16 PM
14
points
0
comments
9
min read
LW
link
AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs Plus, Forecasting the Future with LLMs, and Regulatory Markets
Corin Katzke
and
Dan H
Mar 7, 2024, 4:39 PM
8
points
0
comments
8
min read
LW
link
(newsletter.safe.ai)
AI #54: Clauding Along
Zvi
Mar 7, 2024, 4:00 PM
45
points
11
comments
51
min read
LW
link
(thezvi.wordpress.com)
Being Interested in Other People
Jonathan Moregård
Mar 7, 2024, 10:13 AM
14
points
1
comment
3
min read
LW
link
(youbutbetter.substack.com)
Talking to Congress: Can constituents contacting their legislator influence policy?
Tristan Williams
Mar 7, 2024, 9:24 AM
14
points
0
comments
LW
link
Explaining the AI Alignment Problem to Tibetan Buddhist Monks
Paul Colognese
Mar 7, 2024, 9:00 AM
20
points
3
comments
6
min read
LW
link
What if Alignment is Not Enough?
WillPetillo
Mar 7, 2024, 8:10 AM
15
points
46
comments
9
min read
LW
link
Sparks of AGI prompts on GPT2XL and its variant, RLLMv3
MiguelDev
Mar 7, 2024, 6:33 AM
4
points
0
comments
4
min read
LW
link
An AI, a box, and a threat
jwfiredragon
Mar 7, 2024, 6:15 AM
9
points
0
comments
6
min read
LW
link
Mud and Despair (Part 4 of “The Sense Of Physical Necessity”)
LoganStrohl
Mar 7, 2024, 12:14 AM
38
points
0
comments
2
min read
LW
link
introduction to thermal conductivity and noise management
bhauth
Mar 6, 2024, 11:14 PM
31
points
1
comment
4
min read
LW
link
(www.bhauth.com)
Essaying Other Plans
Screwtape
6 Mar 2024 22:59 UTC
29
points
4
comments
7
min read
LW
link
Invest in ACX Grants projects!
Saul Munn
6 Mar 2024 20:27 UTC
23
points
1
comment
LW
link
Vote on Anthropic Topics to Discuss
Ben Pace
6 Mar 2024 19:43 UTC
75
points
55
comments
1
min read
LW
link
Simple Kelly betting in prediction markets
jessicata
6 Mar 2024 18:59 UTC
38
points
3
comments
3
min read
LW
link
(unstablerontology.substack.com)
On Claude 3.0
Zvi
6 Mar 2024 18:50 UTC
76
points
5
comments
31
min read
LW
link
(thezvi.wordpress.com)
[Question]
Why correlation, though?
numpyNaN
6 Mar 2024 16:53 UTC
22
points
7
comments
1
min read
LW
link
Using axis lines for good or evil
dynomight
6 Mar 2024 14:47 UTC
151
points
39
comments
4
min read
LW
link
(dynomight.net)
Let’s build definitely-not-conscious AI
lemonhope
6 Mar 2024 7:50 UTC
4
points
18
comments
1
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel