Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
1
Video essay: How Will We Know When AI is Conscious?
JanPro
Sep 6, 2023, 6:10 PM
11
points
7
comments
1
min read
LW
link
(www.youtube.com)
My First Post
Jaivardhan Nawani
Sep 6, 2023, 5:42 PM
35
points
9
comments
1
min read
LW
link
ActAdd: Steering Language Models without Optimization
technicalities
,
TurnTrout
,
lisathiergart
,
David Udell
,
Ulisse Mini
and
Monte M
Sep 6, 2023, 5:21 PM
105
points
3
comments
2
min read
LW
link
(arxiv.org)
Monthly Roundup #10: September 2023
Zvi
Sep 6, 2023, 1:20 PM
35
points
4
comments
56
min read
LW
link
(thezvi.wordpress.com)
Find Hot French Food Near Me: A Follow-up
aphyer
Sep 6, 2023, 12:32 PM
75
points
19
comments
2
min read
LW
link
Manifest 2023
Saul Munn
and
Austin Chen
Sep 6, 2023, 11:24 AM
3
points
0
comments
1
min read
LW
link
Last Chance: Get tickets to Manifest 2023! (Sep 22-24 in Berkeley)
Saul Munn
and
Austin Chen
Sep 6, 2023, 10:35 AM
5
points
0
comments
1
min read
LW
link
What I’ve been reading, September 2023
jasoncrawford
Sep 6, 2023, 9:32 AM
17
points
0
comments
5
min read
LW
link
(rootsofprogress.org)
Decision Theory: A (Normative) Introduction
Pareto Optimal
Sep 6, 2023, 8:22 AM
−1
points
1
comment
3
min read
LW
link
(paretooptimal.substack.com)
[Question]
What’s the easiest way to make a luminator?
kuira
Sep 6, 2023, 12:07 AM
7
points
13
comments
1
min read
LW
link
Ordinary claims require ordinary evidence
blake8086
Sep 5, 2023, 10:09 PM
1
point
3
comments
2
min read
LW
link
Conversation about paradigms, intellectual progress, social consensus, and AI
Ruby
and
RobertM
Sep 5, 2023, 9:30 PM
14
points
6
comments
1
min read
LW
link
What I would do if I wasn’t at ARC Evals
LawrenceC
Sep 5, 2023, 7:19 PM
220
points
10
comments
13
min read
LW
link
1
review
The Evolutionary Pathway from Biological to Digital Intelligence: A Cosmic Perspective
George360
Sep 5, 2023, 5:47 PM
−17
points
0
comments
4
min read
LW
link
The Illusion of Universal Morality: A Dynamic Perspective on Genetic Fitness and Ethical Complexity
George360
Sep 5, 2023, 5:47 PM
−9
points
7
comments
2
min read
LW
link
Benchmarks for Detecting Measurement Tampering [Redwood Research]
ryan_greenblatt
and
Fabien Roger
Sep 5, 2023, 4:44 PM
94
points
22
comments
20
min read
LW
link
1
review
(arxiv.org)
[Question]
Strongest real-world examples supporting AI risk claims?
rosehadshar
Sep 5, 2023, 3:12 PM
41
points
7
comments
1
min read
LW
link
AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy
Dan H
Sep 5, 2023, 3:03 PM
15
points
0
comments
5
min read
LW
link
(newsletter.safe.ai)
Who Has the Best Food?
Zvi
Sep 5, 2023, 1:40 PM
52
points
61
comments
10
min read
LW
link
(thezvi.wordpress.com)
World, mind, and learnability: A note on the metaphysical structure of the cosmos [& LLMs]
Bill Benzon
Sep 5, 2023, 12:19 PM
4
points
1
comment
5
min read
LW
link
Deleted
goktu
Sep 5, 2023, 8:10 AM
−12
points
1
comment
1
min read
LW
link
Text Posts from the Kids Group: 2023 I
jefftk
Sep 5, 2023, 2:00 AM
75
points
3
comments
7
min read
LW
link
(www.jefftk.com)
Action theory is not policy theory is not agent theory
Cole Wyeth
Sep 5, 2023, 1:38 AM
20
points
4
comments
6
min read
LW
link
(colewyeth.com)
The purpose of the (Mosaic) law
mruwnik
Sep 4, 2023, 11:38 PM
7
points
5
comments
6
min read
LW
link
Against the Open Source / Closed Source Dichotomy: Regulated Source as a Model for Responsible AI Development
alex.herwix
Sep 4, 2023, 8:25 PM
4
points
12
comments
6
min read
LW
link
(forum.effectivealtruism.org)
Notes on nukes, IR, and AI from “Arsenals of Folly” (and other books)
tlevin
Sep 4, 2023, 7:02 PM
11
points
0
comments
6
min read
LW
link
Hertford, Sourbut (rationality lessons from University Challenge)
Oliver Sourbut
Sep 4, 2023, 6:44 PM
28
points
7
comments
14
min read
LW
link
(www.oliversourbut.net)
a rant on politician-engineer coalitional conflict
bhauth
Sep 4, 2023, 5:15 PM
64
points
12
comments
4
min read
LW
link
How ForumMagnum builds communities of inquiry
Jim Fisher
Sep 4, 2023, 4:52 PM
33
points
21
comments
5
min read
LW
link
Interpreting a matrix-valued word embedding with a mathematically proven characterization of all optima
Joseph Van Name
Sep 4, 2023, 4:19 PM
3
points
4
comments
12
min read
LW
link
Hard Questions Are Language Bugs
George3d6
Sep 4, 2023, 2:44 PM
30
points
13
comments
7
min read
LW
link
(ontologi.cc)
Defunding My Mistake
ymeskhout
Sep 4, 2023, 2:43 PM
178
points
41
comments
6
min read
LW
link
The omnizoid—Heighn FDT Debate #1: Why FDT Isn’t Crazy
Heighn
Sep 4, 2023, 12:57 PM
24
points
4
comments
6
min read
LW
link
Paper: On measuring situational awareness in LLMs
Owain_Evans
,
Daniel Kokotajlo
,
Mikita Balesni
,
Tomek Korbak
,
Asa Cooper Stickland
,
Meg
and
Maximilian Kaufmann
Sep 4, 2023, 12:54 PM
109
points
16
comments
5
min read
LW
link
(arxiv.org)
Impending AGI doesn’t make everything else unimportant
Igor Ivanov
Sep 4, 2023, 12:34 PM
29
points
12
comments
5
min read
LW
link
Open Thread – Autumn 2023
Raemon
Sep 3, 2023, 10:54 PM
26
points
111
comments
1
min read
LW
link
What must be the case that ChatGPT would have memorized “To be or not to be”? – Three kinds of conceptual objects for LLMs
Bill Benzon
Sep 3, 2023, 6:39 PM
19
points
0
comments
12
min read
LW
link
Fundamental question: What determines a mind’s effects?
TsviBT
Sep 3, 2023, 5:15 PM
15
points
4
comments
13
min read
LW
link
An embedding decoder model, trained with a different objective on a different dataset, can decode another model’s embeddings surprisingly accurately
Logan Zoellner
Sep 3, 2023, 11:34 AM
20
points
1
comment
1
min read
LW
link
Series of absurd upgrades in nature’s great search
lemonhope
Sep 3, 2023, 9:35 AM
15
points
8
comments
1
min read
LW
link
Conservation of Expected Evidence and Random Sampling in Anthropics
Ape in the coat
Sep 3, 2023, 6:55 AM
9
points
9
comments
7
min read
LW
link
The goal of physics
Jim Pivarski
Sep 2, 2023, 11:08 PM
46
points
4
comments
5
min read
LW
link
Will value of paid sex drop right before the end of the world?
azamatvaliev
Sep 2, 2023, 7:03 PM
−13
points
0
comments
4
min read
LW
link
PIBBSS Summer Symposium 2023
Nora_Ammann
and
DusanDNesic
Sep 2, 2023, 5:22 PM
25
points
2
comments
3
min read
LW
link
The smallest possible button (or: moth traps!)
Neil
Sep 2, 2023, 3:24 PM
122
points
18
comments
3
min read
LW
link
(neilwarren.substack.com)
Steven Harnad: Symbol grounding and the structure of dictionaries
Bill Benzon
Sep 2, 2023, 12:28 PM
5
points
3
comments
2
min read
LW
link
Is Metaethics Unnecessary Given Intent-Aligned AI?
Caleb Biddulph
Sep 2, 2023, 9:48 AM
10
points
0
comments
7
min read
LW
link
Rational Agents Cooperate in the Prisoner’s Dilemma
Isaac King
Sep 2, 2023, 6:15 AM
17
points
68
comments
12
min read
LW
link
[Linkpost] Large language models converge toward human-like concept organization
Bogdan Ionut Cirstea
Sep 2, 2023, 6:00 AM
22
points
1
comment
1
min read
LW
link
Plum Cooking Temperature
jefftk
Sep 2, 2023, 1:30 AM
11
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel