Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
[Linkpost] Silver Bulletin: For most people, politics is about fitting in
Gunnar_Zarncke
May 1, 2024, 6:12 PM
18
points
4
comments
1
min read
LW
link
(www.natesilver.net)
Launching applications for AI Safety Careers Course India 2024
Axiom_Futures
May 1, 2024, 5:55 PM
4
points
1
comment
1
min read
LW
link
[Question]
Shane Legg’s necessary properties for every AGI Safety plan
jacquesthibs
May 1, 2024, 5:15 PM
58
points
12
comments
1
min read
LW
link
KAN: Kolmogorov-Arnold Networks
Gunnar_Zarncke
May 1, 2024, 4:50 PM
18
points
15
comments
1
min read
LW
link
(arxiv.org)
Manifund Q1 Retro: Learnings from impact certs
Austin Chen
May 1, 2024, 4:48 PM
40
points
1
comment
LW
link
ACX Covid Origins Post convinced readers
ErnestScribbler
May 1, 2024, 1:06 PM
77
points
7
comments
2
min read
LW
link
LessWrong Community Weekend 2024, open for applications
UnplannedCauliflower
and
jt
May 1, 2024, 10:18 AM
79
points
2
comments
7
min read
LW
link
Take SCIFs, it’s dangerous to go alone
latterframe
,
Jeffrey Ladish
and
schroederdewitt
May 1, 2024, 8:02 AM
42
points
1
comment
3
min read
LW
link
AXRP Episode 30 - AI Security with Jeffrey Ladish
DanielFilan
May 1, 2024, 2:50 AM
25
points
0
comments
79
min read
LW
link
Neuro/BCI/WBE for Safe AI Workshop
Allison Duettmann
May 1, 2024, 12:46 AM
3
points
0
comments
1
min read
LW
link
AGI: Cryptography, Security & Multipolar Scenarios Workshop
Allison Duettmann
May 1, 2024, 12:42 AM
8
points
1
comment
1
min read
LW
link
The formal goal is a pointer
Morphism
May 1, 2024, 12:27 AM
20
points
10
comments
1
min read
LW
link
Arch-anarchy:Theory and practice
Peter lawless
Apr 30, 2024, 11:20 PM
−6
points
0
comments
2
min read
LW
link
“Open Source AI” is a lie, but it doesn’t have to be
jacobhaimes
Apr 30, 2024, 11:10 PM
19
points
5
comments
6
min read
LW
link
(jacob-haimes.github.io)
Questions for labs
Zach Stein-Perlman
Apr 30, 2024, 10:15 PM
77
points
11
comments
8
min read
LW
link
Reality comprehensibility: are there illogical things in reality?
DDthinker
Apr 30, 2024, 9:30 PM
−3
points
0
comments
10
min read
LW
link
Mechanistically Eliciting Latent Behaviors in Language Models
Andrew Mack
and
TurnTrout
Apr 30, 2024, 6:51 PM
210
points
43
comments
45
min read
LW
link
[Question]
What is the easiest/funnest way to build up a comprehensive understanding of AI and AI Safety?
Jordan Arel
Apr 30, 2024, 6:41 PM
4
points
2
comments
1
min read
LW
link
Transcoders enable fine-grained interpretable circuit analysis for language models
Jacob Dunefsky
,
Philippe Chlenski
and
Neel Nanda
Apr 30, 2024, 5:58 PM
74
points
14
comments
17
min read
LW
link
Announcing the 2024 Roots of Progress Blog-Building Intensive
jasoncrawford
Apr 30, 2024, 5:37 PM
14
points
0
comments
2
min read
LW
link
(rootsofprogress.org)
The Intentional Stance, LLMs Edition
Eleni Angelou
Apr 30, 2024, 5:12 PM
30
points
3
comments
8
min read
LW
link
Introducing AI Lab Watch
Zach Stein-Perlman
Apr 30, 2024, 5:00 PM
225
points
30
comments
1
min read
LW
link
(ailabwatch.org)
Why I’m doing PauseAI
Joseph Miller
Apr 30, 2024, 4:21 PM
108
points
16
comments
4
min read
LW
link
LLMs could be as conscious as human emulations, potentially
Canaletto
Apr 30, 2024, 11:36 AM
15
points
15
comments
3
min read
LW
link
An interesting mathematical model of how LLMs work
Bill Benzon
Apr 30, 2024, 11:01 AM
5
points
0
comments
1
min read
LW
link
Towards Multimodal Interpretability: Learning Sparse Interpretable Features in Vision Transformers
hugofry
Apr 29, 2024, 8:57 PM
93
points
8
comments
11
min read
LW
link
Towards a formalization of the agent structure problem
Alex_Altair
Apr 29, 2024, 8:28 PM
55
points
6
comments
14
min read
LW
link
Ironing Out the Squiggles
Zack_M_Davis
Apr 29, 2024, 4:13 PM
157
points
36
comments
11
min read
LW
link
Super additivity of consciousness
Arturo Macias
Apr 29, 2024, 3:41 PM
−2
points
13
comments
2
min read
LW
link
AISC9 has ended and there will be an AISC10
Linda Linsefors
Apr 29, 2024, 10:53 AM
75
points
4
comments
2
min read
LW
link
Open-Source AI: A Regulatory Review
Elliot Mckernon
and
Deric Cheng
Apr 29, 2024, 10:10 AM
18
points
0
comments
8
min read
LW
link
Big-endian is better than little-endian
Menotim
Apr 29, 2024, 2:30 AM
29
points
17
comments
3
min read
LW
link
The Prop-room and Stage Cognitive Architecture
Robert Kralisch
Apr 29, 2024, 12:48 AM
14
points
4
comments
14
min read
LW
link
How are Simulators and Agents related?
Robert Kralisch
Apr 29, 2024, 12:22 AM
6
points
0
comments
7
min read
LW
link
Extended Embodiment
Robert Kralisch
Apr 29, 2024, 12:18 AM
8
points
1
comment
3
min read
LW
link
Referential Containment
Robert Kralisch
Apr 29, 2024, 12:16 AM
2
points
4
comments
3
min read
LW
link
Disentangling Competence and Intelligence
Robert Kralisch
Apr 29, 2024, 12:12 AM
23
points
7
comments
6
min read
LW
link
List your AI X-Risk cruxes!
Aryeh Englander
Apr 28, 2024, 6:26 PM
42
points
7
comments
2
min read
LW
link
Things I tell myself to be more agentic
DMMF
Apr 28, 2024, 5:44 PM
9
points
0
comments
3
min read
LW
link
(danfrank.ca)
Estimating the Number of Players from Game Result Percentages
Daniel L
Apr 28, 2024, 5:42 PM
1
point
2
comments
1
min read
LW
link
The Science Algorithm—AISC 2024 Final Presentation
Johannes C. Mayer
Apr 28, 2024, 2:55 PM
4
points
0
comments
1
min read
LW
link
(www.youtube.com)
[Aspiration-based designs] Outlook: dealing with complexity
Jobst Heitzig
,
jossoliver
,
thomasfinn
and
Simon Dima
Apr 28, 2024, 1:06 PM
13
points
3
comments
2
min read
LW
link
[Aspiration-based designs] 3. Performance and safety criteria, and aspiration intervals
Jobst Heitzig
Apr 28, 2024, 1:04 PM
10
points
0
comments
12
min read
LW
link
[Aspiration-based designs] 2. Formal framework, basic algorithm
Jobst Heitzig
,
Simon Dima
and
Simon Fischer
28 Apr 2024 13:02 UTC
18
points
2
comments
16
min read
LW
link
[Aspiration-based designs] 1. Informal introduction
B Jacobs
,
Jobst Heitzig
,
Simon Fischer
and
Simon Dima
28 Apr 2024 13:00 UTC
44
points
4
comments
8
min read
LW
link
Playing Northboro with Lily and Rick
jefftk
28 Apr 2024 2:40 UTC
10
points
1
comment
2
min read
LW
link
(www.jefftk.com)
Release of UN’s draft related to the governance of AI (a summary of the Simon Institute’s response)
Sebastian Schmidt
27 Apr 2024 18:34 UTC
7
points
0
comments
1
min read
LW
link
(forum.effectivealtruism.org)
Mercy to the Machine: Thoughts & Rights
False Name
27 Apr 2024 16:36 UTC
7
points
6
comments
17
min read
LW
link
Constructability: Plainly-coded AGIs may be feasible in the near future
Épiphanie Gédéon
and
Charbel-Raphaël
27 Apr 2024 16:04 UTC
91
points
13
comments
13
min read
LW
link
So What’s Up With PUFAs Chemically?
J Bostock
27 Apr 2024 13:32 UTC
57
points
23
comments
6
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel