Manifold: If okay AGI, why?
Eliezer Yudkowsky
Mar 25, 2023, 10:43 PM
120
points
37
comments
1
min read
LW
link
(manifold.markets)
A stylized dialogue on John Wentworth’s claims about markets and optimization
So8res
Mar 25, 2023, 10:32 PM
169
points
22
comments
8
min read
LW
link
Reproject on Cropping
jefftk
Mar 25, 2023, 9:50 PM
21
points
5
comments
1
min read
LW
link
(www.jefftk.com)
Sam Altman on GPT-4, ChatGPT, and the Future of AI | Lex Fridman Podcast #367
Gabe M
Mar 25, 2023, 7:08 PM
63
points
4
comments
2
min read
LW
link
(www.youtube.com)
$500 Bounty/Contest: Explain Infra-Bayes In The Language Of Game Theory
johnswentworth
Mar 25, 2023, 5:29 PM
83
points
7
comments
2
min read
LW
link
Aligned AI as a wrapper around an LLM
cousin_it
Mar 25, 2023, 3:58 PM
31
points
19
comments
1
min read
LW
link
Good News, Everyone!
jbash
Mar 25, 2023, 1:48 PM
132
points
23
comments
2
min read
LW
link
ChatGPT Plugins—The Beginning of the End
Bary Levy
Mar 25, 2023, 11:45 AM
15
points
4
comments
1
min read
LW
link
AI Capabilities vs. AI Products
Darmani
Mar 25, 2023, 1:14 AM
13
points
1
comment
3
min read
LW
link
Nudging Polarization
jefftk
Mar 24, 2023, 11:50 PM
41
points
14
comments
3
min read
LW
link
(www.jefftk.com)
Why There Is No Answer to Your Philosophical Question
Bryan Frances
Mar 24, 2023, 11:22 PM
−12
points
10
comments
12
min read
LW
link
“Slightly Evil” AI Apps
intellectronica
Mar 24, 2023, 10:52 PM
1
point
2
comments
2
min read
LW
link
(intellectronica.net)
[Question]
Seeking Advice on Raising AI X-Risk Awareness on Social Media
MrThink
Mar 24, 2023, 10:25 PM
2
points
1
comment
1
min read
LW
link
Hutter-Prize for Prompts
rokosbasilisk
Mar 24, 2023, 9:26 PM
5
points
10
comments
1
min read
LW
link
How likely do you think worse-than-extinction type fates to be?
span1
Mar 24, 2023, 9:03 PM
5
points
4
comments
1
min read
LW
link
Meetup Tip: The Greeter
Screwtape
Mar 24, 2023, 8:31 PM
32
points
1
comment
4
min read
LW
link
Metaculus Predicts Weak AGI in 2 Years and AGI in 10
Chris_Leong
Mar 24, 2023, 7:43 PM
29
points
14
comments
LW
link
[Question]
How to model uncertainty about preferences?
quetzal_rainbow
Mar 24, 2023, 7:04 PM
10
points
2
comments
1
min read
LW
link
Exploring Tacit Linked Premises with GPT
romeostevensit
Mar 24, 2023, 6:09 PM
42
points
3
comments
3
min read
LW
link
More experiments in GPT-4 agency: writing memos
Christopher King
Mar 24, 2023, 5:51 PM
5
points
2
comments
10
min read
LW
link
Why consumerism is good actually
jasoncrawford
Mar 24, 2023, 5:42 PM
11
points
16
comments
1
min read
LW
link
(rootsofprogress.org)
Are extrapolation-based AIs alignable?
cousin_it
Mar 24, 2023, 3:55 PM
24
points
15
comments
1
min read
LW
link
Does GPT-4 exhibit agency when summarizing articles?
Christopher King
Mar 24, 2023, 3:49 PM
16
points
2
comments
5
min read
LW
link
GPT-2005: A conversation with ChatGPT (featuring semi-functional Wolfram Alpha plugin!)
Lone Pine
Mar 24, 2023, 2:03 PM
19
points
0
comments
22
min read
LW
link
Microsoft Research Paper Claims Sparks of Artificial Intelligence in GPT-4
Zvi
Mar 24, 2023, 1:20 PM
72
points
14
comments
6
min read
LW
link
(thezvi.wordpress.com)
So, just why do GPTs have to operate by continuing an existing string?
Bill Benzon
Mar 24, 2023, 12:08 PM
−4
points
0
comments
3
min read
LW
link
Apply now to rationality camps: ESPR & PAIR—new Program on AI and Reasoning (ages 16-20)
Anna Gajdova
Mar 24, 2023, 11:40 AM
42
points
0
comments
1
min read
LW
link
[Question]
What does the economy do?
tailcalled
Mar 24, 2023, 10:49 AM
9
points
20
comments
1
min read
LW
link
[Question]
Can independent researchers get a sponsored visa for the US or UK?
jacquesthibs
Mar 24, 2023, 6:10 AM
23
points
1
comment
1
min read
LW
link
Wittgenstein and ML — parameters vs architecture
Cleo Nardo
Mar 24, 2023, 4:54 AM
44
points
9
comments
5
min read
LW
link
Grinding slimes in the dungeon of AI alignment research
Max H
Mar 24, 2023, 4:51 AM
10
points
2
comments
4
min read
LW
link
A crazy hypothesis: GPT-4 already is agentic and is trying to take over the world!
Christopher King
Mar 24, 2023, 1:19 AM
−2
points
11
comments
9
min read
LW
link
Abstracts should be either Actually Short™, or broken into paragraphs
Raemon
Mar 24, 2023, 12:51 AM
93
points
27
comments
5
min read
LW
link
Using GPT-4 to Understand Code
sid
Mar 24, 2023, 12:09 AM
25
points
2
comments
6
min read
LW
link
Kingfisher Album Kickstarter
jefftk
Mar 23, 2023, 11:20 PM
8
points
0
comments
2
min read
LW
link
(www.jefftk.com)
Is your job replaceable by GPT-4? (as of March 2023)
Bezzi
Mar 23, 2023, 10:16 PM
18
points
6
comments
1
min read
LW
link
ACX meetup [April]
sallatik
Mar 23, 2023, 8:40 PM
1
point
0
comments
1
min read
LW
link
Feature idea: extra info about post author’s response to comments.
Nathan Helm-Burger
Mar 23, 2023, 8:14 PM
6
points
0
comments
1
min read
LW
link
Limit intelligent weapons
Lucas Pfeifer
Mar 23, 2023, 5:54 PM
−11
points
36
comments
1
min read
LW
link
We have to Upgrade
Jed McCaleb
Mar 23, 2023, 5:53 PM
131
points
35
comments
2
min read
LW
link
The Overton Window widens: Examples of AI risk in the media
Orpheus16
Mar 23, 2023, 5:10 PM
107
points
24
comments
6
min read
LW
link
GPT-4 aligning with acasual decision theory when instructed to play games, but includes a CDT explanation that’s incorrect if they differ
Christopher King
Mar 23, 2023, 4:16 PM
7
points
4
comments
8
min read
LW
link
Is “FOXP2 speech & language disorder” really “FOXP2 forebrain fine-motor crappiness”?
Steven Byrnes
Mar 23, 2023, 4:09 PM
22
points
8
comments
6
min read
LW
link
EAI Alignment Speaker Series #1: Challenges for Safe & Beneficial Brain-Like Artificial General Intelligence with Steve Byrnes
Curtis Huebner
and
Steven Byrnes
Mar 23, 2023, 2:32 PM
28
points
0
comments
27
min read
LW
link
(youtu.be)
[Question]
Alignment-related jobs outside of London/SF
kwiat.dev
23 Mar 2023 13:24 UTC
26
points
14
comments
1
min read
LW
link
Zuzalu
vincentweisser
23 Mar 2023 11:24 UTC
3
points
0
comments
1
min read
LW
link
How Do Induction Heads Actually Work in Transformers With Finite Capacity?
Fabien Roger
23 Mar 2023 9:09 UTC
27
points
0
comments
5
min read
LW
link
ChatGPT’s “fuzzy alignment” isn’t evidence of AGI alignment: the banana test
Michael Tontchev
23 Mar 2023 7:12 UTC
23
points
6
comments
4
min read
LW
link
Sparks of Artificial General Intelligence: Early experiments with GPT-4 | Microsoft Research
DragonGod
23 Mar 2023 5:45 UTC
68
points
23
comments
1
min read
LW
link
(arxiv.org)
Transcript: NBC Nightly News: AI ‘race to recklessness’ w/ Tristan Harris, Aza Raskin
WilliamKiely
23 Mar 2023 1:04 UTC
63
points
4
comments
3
min read
LW
link