be the per­son that makes the meet­ing productive

OldmanrahulJan 18, 2025, 10:32 PM
9 points
0 comments1 min readLW link

Beards and Masks?

jefftkJan 18, 2025, 4:00 PM
72 points
5 comments4 min readLW link
(www.jefftk.com)

[Question] How likely is AGI to force us all to be happy for­ever? (much like in the Three Wor­lds Col­lide novel)

uhbif19Jan 18, 2025, 3:39 PM
9 points
5 comments1 min readLW link

Well-be­ing in the mind, and its im­pli­ca­tions for utilitarianism

SjlverJan 18, 2025, 3:32 PM
6 points
2 comments2 min readLW link

[Ex­er­cise] Four Ex­am­ples of Notic­ing Confusion

Logan RiggsJan 18, 2025, 3:29 PM
8 points
8 comments3 min readLW link

Scal­ing Wargam­ing for Global Catas­trophic Risks with AI

Jan 18, 2025, 3:10 PM
38 points
2 comments4 min readLW link
(blog.sentinel-team.org)

Align­ment ideas

qbolecJan 18, 2025, 12:43 PM
11 points
1 comment8 min readLW link

AI-en­abled Cloud Gaming

samuelshadrachJan 18, 2025, 11:56 AM
1 point
0 comments3 min readLW link
(samuelshadrach.com)

Don’t ig­nore bad vibes you get from people

Kaj_SotalaJan 18, 2025, 9:20 AM
154 points
50 comments2 min readLW link
(kajsotala.fi)

Renor­mal­iza­tion Re­dux: QFT Tech­niques for AI Interpretability

Jan 18, 2025, 3:54 AM
44 points
12 comments7 min readLW link

[Question] What’s Wrong With the Si­mu­la­tion Ar­gu­ment?

AynonymousPrsn123Jan 18, 2025, 2:32 AM
6 points
49 comments1 min readLW link

Your AI Safety fo­cus is down­stream of your AGI timeline

Michael FloodJan 17, 2025, 9:24 PM
9 points
0 comments4 min readLW link

Thoughts on the con­ser­va­tive as­sump­tions in AI control

BuckJan 17, 2025, 7:23 PM
91 points
5 comments13 min readLW link

Ti­maeus is hiring re­searchers & engineers

Jan 17, 2025, 7:13 PM
65 points
4 comments4 min readLW link

Model Amnesty Project

themisJan 17, 2025, 6:53 PM
3 points
2 comments3 min readLW link

Ad­dress­ing doubts of AI progress: Why GPT-5 is not late, and why data scarcity isn’t a fun­da­men­tal limiter near term.

LDJJan 17, 2025, 6:53 PM
2 points
0 comments2 min readLW link

Play­ing Dixit with AI: How Well LLMs De­tect ‘Me-ness’

Mariia KoroliukJan 17, 2025, 6:52 PM
5 points
0 comments2 min readLW link

Do­ing a self-ran­dom­ized study of the im­pacts of glycine on sleep (Science is hard)

thedissonance.netJan 17, 2025, 6:49 PM
11 points
5 comments11 min readLW link

How sci-fi can have drama with­out dystopia or doomerism

jasoncrawfordJan 17, 2025, 3:22 PM
19 points
3 comments3 min readLW link
(newsletter.rootsofprogress.org)

[Question] What do you mean with ‘al­ign­ment is solv­able in prin­ci­ple’?

RemmeltJan 17, 2025, 3:03 PM
3 points
9 comments1 min readLW link

Meta Pivots on Con­tent Moderation

ZviJan 17, 2025, 2:20 PM
47 points
3 comments10 min readLW link
(thezvi.wordpress.com)

Tax Price Goug­ing?

jefftkJan 17, 2025, 2:10 PM
55 points
22 comments3 min readLW link
(www.jefftk.com)

The quan­tum red pill or: They lied to you, we live in the (den­sity) matrix

Dmitry VaintrobJan 17, 2025, 1:58 PM
37 points
34 comments12 min readLW link

Bed­nets -- 4 longer malaria studies

HznJan 17, 2025, 8:47 AM
4 points
0 comments4 min readLW link

Pa­tent Trol­ling to Save the World

DoubleJan 17, 2025, 4:13 AM
23 points
7 comments3 min readLW link

Call Booth Ex­ter­nal Monitor

jefftkJan 17, 2025, 3:10 AM
15 points
0 comments1 min readLW link
(www.jefftk.com)

[Cross-post] Wel­come to the Es­say Meta

davekastenJan 16, 2025, 11:36 PM
11 points
2 comments8 min readLW link

AI for Re­solv­ing Fore­cast­ing Ques­tions: An Early Exploration

ozziegooenJan 16, 2025, 9:41 PM
10 points
2 commentsLW link

[Question] How Do You In­ter­pret the Goal of LessWrong and Its Com­mu­nity?

ashen8461Jan 16, 2025, 7:08 PM
−2 points
2 comments1 min readLW link

Ex­perts’ AI timelines are longer than you have been told?

Vasco GriloJan 16, 2025, 6:03 PM
10 points
4 comments3 min readLW link
(bayes.net)

Num­ber­wang: LLMs Do­ing Au­tonomous Re­search, and a Call for Input

Jan 16, 2025, 5:20 PM
71 points
30 comments31 min readLW link

Topolog­i­cal De­bate Framework

lunatic_at_largeJan 16, 2025, 5:19 PM
10 points
5 comments9 min readLW link

AI #99: Farewell to Biden

ZviJan 16, 2025, 2:20 PM
54 points
5 comments58 min readLW link
(thezvi.wordpress.com)

De­cep­tive Align­ment and Homuncularity

Jan 16, 2025, 1:55 PM
26 points
12 comments22 min readLW link

In­tro­duc­ing the WeirdML Benchmark

Håvard Tveit IhleJan 16, 2025, 11:38 AM
56 points
13 comments11 min readLW link

The Math­e­mat­i­cal Rea­son You should have 9 Kids

Zero ContradictionsJan 16, 2025, 11:24 AM
−9 points
6 comments1 min readLW link
(eternalanglo.com)

Repli­ca­tors, Gods and Bud­dhist Cosmology

KristianRonnJan 16, 2025, 10:51 AM
15 points
3 comments26 min readLW link

Quan­tum with­out complication

Jan 16, 2025, 8:53 AM
30 points
2 comments10 min readLW link

Per­ma­nents: much more than you wanted to know

Dmitry VaintrobJan 16, 2025, 8:04 AM
17 points
2 comments15 min readLW link

Gam­ing Truth­fulQA: Sim­ple Heuris­tics Ex­posed Dataset Weaknesses

TurnTroutJan 16, 2025, 2:14 AM
64 points
3 comments1 min readLW link
(turntrout.com)

What Is The Align­ment Prob­lem?

johnswentworthJan 16, 2025, 1:20 AM
180 points
49 comments25 min readLW link

Im­prov­ing Our Safety Cases Us­ing Up­per and Lower Bounds

Yonatan CaleJan 16, 2025, 12:01 AM
23 points
0 comments3 min readLW link

Un­reg­u­lated Pep­tides: Does BPC-157 hold its promises?

ChristianKlJan 15, 2025, 11:36 PM
28 points
7 comments4 min readLW link

New, im­proved mul­ti­ple-choice TruthfulQA

Jan 15, 2025, 11:32 PM
72 points
0 comments3 min readLW link

The Differ­ence Between Pre­dic­tion Mar­kets and De­bate (Ar­gu­ment) Maps

Jamie JoyceJan 15, 2025, 11:19 PM
6 points
3 comments3 min readLW link

A Novel Emer­gence of Meta-Aware­ness in LLM Fine-Tuning

rifeJan 15, 2025, 10:59 PM
57 points
32 comments2 min readLW link

Six Small Co­hab­itive Games

ScrewtapeJan 15, 2025, 9:59 PM
40 points
7 comments13 min readLW link

LLMs are re­ally good at k-or­der think­ing (where k is even)

charlieoneillJan 15, 2025, 8:43 PM
7 points
0 comments2 min readLW link

Every­where I Look, I See Kat Woods

just_browsingJan 15, 2025, 7:29 PM
22 points
44 comments5 min readLW link

[un­ti­tled post]

EmreJan 15, 2025, 6:52 PM
−1 points
0 comments1 min readLW link