[Cross-post] Wel­come to the Es­say Meta

davekastenJan 16, 2025, 11:36 PM
11 points
2 comments8 min readLW link

AI for Re­solv­ing Fore­cast­ing Ques­tions: An Early Exploration

ozziegooenJan 16, 2025, 9:41 PM
10 points
2 commentsLW link

[Question] How Do You In­ter­pret the Goal of LessWrong and Its Com­mu­nity?

ashen8461Jan 16, 2025, 7:08 PM
−2 points
2 comments1 min readLW link

Ex­perts’ AI timelines are longer than you have been told?

Vasco GriloJan 16, 2025, 6:03 PM
10 points
4 comments3 min readLW link
(bayes.net)

Num­ber­wang: LLMs Do­ing Au­tonomous Re­search, and a Call for Input

Jan 16, 2025, 5:20 PM
71 points
30 comments31 min readLW link

Topolog­i­cal De­bate Framework

lunatic_at_largeJan 16, 2025, 5:19 PM
10 points
5 comments9 min readLW link

AI #99: Farewell to Biden

ZviJan 16, 2025, 2:20 PM
54 points
5 comments58 min readLW link
(thezvi.wordpress.com)

De­cep­tive Align­ment and Homuncularity

Jan 16, 2025, 1:55 PM
26 points
12 comments22 min readLW link

In­tro­duc­ing the WeirdML Benchmark

Håvard Tveit IhleJan 16, 2025, 11:38 AM
56 points
13 comments11 min readLW link

The Math­e­mat­i­cal Rea­son You should have 9 Kids

Zero ContradictionsJan 16, 2025, 11:24 AM
−9 points
6 comments1 min readLW link
(eternalanglo.com)

Repli­ca­tors, Gods and Bud­dhist Cosmology

KristianRonnJan 16, 2025, 10:51 AM
15 points
3 comments26 min readLW link

Quan­tum with­out complication

Jan 16, 2025, 8:53 AM
30 points
2 comments10 min readLW link

Per­ma­nents: much more than you wanted to know

Dmitry VaintrobJan 16, 2025, 8:04 AM
17 points
2 comments15 min readLW link

Gam­ing Truth­fulQA: Sim­ple Heuris­tics Ex­posed Dataset Weaknesses

TurnTroutJan 16, 2025, 2:14 AM
64 points
3 comments1 min readLW link
(turntrout.com)

What Is The Align­ment Prob­lem?

johnswentworthJan 16, 2025, 1:20 AM
180 points
49 comments25 min readLW link

Im­prov­ing Our Safety Cases Us­ing Up­per and Lower Bounds

Yonatan CaleJan 16, 2025, 12:01 AM
23 points
0 comments3 min readLW link

Un­reg­u­lated Pep­tides: Does BPC-157 hold its promises?

ChristianKlJan 15, 2025, 11:36 PM
28 points
7 comments4 min readLW link

New, im­proved mul­ti­ple-choice TruthfulQA

Jan 15, 2025, 11:32 PM
72 points
0 comments3 min readLW link

The Differ­ence Between Pre­dic­tion Mar­kets and De­bate (Ar­gu­ment) Maps

Jamie JoyceJan 15, 2025, 11:19 PM
6 points
3 comments3 min readLW link

A Novel Emer­gence of Meta-Aware­ness in LLM Fine-Tuning

rifeJan 15, 2025, 10:59 PM
57 points
32 comments2 min readLW link

Six Small Co­hab­itive Games

ScrewtapeJan 15, 2025, 9:59 PM
40 points
7 comments13 min readLW link

LLMs are re­ally good at k-or­der think­ing (where k is even)

charlieoneillJan 15, 2025, 8:43 PM
7 points
0 comments2 min readLW link

Every­where I Look, I See Kat Woods

just_browsingJan 15, 2025, 7:29 PM
22 points
44 comments5 min readLW link

[un­ti­tled post]

EmreJan 15, 2025, 6:52 PM
−1 points
0 comments1 min readLW link

“Pick Two” AI Trilemma: Gen­er­al­ity, Agency, Align­ment.

Black FlagJan 15, 2025, 6:52 PM
7 points
0 comments2 min readLW link

Myths about Non­d­u­al­ity and Science by Gary Weber

Vadim GolubJan 15, 2025, 6:33 PM
2 points
0 comments23 min readLW link

Marx and the Machine

DALJan 15, 2025, 6:33 PM
5 points
2 comments9 min readLW link

Code4Com­pas­sion 2025: a hackathon trans­form­ing an­i­mal ad­vo­cacy through technology

superbeneficiaryJan 15, 2025, 6:31 PM
2 points
0 comments1 min readLW link

Ap­pli­ca­tions Open for the Co­op­er­a­tive AI Sum­mer School 2025!

JesseCliftonJan 15, 2025, 6:16 PM
7 points
0 comments1 min readLW link

List of AI safety pa­pers from com­pa­nies, 2023–2024

Zach Stein-PerlmanJan 15, 2025, 6:00 PM
11 points
0 comments1 min readLW link

AI Align­ment Meme Viruses

RationalDinoJan 15, 2025, 3:55 PM
4 points
0 comments2 min readLW link

Look­ing for hu­man­ness in the world wide social

Itay DreyfusJan 15, 2025, 2:50 PM
11 points
0 comments6 min readLW link
(productidentity.co)

On the OpenAI Eco­nomic Blueprint

ZviJan 15, 2025, 2:30 PM
81 points
2 comments9 min readLW link
(thezvi.wordpress.com)

A prob­lem shared by many differ­ent al­ign­ment targets

ThomasCederborgJan 15, 2025, 2:22 PM
12 points
18 comments36 min readLW link

LLMs for lan­guage learning

BenquoJan 15, 2025, 2:08 PM
10 points
2 comments7 min readLW link
(benjaminrosshoffman.com)

Fea­ture re­quest: com­ment bookmarks

dirkJan 15, 2025, 6:45 AM
18 points
2 comments1 min readLW link

How do fic­tional sto­ries illus­trate AI mis­al­ign­ment?

Jan 15, 2025, 6:11 AM
13 points
4 comments2 min readLW link
(aisafety.info)

We prob­a­bly won’t just play sta­tus games with each other af­ter AGI

Matthew BarnettJan 15, 2025, 4:56 AM
93 points
21 comments4 min readLW link

In­fer­ence-Time-Com­pute: More Faith­ful? A Re­search Note

Jan 15, 2025, 4:43 AM
69 points
10 comments11 min readLW link

Vol­un­tary Salary Reduction

jefftkJan 15, 2025, 3:40 AM
37 points
2 comments1 min readLW link
(www.jefftk.com)

[Question] Where should one post to get into the train­ing data?

keltanJan 15, 2025, 12:41 AM
11 points
5 comments1 min readLW link

Pre­dict 2025 AI ca­pa­bil­ities (by Sun­day)

Jan 15, 2025, 12:16 AM
54 points
3 comments1 min readLW link

Lec­ture Series on Tiling Agents

abramdemskiJan 14, 2025, 9:34 PM
38 points
14 comments1 min readLW link

Is AI Phys­i­cal?

Lauren GreenspanJan 14, 2025, 9:21 PM
23 points
6 comments7 min readLW link

Her­i­ta­bil­ity: Five Battles

Steven ByrnesJan 14, 2025, 6:21 PM
88 points
23 comments60 min readLW link

The Philo­soph­i­cal Glos­sary of AI

David GrossJan 14, 2025, 5:36 PM
11 points
0 comments1 min readLW link
(www.aiglossary.co.uk)

I’m offer­ing free math con­sul­ta­tions!

GurkenglasJan 14, 2025, 4:30 PM
80 points
7 comments1 min readLW link

Why aban­don “prob­a­bil­ity is in the mind” when it comes to quan­tum dy­nam­ics?

Maxwell PetersonJan 14, 2025, 3:53 PM
20 points
17 comments1 min readLW link

How do you deal w/​ Su­per Stim­uli?

Logan RiggsJan 14, 2025, 3:14 PM
106 points
25 comments3 min readLW link

curate

technicalitiesJan 14, 2025, 2:40 PM
12 points
0 comments2 min readLW link