[Cross-post] Wel­come to the Es­say Meta

davekasten16 Jan 2025 23:36 UTC
14 points
2 comments8 min readLW link

AI for Re­solv­ing Fore­cast­ing Ques­tions: An Early Exploration

ozziegooen16 Jan 2025 21:41 UTC
10 points
2 comments9 min readLW link

[Question] How Do You In­ter­pret the Goal of LessWrong and Its Com­mu­nity?

ashen846116 Jan 2025 19:08 UTC
−2 points
2 comments1 min readLW link

Ex­perts’ AI timelines are longer than you have been told?

Vasco Grilo16 Jan 2025 18:03 UTC
10 points
4 comments3 min readLW link
(bayes.net)

Num­ber­wang: LLMs Do­ing Au­tonomous Re­search, and a Call for Input

16 Jan 2025 17:20 UTC
71 points
30 comments31 min readLW link

Topolog­i­cal De­bate Framework

lunatic_at_large16 Jan 2025 17:19 UTC
10 points
5 comments9 min readLW link

AI #99: Farewell to Biden

Zvi16 Jan 2025 14:20 UTC
54 points
5 comments58 min readLW link
(thezvi.wordpress.com)

De­cep­tive Align­ment and Homuncularity

16 Jan 2025 13:55 UTC
26 points
12 comments22 min readLW link

In­tro­duc­ing the WeirdML Benchmark

Håvard Tveit Ihle16 Jan 2025 11:38 UTC
57 points
13 comments11 min readLW link

The Math­e­mat­i­cal Rea­son You should have 9 Kids

Zero Contradictions16 Jan 2025 11:24 UTC
−9 points
6 comments1 min readLW link
(eternalanglo.com)

Quan­tum with­out complication

16 Jan 2025 8:53 UTC
30 points
2 comments10 min readLW link

Per­ma­nents: much more than you wanted to know

Dmitry Vaintrob16 Jan 2025 8:04 UTC
17 points
2 comments15 min readLW link

Gam­ing Truth­fulQA: Sim­ple Heuris­tics Ex­posed Dataset Weaknesses

TurnTrout16 Jan 2025 2:14 UTC
65 points
3 comments1 min readLW link
(turntrout.com)

What Is The Align­ment Prob­lem?

johnswentworth16 Jan 2025 1:20 UTC
181 points
49 comments25 min readLW link

Im­prov­ing Our Safety Cases Us­ing Up­per and Lower Bounds

Yonatan Cale16 Jan 2025 0:01 UTC
23 points
0 comments3 min readLW link

Un­reg­u­lated Pep­tides: Does BPC-157 hold its promises?

ChristianKl15 Jan 2025 23:36 UTC
28 points
7 comments4 min readLW link

New, im­proved mul­ti­ple-choice TruthfulQA

15 Jan 2025 23:32 UTC
72 points
1 comment3 min readLW link

The Differ­ence Between Pre­dic­tion Mar­kets and De­bate (Ar­gu­ment) Maps

Jamie Joyce15 Jan 2025 23:19 UTC
7 points
3 comments3 min readLW link

A Novel Emer­gence of Meta-Aware­ness in LLM Fine-Tuning

rife15 Jan 2025 22:59 UTC
57 points
32 comments2 min readLW link

Six Small Co­hab­itive Games

Screwtape15 Jan 2025 21:59 UTC
40 points
7 comments13 min readLW link

LLMs are re­ally good at k-or­der think­ing (where k is even)

charlieoneill15 Jan 2025 20:43 UTC
7 points
0 comments2 min readLW link

Every­where I Look, I See Kat Woods

just_browsing15 Jan 2025 19:29 UTC
19 points
45 comments5 min readLW link

[un­ti­tled post]

Emre15 Jan 2025 18:52 UTC
−1 points
0 comments1 min readLW link

“Pick Two” AI Trilemma: Gen­er­al­ity, Agency, Align­ment.

Black Flag15 Jan 2025 18:52 UTC
7 points
0 comments2 min readLW link

Myths about Non­d­u­al­ity and Science by Gary Weber

Vadim Golub15 Jan 2025 18:33 UTC
2 points
0 comments23 min readLW link

Marx and the Machine

DAL15 Jan 2025 18:33 UTC
5 points
2 comments9 min readLW link

Code4Com­pas­sion 2025: a hackathon trans­form­ing an­i­mal ad­vo­cacy through technology

superbeneficiary15 Jan 2025 18:31 UTC
3 points
0 comments1 min readLW link

Ap­pli­ca­tions Open for the Co­op­er­a­tive AI Sum­mer School 2025!

JesseClifton15 Jan 2025 18:16 UTC
7 points
0 comments1 min readLW link

List of AI safety pa­pers from com­pa­nies, 2023–2024

Zach Stein-Perlman15 Jan 2025 18:00 UTC
11 points
0 comments1 min readLW link

AI Align­ment Meme Viruses

RationalDino15 Jan 2025 15:55 UTC
5 points
0 comments2 min readLW link

Look­ing for hu­man­ness in the world wide social

Itay Dreyfus15 Jan 2025 14:50 UTC
11 points
0 comments6 min readLW link
(productidentity.co)

On the OpenAI Eco­nomic Blueprint

Zvi15 Jan 2025 14:30 UTC
81 points
2 comments9 min readLW link
(thezvi.wordpress.com)

A prob­lem shared by many differ­ent al­ign­ment targets

ThomasCederborg15 Jan 2025 14:22 UTC
13 points
18 comments36 min readLW link

LLMs for lan­guage learning

Benquo15 Jan 2025 14:08 UTC
10 points
2 comments7 min readLW link
(benjaminrosshoffman.com)

Fea­ture re­quest: com­ment bookmarks

dirk15 Jan 2025 6:45 UTC
18 points
2 comments1 min readLW link

How do fic­tional sto­ries illus­trate AI mis­al­ign­ment?

15 Jan 2025 6:11 UTC
13 points
4 comments2 min readLW link
(aisafety.info)

We prob­a­bly won’t just play sta­tus games with each other af­ter AGI

Matthew Barnett15 Jan 2025 4:56 UTC
97 points
21 comments4 min readLW link

In­fer­ence-Time-Com­pute: More Faith­ful? A Re­search Note

15 Jan 2025 4:43 UTC
69 points
10 comments11 min readLW link

Vol­un­tary Salary Reduction

jefftk15 Jan 2025 3:40 UTC
37 points
2 comments1 min readLW link
(www.jefftk.com)

[Question] Where should one post to get into the train­ing data?

keltan15 Jan 2025 0:41 UTC
11 points
5 comments1 min readLW link

Pre­dict 2025 AI ca­pa­bil­ities (by Sun­day)

15 Jan 2025 0:16 UTC
55 points
3 comments1 min readLW link

Lec­ture Series on Tiling Agents

abramdemski14 Jan 2025 21:34 UTC
38 points
14 comments1 min readLW link

Is AI Phys­i­cal?

Lauren Greenspan14 Jan 2025 21:21 UTC
23 points
6 comments7 min readLW link

Her­i­ta­bil­ity: Five Battles

Steven Byrnes14 Jan 2025 18:21 UTC
94 points
23 comments60 min readLW link

The Philo­soph­i­cal Glos­sary of AI

David Gross14 Jan 2025 17:36 UTC
11 points
0 comments1 min readLW link
(www.aiglossary.co.uk)

I’m offer­ing free math con­sul­ta­tions!

Gurkenglas14 Jan 2025 16:30 UTC
83 points
7 comments1 min readLW link

Why aban­don “prob­a­bil­ity is in the mind” when it comes to quan­tum dy­nam­ics?

Maxwell Peterson14 Jan 2025 15:53 UTC
23 points
24 comments1 min readLW link

How do you deal w/​ Su­per Stim­uli?

Logan Riggs14 Jan 2025 15:14 UTC
112 points
25 comments3 min readLW link

curate

technicalities14 Jan 2025 14:40 UTC
12 points
0 comments2 min readLW link

Our new video about goal mis­gen­er­al­iza­tion, plus an apology

Writer14 Jan 2025 14:07 UTC
33 points
0 comments7 min readLW link
(youtu.be)