Emo­tional Superrationality

nullproxyJan 2, 2025, 10:54 PM
−6 points
4 comments11 min readLW link

Play­ing with Otamatones

jefftkJan 2, 2025, 7:50 PM
12 points
0 comments1 min readLW link
(www.jefftk.com)

7. Iter­ate the Game: Rac­ing Where?

Allison DuettmannJan 2, 2025, 7:06 PM
11 points
0 comments9 min readLW link

6. In­crease In­tel­li­gence: Wel­come AI Players

Allison DuettmannJan 2, 2025, 7:06 PM
6 points
1 comment19 min readLW link

5. Uphold Vol­un­tarism: Digi­tal Defense

Allison DuettmannJan 2, 2025, 7:05 PM
3 points
0 comments18 min readLW link

4. Uphold Vol­un­tarism: Phys­i­cal Defense

Allison DuettmannJan 2, 2025, 7:04 PM
6 points
2 comments23 min readLW link

3. Im­prove Co­op­er­a­tion: Bet­ter Technologies

Allison DuettmannJan 2, 2025, 7:03 PM
4 points
2 comments23 min readLW link

2. Skim the Man­ual: In­tel­li­gent Vol­un­tary Cooperation

Allison DuettmannJan 2, 2025, 7:02 PM
13 points
3 comments18 min readLW link

1. Meet the Play­ers: Value Diversity

Allison DuettmannJan 2, 2025, 7:00 PM
32 points
2 comments11 min readLW link

Preface

Allison DuettmannJan 2, 2025, 6:59 PM
26 points
2 comments7 min readLW link

The AI Agent Revolu­tion: Beyond the Hype of 2025

DimaGJan 2, 2025, 6:55 PM
−7 points
1 comment28 min readLW link

On False Dichotomies

nullproxyJan 2, 2025, 6:54 PM
−3 points
0 comments5 min readLW link

Prefer­ence Inversion

BenquoJan 2, 2025, 6:15 PM
51 points
48 comments4 min readLW link
(benjaminrosshoffman.com)

Align­ment Is Not All You Need

Adam JonesJan 2, 2025, 5:50 PM
43 points
10 comments6 min readLW link
(adamjones.me)

What’s the short timeline plan?

Marius HobbhahnJan 2, 2025, 2:59 PM
352 points
49 comments23 min readLW link

AI #97: 4

ZviJan 2, 2025, 2:10 PM
45 points
4 comments40 min readLW link
(thezvi.wordpress.com)

[Question] Can pri­vate com­pa­nies test LVTs?

Yair HalberstadtJan 2, 2025, 11:08 AM
7 points
0 comments1 min readLW link

Gram­mars, sub­gram­mars, and com­bi­na­torics of gen­er­al­iza­tion in transformers

Dmitry VaintrobJan 2, 2025, 9:37 AM
36 points
0 comments17 min readLW link

[Question] 2025 Align­ment Predictions

anagumaJan 2, 2025, 5:37 AM
3 points
3 comments1 min readLW link

Grad­ing my 2024 AI predictions

Nikola JurkovicJan 2, 2025, 5:01 AM
19 points
1 comment3 min readLW link

Prac­tic­ing Bayesian Episte­mol­ogy with “Two Boys” Prob­a­bil­ity Puzzles

LironJan 2, 2025, 4:42 AM
43 points
14 comments6 min readLW link

Im­pli­ca­tions of Mo­ral Real­ism on AI Safety

Myles HJan 2, 2025, 2:58 AM
7 points
1 comment3 min readLW link

Read The Se­quences As If They Were Writ­ten Today

Peter BerggrenJan 2, 2025, 2:51 AM
63 points
7 comments4 min readLW link

A Col­lec­tion of Em­piri­cal Frames about Lan­guage Models

Daniel TanJan 2, 2025, 2:49 AM
27 points
0 comments3 min readLW link

My Jan­uary al­ign­ment the­ory Nanowrimo

Dmitry VaintrobJan 2, 2025, 12:07 AM
42 points
2 comments2 min readLW link

In­tranasal mRNA Vac­cines?

J BostockJan 1, 2025, 11:46 PM
26 points
2 comments3 min readLW link

Ex­am­ple of GPU-ac­cel­er­ated sci­en­tific com­put­ing with PyTorch

TahpJan 1, 2025, 11:01 PM
6 points
0 comments6 min readLW link
(passwordpaper.com)

Eco­nomic Post-ASI Transition

Joel BurgetJan 1, 2025, 10:37 PM
20 points
11 comments1 min readLW link

2024 in AI predictions

jessicataJan 1, 2025, 8:29 PM
117 points
3 comments8 min readLW link

Ap­proaches to Group Singing

jefftkJan 1, 2025, 12:50 PM
12 points
1 comment3 min readLW link
(www.jefftk.com)

Alien­able (not Inalien­able) Right to Buy

FlorianHJan 1, 2025, 12:19 PM
7 points
6 comments4 min readLW link

AGI is what gen­er­ates evolu­tion­ar­ily fit and novel information

onurJan 1, 2025, 9:22 AM
1 point
0 comments6 min readLW link
(solmaz.io)

The OODA Loop—Ob­serve, Ori­ent, De­cide, Act

Davis_KingsleyJan 1, 2025, 8:00 AM
53 points
2 comments11 min readLW link

Com­ment on “Death and the Gor­gon”

Zack_M_DavisJan 1, 2025, 5:47 AM
103 points
33 comments8 min readLW link

Fire­place and Can­dle Smoke

jefftkJan 1, 2025, 1:50 AM
36 points
4 comments1 min readLW link
(www.jefftk.com)

Merry Science­mas: A Rat Sols­tice Retrospective

leebriskCyranoJan 1, 2025, 1:08 AM
−8 points
0 comments1 min readLW link
(leebriskcyrano.com)

Riffing on Machines of Lov­ing Grace

an1lamJan 1, 2025, 1:06 AM
9 points
0 comments1 min readLW link
(an1lam.substack.com)

new chi­nese stealth aircraft

bhauthJan 1, 2025, 12:19 AM
58 points
3 comments6 min readLW link
(bhauth.com)

The Roots of Progress 2024 in review

jasoncrawfordJan 1, 2025, 12:02 AM
27 points
0 comments11 min readLW link
(newsletter.rootsofprogress.org)

Genesis

PeterMcCluskeyDec 31, 2024, 10:01 PM
18 points
0 comments2 min readLW link
(bayesianinvestor.com)

Fa­vorite col­ors of some LLMs.

CanalettoDec 31, 2024, 9:22 PM
10 points
3 comments7 min readLW link

My AGI safety re­search—2024 re­view, ’25 plans

Steven ByrnesDec 31, 2024, 9:05 PM
109 points
4 comments8 min readLW link

How Busi­ness Solved (?) the Hu­man Align­ment Problem

Gianluca CalcagniDec 31, 2024, 8:39 PM
−2 points
1 comment8 min readLW link

Tur­ing-Test-Pass­ing AI im­plies Aligned AI

RokoDec 31, 2024, 7:59 PM
−1 points
29 comments5 min readLW link

DeekSeek v3: The Six Million Dol­lar Model

ZviDec 31, 2024, 3:10 PM
50 points
6 comments14 min readLW link
(thezvi.wordpress.com)

I Recom­mend More Train­ing Rationales

Gianluca CalcagniDec 31, 2024, 2:06 PM
2 points
0 comments6 min readLW link

The Plan − 2024 Update

johnswentworthDec 31, 2024, 1:29 PM
117 points
28 comments4 min readLW link

Zom­bies among us

Declan MolonyDec 31, 2024, 5:14 AM
12 points
4 comments2 min readLW link

So you want to be a witch

lucid_levi_ackermanDec 31, 2024, 4:31 AM
−32 points
3 comments28 min readLW link

Two Weeks Without Sweets

jefftkDec 31, 2024, 3:30 AM
31 points
0 comments2 min readLW link
(www.jefftk.com)