How i’m building my ai system, how it’s going so far, and my thoughts on it

ollie_ 4 Jan 2025 18:20 UTC
−9 points
3 comments · 5 min read · LW link

Parkinson’s Law and the Ideology of Statistics

Benquo 4 Jan 2025 15:49 UTC
130 points
7 comments · 8 min read · LW link
(benjaminrosshoffman.com)

The Laws of Large Numbers

Dmitry Vaintrob 4 Jan 2025 11:54 UTC
38 points
11 comments · 12 min read · LW link

The Golden Opportunity for American AI

Annapurna 4 Jan 2025 10:26 UTC
2 points
8 comments · 1 min read · LW link
(blogs.microsoft.com)

A Generalization of the Good Regulator Theorem

Alfred Harwood 4 Jan 2025 9:55 UTC
20 points
6 comments · 10 min read · LW link

Logic vs intuition ⇔ algorithm vs ML

pchvykov 4 Jan 2025 9:06 UTC
5 points
0 comments · 7 min read · LW link

debating buying NVDA in 2019

bhauth 4 Jan 2025 5:06 UTC
27 points
3 comments · 3 min read · LW link
(bhauth.com)

Making progress bars for Alignment

Kabir Kumar 3 Jan 2025 21:25 UTC
2 points
0 comments · 1 min read · LW link
(lu.ma)

The Intelligence Curse

lukedrago 3 Jan 2025 19:07 UTC
142 points
27 comments · 18 min read · LW link
(lukedrago.substack.com)

Introducing Squiggle AI

ozziegooen 3 Jan 2025 17:53 UTC
92 points
15 comments · 8 min read · LW link

Human study on AI spear phishing campaigns

3 Jan 2025 15:11 UTC
81 points
8 comments · 5 min read · LW link

Mearsheimer’s Double Standard: Realism for Russia, Idealism for Israel

Ghdz 3 Jan 2025 13:52 UTC
−15 points
2 comments · 4 min read · LW link

The subset parity learning problem: much more than you wanted to know

Dmitry Vaintrob 3 Jan 2025 9:13 UTC
95 points
18 comments · 11 min read · LW link

Building AI safety benchmark environments on themes of universal human values

Roland Pihlakas 3 Jan 2025 4:24 UTC
18 points
3 comments · 8 min read · LW link
(docs.google.com)

Emotional Superrationality

nullproxy 2 Jan 2025 22:54 UTC
−6 points
4 comments · 11 min read · LW link

Playing with Otamatones

jefftk 2 Jan 2025 19:50 UTC
12 points
0 comments · 1 min read · LW link
(www.jefftk.com)

7. Iterate the Game: Racing Where?

Allison Duettmann 2 Jan 2025 19:06 UTC
11 points
0 comments · 9 min read · LW link

6. Increase Intelligence: Welcome AI Players

Allison Duettmann 2 Jan 2025 19:06 UTC
6 points
1 comment · 19 min read · LW link

5. Uphold Voluntarism: Digital Defense

Allison Duettmann 2 Jan 2025 19:05 UTC
3 points
0 comments · 18 min read · LW link

4. Uphold Voluntarism: Physical Defense

Allison Duettmann 2 Jan 2025 19:04 UTC
6 points
2 comments · 23 min read · LW link

3. Improve Cooperation: Better Technologies

Allison Duettmann 2 Jan 2025 19:03 UTC
4 points
2 comments · 23 min read · LW link

2. Skim the Manual: Intelligent Voluntary Cooperation

Allison Duettmann 2 Jan 2025 19:02 UTC
13 points
3 comments · 18 min read · LW link

1. Meet the Players: Value Diversity

Allison Duettmann 2 Jan 2025 19:00 UTC
32 points
2 comments · 11 min read · LW link

Preface

Allison Duettmann 2 Jan 2025 18:59 UTC
31 points
2 comments · 7 min read · LW link

The AI Agent Revolution: Beyond the Hype of 2025

DimaG 2 Jan 2025 18:55 UTC
−7 points
1 comment · 28 min read · LW link

On False Dichotomies

nullproxy 2 Jan 2025 18:54 UTC
−3 points
0 comments · 5 min read · LW link

Preference Inversion

Benquo 2 Jan 2025 18:15 UTC
53 points
48 comments · 4 min read · LW link
(benjaminrosshoffman.com)

Alignment Is Not All You Need

Adam Jones 2 Jan 2025 17:50 UTC
43 points
10 comments · 6 min read · LW link
(adamjones.me)

What’s the short timeline plan?

Marius Hobbhahn 2 Jan 2025 14:59 UTC
361 points
51 comments · 23 min read · LW link

AI #97: 4

Zvi 2 Jan 2025 14:10 UTC
45 points
4 comments · 40 min read · LW link
(thezvi.wordpress.com)

[Question] Can private companies test LVTs?

Yair Halberstadt 2 Jan 2025 11:08 UTC
7 points
0 comments · 1 min read · LW link

Grammars, subgrammars, and combinatorics of generalization in transformers

Dmitry Vaintrob 2 Jan 2025 9:37 UTC
36 points
0 comments · 17 min read · LW link

[Question] 2025 Alignment Predictions

anaguma 2 Jan 2025 5:37 UTC
3 points
3 comments · 1 min read · LW link

Grading my 2024 AI predictions

Nikola Jurkovic 2 Jan 2025 5:01 UTC
19 points
1 comment · 3 min read · LW link

Practicing Bayesian Epistemology with “Two Boys” Probability Puzzles

Liron 2 Jan 2025 4:42 UTC
43 points
14 comments · 6 min read · LW link

Implications of Moral Realism on AI Safety

Myles H 2 Jan 2025 2:58 UTC
7 points
1 comment · 3 min read · LW link

Read The Sequences As If They Were Written Today

Peter Berggren 2 Jan 2025 2:51 UTC
65 points
7 comments · 4 min read · LW link

A Collection of Empirical Frames about Language Models

Daniel Tan 2 Jan 2025 2:49 UTC
27 points
0 comments · 3 min read · LW link

My January alignment theory Nanowrimo

Dmitry Vaintrob 2 Jan 2025 0:07 UTC
42 points
2 comments · 2 min read · LW link

Intranasal mRNA Vaccines?

J Bostock 1 Jan 2025 23:46 UTC
26 points
2 comments · 3 min read · LW link

Example of GPU-accelerated scientific computing with PyTorch

Tahp 1 Jan 2025 23:01 UTC
6 points
0 comments · 6 min read · LW link
(passwordpaper.com)

Economic Post-ASI Transition

Joel Burget 1 Jan 2025 22:37 UTC
19 points
11 comments · 1 min read · LW link

2024 in AI predictions

jessicata 1 Jan 2025 20:29 UTC
125 points
3 comments · 8 min read · LW link

Approaches to Group Singing

jefftk 1 Jan 2025 12:50 UTC
12 points
1 comment · 3 min read · LW link
(www.jefftk.com)

Alienable (not Inalienable) Right to Buy

FlorianH 1 Jan 2025 12:19 UTC
9 points
6 comments · 4 min read · LW link

AGI is what generates evolutionarily fit and novel information

onur 1 Jan 2025 9:22 UTC
1 point
0 comments · 6 min read · LW link
(solmaz.io)

The OODA Loop - Observe, Orient, Decide, Act

Davis_Kingsley 1 Jan 2025 8:00 UTC
55 points
2 comments · 11 min read · LW link

Comment on “Death and the Gorgon”

Zack_M_Davis 1 Jan 2025 5:47 UTC
106 points
35 comments · 8 min read · LW link

Fireplace and Candle Smoke

jefftk 1 Jan 2025 1:50 UTC
36 points
4 comments · 1 min read · LW link
(www.jefftk.com)

Merry Sciencemas: A Rat Solstice Retrospective

leebriskCyrano 1 Jan 2025 1:08 UTC
−8 points
0 comments · 1 min read · LW link
(leebriskcyrano.com)