Far­mKind’s Illu­sory Offer

jefftkAug 9, 2024, 11:30 AM
71 points
5 comments3 min readLW link
(www.jefftk.com)

Please do not use AI to write for you

Richard_KennawayAug 21, 2024, 9:53 AM
69 points
34 comments4 min readLW link

What is it to solve the al­ign­ment prob­lem? (Notes)

Joe CarlsmithAug 24, 2024, 9:19 PM
69 points
18 comments53 min readLW link

The Hes­sian rank bounds the learn­ing coefficient

Lucius BushnaqAug 8, 2024, 8:55 PM
68 points
10 comments4 min readLW link

Show­ing SAE La­tents Are Not Atomic Us­ing Meta-SAEs

Aug 24, 2024, 12:56 AM
68 points
10 comments20 min readLW link

GPT-4o Sys­tem Card

Zach Stein-PerlmanAug 8, 2024, 8:30 PM
68 points
11 comments2 min readLW link
(openai.com)

AI #79: Ready for Some Football

ZviAug 29, 2024, 1:30 PM
68 points
16 comments32 min readLW link
(thezvi.wordpress.com)

Why Large Bureau­cratic Or­ga­ni­za­tions?

johnswentworthAug 27, 2024, 6:30 PM
68 points
52 comments12 min readLW link

The eco­nomics of space tethers

harsimonyAug 22, 2024, 4:15 PM
67 points
22 comments7 min readLW link
(splittinginfinity.substack.com)

Fear of cen­tral­ized power vs. fear of mis­al­igned AGI: Vi­talik Bu­terin on 80,000 Hours

Seth HerdAug 5, 2024, 3:38 PM
66 points
22 comments5 min readLW link

A primer on why com­pu­ta­tional pre­dic­tive tox­i­col­ogy is hard

Abhishaike MahajanAug 19, 2024, 5:16 PM
63 points
2 comments12 min readLW link
(www.owlposting.com)

In­ter­dic­tor Ship

lsusrAug 19, 2024, 4:59 AM
63 points
9 comments7 min readLW link

Ou­trage Bonding

Jonathan MoregårdAug 9, 2024, 1:46 PM
63 points
12 comments2 min readLW link
(honestliving.substack.com)

Ra­tion­al­ists are miss­ing a core piece for agent-like struc­ture (en­ergy vs in­for­ma­tion over­load)

tailcalledAug 17, 2024, 9:57 AM
62 points
9 comments4 min readLW link

AI #78: Some Wel­come Calm

ZviAug 22, 2024, 2:20 PM
61 points
15 comments33 min readLW link
(thezvi.wordpress.com)

Self-ex­plain­ing SAE features

Aug 5, 2024, 10:20 PM
60 points
13 comments10 min readLW link

… Wait, our mod­els of se­man­tics should in­form fluid me­chan­ics?!?

Aug 26, 2024, 4:38 PM
59 points
18 comments4 min readLW link

An­nounc­ing the $200k EA Com­mu­nity Choice

Austin ChenAug 14, 2024, 12:39 AM
58 points
8 commentsLW link
(manifund.substack.com)

Con­gres­sional In­sider Trading

Maxwell TabarrokAug 30, 2024, 1:32 PM
57 points
6 comments7 min readLW link
(www.maximum-progress.com)

You’re a Space Wizard, Luke

lsusrAug 18, 2024, 5:35 AM
57 points
6 comments2 min readLW link

Referen­dum Me­chan­ics in a Mar­ket­place of Ideas

Martin SustrikAug 25, 2024, 8:30 AM
57 points
2 comments5 min readLW link
(250bpm.substack.com)

The Bit­ter Les­son for AI Safety Research

Aug 2, 2024, 6:39 PM
57 points
5 comments3 min readLW link

Some Unortho­dox Ways To Achieve High GDP Growth

Aug 8, 2024, 6:58 PM
57 points
6 comments6 min readLW link

John Schul­man leaves OpenAI for An­thropic [and then left An­thropic again for Think­ing Machines]

SodiumAug 6, 2024, 1:23 AM
57 points
0 comments1 min readLW link

Mea­sur­ing Struc­ture Devel­op­ment in Al­gorith­mic Transformers

Aug 22, 2024, 8:38 AM
56 points
4 comments11 min readLW link

Thiel on AI & Rac­ing with China

Ben PaceAug 20, 2024, 3:19 AM
55 points
10 comments12 min readLW link

Demis Hass­abis — Google Deep­Mind: The Podcast

Zach Stein-PerlmanAug 16, 2024, 12:00 AM
55 points
8 comments3 min readLW link
(www.youtube.com)

Owain Evans on Si­tu­a­tional Aware­ness and Out-of-Con­text Rea­son­ing in LLMs

Michaël TrazziAug 24, 2024, 4:30 AM
55 points
0 comments5 min readLW link

[LDSL#0] Some episte­molog­i­cal conundrums

tailcalledAug 7, 2024, 7:52 PM
54 points
11 comments10 min readLW link

Prov­ably Safe AI: Wor­ld­view and Projects

Aug 9, 2024, 11:21 PM
54 points
44 comments7 min readLW link

Cal­en­dar fea­ture ge­om­e­try in GPT-2 layer 8 resi­d­ual stream SAEs

Aug 17, 2024, 1:16 AM
53 points
0 comments5 min readLW link

Ex­tended In­ter­view with Zhu­keepa on Religion

Aug 18, 2024, 3:19 AM
53 points
61 comments119 min readLW link

AI Rights for Hu­man Safety

Simon GoldsteinAug 1, 2024, 11:01 PM
53 points
6 comments1 min readLW link
(papers.ssrn.com)

AI #76: Six Shorts Sto­ries About OpenAI

ZviAug 8, 2024, 1:50 PM
53 points
10 comments48 min readLW link
(thezvi.wordpress.com)

Rewil­d­ing the Gut VS the Au­toim­mune Epidemic

GGDAug 16, 2024, 6:00 PM
51 points
0 comments3 min readLW link

De­ci­sion The­ory in Space

lsusrAug 18, 2024, 7:02 AM
50 points
18 comments2 min readLW link

In­ter­op­er­a­ble High Level Struc­tures: Early Thoughts on Adjectives

Aug 22, 2024, 9:12 PM
49 points
1 comment7 min readLW link

SRE’s re­view of Democracy

Martin SustrikAug 3, 2024, 7:20 AM
48 points
2 comments3 min readLW link
(250bpm.substack.com)

What’s im­por­tant in “AI for epistemics”?

Lukas FinnvedenAug 24, 2024, 1:27 AM
48 points
0 comments28 min readLW link
(www.forethought.org)

Trust­wor­thy and un­trust­wor­thy models

Olli JärviniemiAug 19, 2024, 4:27 PM
47 points
3 comments8 min readLW link

All The Lat­est Hu­man tFUS Studies

sarahconstantinAug 9, 2024, 10:20 PM
46 points
2 comments8 min readLW link
(sarahconstantin.substack.com)

Hu­man­ity isn’t re­motely longter­mist, so ar­gu­ments for AGI x-risk should fo­cus on the near term

Seth Herd12 Aug 2024 18:10 UTC
46 points
10 comments1 min readLW link

We’re not as 3-Di­men­sional as We Think

silentbob4 Aug 2024 14:39 UTC
46 points
17 comments5 min readLW link

How to hire some­body bet­ter than yourself

lemonhope28 Aug 2024 8:12 UTC
46 points
5 comments5 min readLW link

AI #75: Math is Easier

Zvi1 Aug 2024 13:40 UTC
46 points
25 comments72 min readLW link
(thezvi.wordpress.com)

Prin­ci­pled Satis­fic­ing To Avoid Goodhart

JenniferRM16 Aug 2024 19:05 UTC
45 points
2 comments8 min readLW link

Startup Roundup #2

Zvi6 Aug 2024 13:30 UTC
45 points
0 comments32 min readLW link
(thezvi.wordpress.com)

Case Study: In­ter­pret­ing, Ma­nipu­lat­ing, and Con­trol­ling CLIP With Sparse Autoencoders

Gytis Daujotas1 Aug 2024 21:08 UTC
45 points
7 comments7 min readLW link

[Question] “De­cep­tion Genre” What Books are like Pro­ject Lawful?

Double28 Aug 2024 17:19 UTC
45 points
20 comments1 min readLW link

In defense of tech­nolog­i­cal un­em­ploy­ment as the main AI concern

tailcalled27 Aug 2024 17:58 UTC
44 points
36 comments1 min readLW link