Panology

JenniferRM · Dec 23, 2024, 9:40 PM
17 points
10 comments · 5 min read

Aristotle, Aquinas, and the Evolution of Teleology: From Purpose to Meaning.

Spiritus Dei · Dec 23, 2024, 7:37 PM
−9 points
0 comments · 6 min read

People aren’t properly calibrated on FrontierMath

cakubilo · Dec 23, 2024, 7:35 PM
31 points
4 comments · 3 min read

Near- and medium-term AI Control Safety Cases

Martín Soto · Dec 23, 2024, 5:37 PM
9 points
0 comments · 6 min read

[Rationality Malaysia] 2024 year-end meetup!

Doris Liew · Dec 23, 2024, 4:02 PM
1 point
0 comments · 1 min read

Printable book of some rationalist creative writing (from Scott A. & Eliezer)

CounterBlunder · Dec 23, 2024, 3:44 PM
10 points
0 comments · 1 min read

Monthly Roundup #25: December 2024

Zvi · Dec 23, 2024, 2:20 PM
18 points
3 comments · 26 min read
(thezvi.wordpress.com)

Exploring the petertodd / Leilan duality in GPT-2 and GPT-J

mwatkins · Dec 23, 2024, 1:17 PM
12 points
1 comment · 17 min read

[Question] What are the strongest arguments for very short timelines?

Kaj_Sotala · Dec 23, 2024, 9:38 AM
101 points
79 comments · 1 min read

Reduce AI Self-Allegiance by saying “he” instead of “I”

Knight Lee · Dec 23, 2024, 9:32 AM
10 points
4 comments · 2 min read

Funding Case: AI Safety Camp 11

Dec 23, 2024, 8:51 AM
60 points
4 comments · 6 min read
(manifund.org)

What is compute governance?

Dec 23, 2024, 6:32 AM
6 points
0 comments · 2 min read
(aisafety.info)

Stop Making Sense

JenniferRM · Dec 23, 2024, 5:16 AM
16 points
0 comments · 3 min read

Hire (or Become) a Thinking Assistant

Raemon · Dec 23, 2024, 3:58 AM
138 points
49 comments · 8 min read

Non-Obvious Benefits of Insurance

jefftk · Dec 23, 2024, 3:40 AM
21 points
5 comments · 2 min read
(www.jefftk.com)

Vision of a positive Singularity

RussellThor · Dec 23, 2024, 2:19 AM
4 points
0 comments · 4 min read

Ideologies are slow and necessary, for now

Gabriel Alfour · Dec 23, 2024, 1:57 AM
15 points
1 comment · 1 min read
(cognition.cafe)

[Question] Has Anthropic checked if Claude fakes alignment for intended values too?

Maloew · Dec 23, 2024, 12:43 AM
4 points
1 comment · 1 min read

Vegans need to eat just enough Meat—emperically evaluate the minimum ammount of meat that maximizes utility

Johannes C. Mayer · Dec 22, 2024, 10:08 PM
55 points
35 comments · 3 min read

We are in a New Paradigm of AI Progress—OpenAI’s o3 model makes huge gains on the toughest AI benchmarks in the world

garrison · Dec 22, 2024, 9:45 PM
17 points
3 comments
(garrisonlovely.substack.com)

My AI timelines

samuelshadrach · Dec 22, 2024, 9:06 PM
12 points
2 comments · 5 min read
(samuelshadrach.com)

A breakdown of AI capability levels focused on AI R&D labor acceleration

ryan_greenblatt · Dec 22, 2024, 8:56 PM
104 points
6 comments · 6 min read

How I saved 1 human life (in expectation) without overthinking it

Christopher King · Dec 22, 2024, 8:53 PM
19 points
0 comments · 4 min read

Checking in on Scott’s composition image bet with imagen 3

Dave Orr · Dec 22, 2024, 7:04 PM
65 points
0 comments · 1 min read

Woloch & Wosatan

JackOfAllTrades · Dec 22, 2024, 3:46 PM
−11 points
0 comments · 2 min read

A primer on machine learning in cryo-electron microscopy (cryo-EM)

Abhishaike Mahajan · Dec 22, 2024, 3:11 PM
18 points
0 comments · 25 min read
(www.owlposting.com)

Notes from Copenhagen Secular Solstice 2024

Søren Elverlin · Dec 22, 2024, 3:08 PM
9 points
0 comments · 3 min read

Proof Explained for “Robust Agents Learn Causal World Model”

Dalcy · Dec 22, 2024, 3:06 PM
25 points
0 comments · 15 min read

subfunctional overlaps in attentional selection history implies momentum for decision-trajectories

Emrik · Dec 22, 2024, 2:12 PM
19 points
1 comment · 2 min read

It looks like there are some good funding opportunities in AI safety right now

Benjamin_Todd · Dec 22, 2024, 12:41 PM
20 points
0 comments · 4 min read
(benjamintodd.substack.com)

What o3 Becomes by 2028

Vladimir_Nesov · Dec 22, 2024, 12:37 PM
147 points
15 comments · 5 min read

The Alignment Simulator

Yair Halberstadt · Dec 22, 2024, 11:45 AM
28 points
3 comments · 2 min read
(yairhalberstadt.github.io)

Theoretical Alignment’s Second Chance

lunatic_at_large · Dec 22, 2024, 5:03 AM
27 points
3 comments · 2 min read

Orienting to 3 year AGI timelines

Nikola Jurkovic · Dec 22, 2024, 1:15 AM
281 points
51 comments · 8 min read

ARC-AGI is a genuine AGI test but o3 cheated :(

Knight Lee · Dec 22, 2024, 12:58 AM
3 points
6 comments · 2 min read

When AI 10x’s AI R&D, What Do We Do?

Logan Riggs · Dec 21, 2024, 11:56 PM
72 points
16 comments · 4 min read

AI as systems, not just models

Andy Arditi · Dec 21, 2024, 11:19 PM
28 points
0 comments · 7 min read
(andyrdt.com)

Towards a Unified Interpretability of Artificial and Biological Neural Networks

jan_bauer · Dec 21, 2024, 11:10 PM
2 points
0 comments · 1 min read

Robbin’s Farm Sledding Route

jefftk · Dec 21, 2024, 10:10 PM
13 points
1 comment · 1 min read
(www.jefftk.com)

AGI with RL is Bad News for Safety

Nadav Brandes · Dec 21, 2024, 7:36 PM
19 points
22 comments · 2 min read

Better difference-making views

MichaelStJules · Dec 21, 2024, 6:27 PM
7 points
0 comments

Review: Good Strategy, Bad Strategy

L Rudolf L · Dec 21, 2024, 5:17 PM
43 points
0 comments · 23 min read
(nosetgauge.substack.com)

Last Line of Defense: Minimum Viable Shelters for Mirror Bacteria

Ulrik Horn · Dec 21, 2024, 8:28 AM
12 points
26 comments · 21 min read

Elon Musk and Solar Futurism

transhumanist_atom_understander · Dec 21, 2024, 2:55 AM
32 points
27 comments · 5 min read

Good Reasons for Alts

jefftk · Dec 21, 2024, 1:30 AM
24 points
2 comments · 1 min read
(www.jefftk.com)

Updating on Bad Arguments

Guive · Dec 21, 2024, 1:19 AM
11 points
2 comments · 2 min read
(guive.substack.com)

Bird’s eye view: An interactive representation to see large collection of text “from above”.

Alexandre Variengien · Dec 21, 2024, 12:15 AM
10 points
4 comments · 5 min read
(alexandrevariengien.com)

The nihilism of NeurIPS

charlieoneill · Dec 20, 2024, 11:58 PM
107 points
6 comments · 4 min read

Forecast 2025 With Vox’s Future Perfect Team — $2,500 Prize Pool

ChristianWilliams · Dec 20, 2024, 11:00 PM
19 points
0 comments
(www.metaculus.com)

[Question] How do we quantify non-philanthropic contributions from Buffet and Soros?

Philosophistry · Dec 20, 2024, 10:50 PM
3 points
0 comments · 1 min read