We Need Ma­jor, But Not Rad­i­cal, FDA Reform

Maxwell TabarrokFeb 24, 2024, 4:54 PM
42 points
12 comments7 min readLW link
(www.maximum-progress.com)

Re­quire­ments for a Basin of At­trac­tion to Alignment

RogerDearnaleyFeb 14, 2024, 7:10 AM
41 points
12 comments31 min readLW link

The Poin­ter Re­s­olu­tion Problem

JozdienFeb 16, 2024, 9:25 PM
41 points
20 comments3 min readLW link

China-AI forecasts

NathanBarnardFeb 25, 2024, 4:49 PM
40 points
29 comments6 min readLW link

Tech­nolo­gies and Ter­minol­ogy: AI isn’t Soft­ware, it’s… Deep­ware?

Feb 13, 2024, 1:37 PM
40 points
10 comments8 min readLW link

Nat­u­ral ab­strac­tions are ob­server-de­pen­dent: a con­ver­sa­tion with John Wentworth

Martín SotoFeb 12, 2024, 5:28 PM
40 points
13 comments7 min readLW link

Choos­ing My Quest (Part 2 of “The Sense Of Phys­i­cal Ne­ces­sity”)

LoganStrohlFeb 24, 2024, 9:31 PM
40 points
7 comments12 min readLW link

“What if we could re­design so­ciety from scratch? The promise of char­ter cities.” [Ra­tional An­i­ma­tions video]

Jackson WagnerFeb 18, 2024, 12:57 AM
40 points
7 comments8 min readLW link
(www.youtube.com)

In­stru­men­tal de­cep­tion and ma­nipu­la­tion in LLMs—a case study

Olli JärviniemiFeb 24, 2024, 2:07 AM
39 points
13 comments12 min readLW link

Nitric ox­ide for covid and other viral infections

ElizabethFeb 7, 2024, 9:30 PM
39 points
6 comments6 min readLW link
(acesounderglass.com)

Lan­guage Models Don’t Learn the Phys­i­cal Man­i­fes­ta­tion of Language

Feb 22, 2024, 6:52 PM
39 points
23 comments1 min readLW link
(arxiv.org)

Tort Law Can Play an Im­por­tant Role in Miti­gat­ing AI Risk

Gabriel WeilFeb 12, 2024, 5:17 PM
39 points
9 comments5 min readLW link

The “con­text win­dow” anal­ogy for hu­man minds

RubyFeb 13, 2024, 7:29 PM
38 points
0 comments2 min readLW link

De­con­fus­ing In-Con­text Learning

Arjun PanicksseryFeb 25, 2024, 9:48 AM
37 points
1 comment2 min readLW link

AI #49: Bioweapon Test­ing Begins

ZviFeb 1, 2024, 3:30 PM
37 points
11 comments42 min readLW link
(thezvi.wordpress.com)

Drone Wars Endgame

RussellThorFeb 1, 2024, 2:30 AM
36 points
71 comments8 min readLW link

On Dwarkesh’s 3rd Pod­cast With Tyler Cowen

ZviFeb 2, 2024, 7:30 PM
36 points
9 comments21 min readLW link
(thezvi.wordpress.com)

A sketch of acausal trade in practice

Richard_NgoFeb 4, 2024, 12:32 AM
36 points
4 comments7 min readLW link

One True Love

ZviFeb 9, 2024, 3:10 PM
34 points
7 comments10 min readLW link
(thezvi.wordpress.com)

Difficulty classes for al­ign­ment properties

JozdienFeb 20, 2024, 9:08 AM
34 points
5 comments2 min readLW link

More on the Ap­ple Vi­sion Pro

ZviFeb 13, 2024, 5:40 PM
33 points
5 comments8 min readLW link
(thezvi.wordpress.com)

I played the AI box game as the Gate­keeper — and lost

datawitchFeb 12, 2024, 6:39 PM
33 points
54 comments4 min readLW link

FTX ex­pects to re­turn all cus­tomer money; claw­backs may go away

Mikhail SaminFeb 14, 2024, 3:43 AM
33 points
1 comment1 min readLW link
(www.nytimes.com)

Why you, per­son­ally, should want a larger hu­man population

jasoncrawfordFeb 23, 2024, 7:48 PM
32 points
32 comments5 min readLW link
(rootsofprogress.org)

The By­ronic Hero Always Loses

Cole WyethFeb 22, 2024, 1:31 AM
32 points
4 comments2 min readLW link

How I build and run be­hav­ioral interviews

benkuhnFeb 26, 2024, 5:50 AM
32 points
6 comments4 min readLW link
(www.benkuhn.net)

On Not Re­quiring Vaccination

jefftkFeb 1, 2024, 7:20 PM
31 points
21 comments1 min readLW link
(www.jefftk.com)

Map­ping the se­man­tic void II: Above, be­low and be­tween to­ken em­bed­dings

mwatkinsFeb 15, 2024, 11:00 PM
31 points
4 comments10 min readLW link

Ret­ro­spec­tive: PIBBSS Fel­low­ship 2023

Feb 16, 2024, 5:48 PM
31 points
1 comment8 min readLW link

Put­ting mul­ti­modal LLMs to the Tetris test

Feb 1, 2024, 4:02 PM
30 points
5 comments7 min readLW link

The Third Gemini

ZviFeb 20, 2024, 7:50 PM
30 points
2 comments9 min readLW link
(thezvi.wordpress.com)

In­ter­pret­ing Quan­tum Me­chan­ics in In­fra-Bayesian Physicalism

YegregFeb 12, 2024, 6:56 PM
30 points
6 comments43 min readLW link

Run­ning the Num­bers on a Heat Pump

jefftkFeb 9, 2024, 3:00 AM
30 points
12 comments4 min readLW link
(www.jefftk.com)

Abs-E (or, speak only in the pos­i­tive)

dkl9Feb 19, 2024, 9:14 PM
29 points
24 comments2 min readLW link
(dkl9.net)

[Question] Weigh­ing rep­u­ta­tional and moral con­se­quences of leav­ing Rus­sia or staying

spzaFeb 18, 2024, 7:36 PM
29 points
24 comments1 min readLW link

Au­dit­ing LMs with coun­ter­fac­tual search: a tool for con­trol and ELK

Jacob PfauFeb 20, 2024, 12:02 AM
28 points
6 comments10 min readLW link

flow­ing like wa­ter; hard like stone

Feb 20, 2024, 3:20 AM
27 points
4 comments4 min readLW link

The econ­omy is mostly newbs (strat pre­dic­tions)

lemonhopeFeb 1, 2024, 7:15 PM
27 points
6 comments2 min readLW link

Solv­ing al­ign­ment isn’t enough for a flour­ish­ing future

micFeb 2, 2024, 6:23 PM
27 points
0 comments22 min readLW link
(papers.ssrn.com)

Weak vs Quan­ti­ta­tive Ex­tinc­tion-level Good­hart’s Law

Feb 21, 2024, 5:38 PM
27 points
1 comment2 min readLW link

A Strange ACH Corner Case

jefftkFeb 10, 2024, 3:00 AM
27 points
2 comments2 min readLW link
(www.jefftk.com)

Meetup In a Box: Year In Review

CzynskiFeb 14, 2024, 1:18 AM
26 points
1 comment4 min readLW link

Man­i­fold Markets

PeterMcCluskeyFeb 2, 2024, 5:48 PM
26 points
9 comments4 min readLW link
(bayesianinvestor.com)

Ar­ro­gance and Peo­ple Pleasing

Jonathan MoregårdFeb 6, 2024, 6:43 PM
26 points
7 comments4 min readLW link
(honestliving.substack.com)

Use­ful start­ing code for interpretability

eggsyntaxFeb 13, 2024, 11:13 PM
26 points
2 comments1 min readLW link

Why I think it’s net harm­ful to do tech­ni­cal safety re­search at AGI labs

RemmeltFeb 7, 2024, 4:17 AM
26 points
24 comments1 min readLW link

Causal­ity is Everywhere

silentbobFeb 13, 2024, 1:44 PM
26 points
12 comments8 min readLW link

Eval­u­at­ing Solar

jefftkFeb 17, 2024, 9:50 PM
26 points
5 comments2 min readLW link
(www.jefftk.com)

The nat­u­ral bound­aries be­tween people

Chris LakinFeb 23, 2024, 1:09 AM
26 points
2 comments8 min readLW link
(chipmonk.substack.com)

The Math of Sus­pi­cious Coincidences

RokoFeb 7, 2024, 1:32 PM
25 points
3 comments4 min readLW link