Lec­ture Series on Tiling Agents

abramdemskiJan 14, 2025, 9:34 PM
38 points
14 comments1 min readLW link

Is AI Phys­i­cal?

Lauren GreenspanJan 14, 2025, 9:21 PM
23 points
6 comments7 min readLW link

Her­i­ta­bil­ity: Five Battles

Steven ByrnesJan 14, 2025, 6:21 PM
88 points
23 comments60 min readLW link

The Philo­soph­i­cal Glos­sary of AI

David GrossJan 14, 2025, 5:36 PM
11 points
0 comments1 min readLW link
(www.aiglossary.co.uk)

I’m offer­ing free math con­sul­ta­tions!

GurkenglasJan 14, 2025, 4:30 PM
80 points
7 comments1 min readLW link

Why aban­don “prob­a­bil­ity is in the mind” when it comes to quan­tum dy­nam­ics?

Maxwell PetersonJan 14, 2025, 3:53 PM
20 points
17 comments1 min readLW link

How do you deal w/​ Su­per Stim­uli?

Logan RiggsJan 14, 2025, 3:14 PM
106 points
25 comments3 min readLW link

curate

technicalitiesJan 14, 2025, 2:40 PM
12 points
0 comments2 min readLW link

Our new video about goal mis­gen­er­al­iza­tion, plus an apology

WriterJan 14, 2025, 2:07 PM
33 points
0 comments7 min readLW link
(youtu.be)

NYC Conges­tion Pric­ing: Early Days

ZviJan 14, 2025, 2:00 PM
29 points
0 comments15 min readLW link
(thezvi.wordpress.com)

Do hu­mans re­ally learn from “lit­tle” data?

Alice WanderlandJan 14, 2025, 10:46 AM
14 points
5 comments1 min readLW link
(aliceandbobinwanderland.substack.com)

Ba­sics of Bayesian learning

Dmitry VaintrobJan 14, 2025, 10:00 AM
12 points
0 comments13 min readLW link

[Question] Why do fu­tur­ists care about the cul­ture war?

Knight LeeJan 14, 2025, 7:35 AM
22 points
22 comments2 min readLW link

Don’t Le­gal­ize Drugs

Declan MolonyJan 14, 2025, 6:51 AM
36 points
9 comments9 min readLW link

Mini Go: Gate­way Game

jefftkJan 14, 2025, 3:30 AM
32 points
1 comment1 min readLW link
(www.jefftk.com)

Find­ing Fea­tures Causally Up­stream of Refusal

Jan 14, 2025, 2:30 AM
54 points
5 comments12 min readLW link

Im­pli­ca­tions of the in­fer­ence scal­ing paradigm for AI safety

Ryan KiddJan 14, 2025, 2:14 AM
93 points
70 comments5 min readLW link

Bi­den ad­minis­tra­tion un­veils global AI ex­port con­trols aimed at China

Chris_LeongJan 14, 2025, 1:01 AM
9 points
0 comments1 min readLW link
(www.axios.com)

My lat­est at­tempt to un­der­stand de­ci­sion the­ory: I asked ChatGPT to de­bate me.

bokovJan 13, 2025, 7:37 PM
−8 points
5 comments19 min readLW link

AI mod­els in­her­ently al­ter “hu­man val­ues.” So, al­ign­ment-based AI safety ap­proaches must bet­ter ac­count for value drift

bfitzgerald3132Jan 13, 2025, 7:22 PM
5 points
2 comments13 min readLW link

Chance is in the Map, not the Territory

Jan 13, 2025, 7:17 PM
67 points
18 comments7 min readLW link

Progress links and short notes, 2025-01-13

jasoncrawfordJan 13, 2025, 6:35 PM
13 points
2 comments3 min readLW link
(newsletter.rootsofprogress.org)

Bet­ter an­ti­bod­ies by en­g­ineer­ing tar­gets, not en­g­ineer­ing an­ti­bod­ies (Nabla Bio)

Abhishaike MahajanJan 13, 2025, 3:05 PM
4 points
0 comments14 min readLW link
(www.owlposting.com)

Zvi’s 2024 In Movies

ZviJan 13, 2025, 1:40 PM
44 points
4 comments15 min readLW link
(thezvi.wordpress.com)

Paper club: He et al. on mod­u­lar ar­ith­metic (part I)

Dmitry VaintrobJan 13, 2025, 11:18 AM
14 points
0 comments8 min readLW link

Cast it into the fire! De­stroy it!

Aram PanasencoJan 13, 2025, 7:30 AM
6 points
9 comments2 min readLW link

Moder­ately More Than You Wanted To Know: De­pres­sive Realism

JustisMillsJan 13, 2025, 2:57 AM
73 points
4 comments6 min readLW link
(justismills.substack.com)

Ap­ply­ing tra­di­tional eco­nomic think­ing to AGI: a trilemma

Steven ByrnesJan 13, 2025, 1:23 AM
144 points
32 comments3 min readLW link

Build­ing AI Re­search Fleets

Jan 12, 2025, 6:23 PM
130 points
11 comments5 min readLW link

Do An­tide­pres­sants work? (First Take)

Jacob GoldsmithJan 12, 2025, 5:11 PM
7 points
9 comments7 min readLW link

A Novel Idea for Har­ness­ing Mag­netic Re­con­nec­tion as an En­ergy Source

resonovaJan 12, 2025, 5:11 PM
0 points
8 comments3 min readLW link

How quickly could robots scale up?

Benjamin_ToddJan 12, 2025, 5:01 PM
47 points
25 comments1 min readLW link
(benjamintodd.substack.com)

AGI Will Not Make La­bor Worthless

Maxwell TabarrokJan 12, 2025, 3:09 PM
−7 points
16 comments5 min readLW link
(www.maximum-progress.com)

The pur­pose­ful drunkard

Dmitry VaintrobJan 12, 2025, 12:27 PM
98 points
13 comments6 min readLW link

No one has the ball on 1500 Rus­sian olympiad win­ners who’ve re­ceived HPMOR

Mikhail SaminJan 12, 2025, 11:43 AM
80 points
21 comments1 min readLW link

Why mod­el­ling multi-ob­jec­tive home­osta­sis is es­sen­tial for AI al­ign­ment (and how it helps with AI safety as well)

Roland PihlakasJan 12, 2025, 3:37 AM
46 points
7 comments10 min readLW link

Ex­tend­ing con­trol eval­u­a­tions to non-schem­ing threats

joshcJan 12, 2025, 1:42 AM
30 points
1 comment12 min readLW link

Rol­ling Thresh­olds for AGI Scal­ing Regulation

LarksJan 12, 2025, 1:30 AM
40 points
6 commentsLW link

AI Safety at the Fron­tier: Paper High­lights, De­cem­ber ’24

gasteigerjoJan 11, 2025, 10:54 PM
7 points
2 comments7 min readLW link
(aisafetyfrontier.substack.com)

Fluori­da­tion: The RCT We Still Haven’t Run (But Should)

ChristianKlJan 11, 2025, 9:02 PM
22 points
5 comments2 min readLW link

In Defense of a But­le­rian Jihad

sloonzJan 11, 2025, 7:30 PM
10 points
25 comments9 min readLW link

Near term dis­cus­sions need some­thing smaller and more con­crete than AGI

ryan_bJan 11, 2025, 6:24 PM
13 points
0 comments6 min readLW link

A pro­posal for iter­ated in­ter­pretabil­ity with known-in­ter­pretable nar­row AIs

Peter BerggrenJan 11, 2025, 2:43 PM
6 points
0 comments2 min readLW link

Have fron­tier AI sys­tems sur­passed the self-repli­cat­ing red line?

nsageJan 11, 2025, 5:31 AM
4 points
0 comments4 min readLW link

We need a uni­ver­sal defi­ni­tion of ‘agency’ and re­lated words

CstineSublimeJan 11, 2025, 3:22 AM
18 points
1 comment5 min readLW link

[Question] AI for med­i­cal care for hard-to-treat dis­eases?

CronoDASJan 10, 2025, 11:55 PM
12 points
1 comment1 min readLW link

Beliefs and state of mind into 2025

RussellThorJan 10, 2025, 10:07 PM
18 points
9 comments7 min readLW link

Recom­men­da­tions for Tech­ni­cal AI Safety Re­search Directions

Sam MarksJan 10, 2025, 7:34 PM
64 points
1 comment17 min readLW link
(alignment.anthropic.com)

Is AI Align­ment Enough?

Aram PanasencoJan 10, 2025, 6:57 PM
28 points
6 comments6 min readLW link

[Question] What are some sce­nar­ios where an al­igned AGI ac­tu­ally helps hu­man­ity, but many/​most peo­ple don’t like it?

RomanSJan 10, 2025, 6:13 PM
13 points
6 comments3 min readLW link