Why We Need More Shovel-Ready AI Notkil­lev­ery­oneism Me­gapro­ject Proposals

Peter BerggrenJan 20, 2025, 10:38 PM
36 points
1 comment6 min readLW link

Tips and Code for Em­piri­cal Re­search Workflows

Jan 20, 2025, 10:31 PM
94 points
14 comments20 min readLW link

Lec­ture Series on Tiling Agents #2

abramdemskiJan 20, 2025, 9:02 PM
16 points
0 comments1 min readLW link

An­nounce­ment: Learn­ing The­ory On­line Course

Jan 20, 2025, 7:55 PM
63 points
33 comments4 min readLW link

The Hid­den Sta­tus Game in Hospi­tal Slacking

EpistemicExplorerJan 20, 2025, 6:35 PM
2 points
4 comments3 min readLW link

Monthly Roundup #26: Jan­uary 2025

ZviJan 20, 2025, 3:30 PM
34 points
15 comments43 min readLW link
(thezvi.wordpress.com)

Things I have been us­ing LLMs for

Kaj_SotalaJan 20, 2025, 2:20 PM
48 points
6 comments7 min readLW link
(kajsotala.fi)

[Question] What are the chances that Su­per­hu­man Agents are already be­ing tested on the in­ter­net?

artemiumJan 20, 2025, 11:09 AM
3 points
1 comment1 min readLW link

Detroit Lions—over con­fi­dence is over rated?

HznJan 20, 2025, 10:53 AM
6 points
0 comments1 min readLW link

Log­its, log-odds, and loss for par­allel circuits

Dmitry VaintrobJan 20, 2025, 9:56 AM
57 points
4 comments11 min readLW link

Wor­ries about la­tent rea­son­ing in LLMs

Caleb BiddulphJan 20, 2025, 9:09 AM
45 points
6 comments7 min readLW link

SIGMI Cer­tifi­ca­tion Criteria

a littoral wizardJan 20, 2025, 2:41 AM
6 points
0 comments1 min readLW link

AXRP Epi­sode 38.5 - Adrià Gar­riga-Alonso on De­tect­ing AI Scheming

DanielFilanJan 20, 2025, 12:40 AM
9 points
0 comments16 min readLW link

The Mon­ster in Our Heads

testingthewatersJan 19, 2025, 11:58 PM
33 points
4 comments5 min readLW link

AI: How We Got Here—A Neu­ro­science Perspective

Mordechai RorvigJan 19, 2025, 11:51 PM
5 points
0 comments2 min readLW link
(www.kickstarter.com)

Agent Foun­da­tions 2025 at CMU

Jan 19, 2025, 11:48 PM
90 points
10 comments1 min readLW link

Who is mar­ket­ing AI al­ign­ment?

MrThinkJan 19, 2025, 9:37 PM
23 points
4 comments1 min readLW link

Some les­sons from the OpenAI-Fron­tierMath debacle

7vikJan 19, 2025, 9:09 PM
69 points
9 comments4 min readLW link

Max­i­mally Eggy Crepes

jefftkJan 19, 2025, 8:40 PM
12 points
0 comments1 min readLW link
(www.jefftk.com)

The sec­ond bit­ter les­son — there’s a fun­da­men­tal prob­lem with al­ign­ing dis­tributed AI

aelwoodJan 19, 2025, 7:00 PM
−5 points
0 comments5 min readLW link
(pursuingreality.substack.com)

The Gen­tle Romance

Richard_NgoJan 19, 2025, 6:29 PM
242 points
46 comments15 min readLW link
(www.asimov.press)

Is the­ory good or bad for AI safety?

Dmitry VaintrobJan 19, 2025, 10:32 AM
27 points
1 comment5 min readLW link

[Question] What’s the Right Way to think about In­for­ma­tion The­o­retic quan­tities in Neu­ral Net­works?

DalcyJan 19, 2025, 8:04 AM
45 points
13 comments3 min readLW link

Per Trib­al­is­mum ad Astra

Martin SustrikJan 19, 2025, 6:50 AM
30 points
5 comments2 min readLW link
(250bpm.substack.com)

Five Re­cent AI Tu­tor­ing Studies

Arjun PanicksseryJan 19, 2025, 3:53 AM
93 points
0 comments2 min readLW link
(arjunpanickssery.substack.com)

Shut Up and Calcu­late: Gam­bling, Div­ina­tion, and the Aba­cus as Tantra

leebriskCyranoJan 19, 2025, 3:03 AM
−1 points
0 comments5 min readLW link
(leebriskcyrano.com)

Does So­ciety need a cul­tural out­let in tur­bu­lent poli­ti­cal times?

Freya McneillJan 19, 2025, 2:45 AM
−3 points
0 comments7 min readLW link

On Thiel’s New Amer­i­can Regime

shawkisukkarJan 19, 2025, 2:45 AM
−3 points
0 comments5 min readLW link
(shawkisukkar.substack.com)

be the per­son that makes the meet­ing productive

OldmanrahulJan 18, 2025, 10:32 PM
9 points
0 comments1 min readLW link

Beards and Masks?

jefftkJan 18, 2025, 4:00 PM
72 points
5 comments4 min readLW link
(www.jefftk.com)

[Question] How likely is AGI to force us all to be happy for­ever? (much like in the Three Wor­lds Col­lide novel)

uhbif19Jan 18, 2025, 3:39 PM
9 points
5 comments1 min readLW link

Well-be­ing in the mind, and its im­pli­ca­tions for utilitarianism

SjlverJan 18, 2025, 3:32 PM
6 points
2 comments2 min readLW link

[Ex­er­cise] Four Ex­am­ples of Notic­ing Confusion

Logan RiggsJan 18, 2025, 3:29 PM
8 points
8 comments3 min readLW link

Scal­ing Wargam­ing for Global Catas­trophic Risks with AI

Jan 18, 2025, 3:10 PM
38 points
2 comments4 min readLW link
(blog.sentinel-team.org)

Align­ment ideas

qbolecJan 18, 2025, 12:43 PM
11 points
1 comment8 min readLW link

AI-en­abled Cloud Gaming

samuelshadrachJan 18, 2025, 11:56 AM
1 point
0 comments3 min readLW link
(samuelshadrach.com)

Don’t ig­nore bad vibes you get from people

Kaj_SotalaJan 18, 2025, 9:20 AM
154 points
50 comments2 min readLW link
(kajsotala.fi)

Renor­mal­iza­tion Re­dux: QFT Tech­niques for AI Interpretability

Jan 18, 2025, 3:54 AM
44 points
12 comments7 min readLW link

[Question] What’s Wrong With the Si­mu­la­tion Ar­gu­ment?

AynonymousPrsn123Jan 18, 2025, 2:32 AM
6 points
49 comments1 min readLW link

Your AI Safety fo­cus is down­stream of your AGI timeline

Michael FloodJan 17, 2025, 9:24 PM
9 points
0 comments4 min readLW link

Thoughts on the con­ser­va­tive as­sump­tions in AI control

BuckJan 17, 2025, 7:23 PM
91 points
5 comments13 min readLW link

Ti­maeus is hiring re­searchers & engineers

Jan 17, 2025, 7:13 PM
65 points
4 comments4 min readLW link

Model Amnesty Project

themisJan 17, 2025, 6:53 PM
3 points
2 comments3 min readLW link

Ad­dress­ing doubts of AI progress: Why GPT-5 is not late, and why data scarcity isn’t a fun­da­men­tal limiter near term.

LDJJan 17, 2025, 6:53 PM
2 points
0 comments2 min readLW link

Play­ing Dixit with AI: How Well LLMs De­tect ‘Me-ness’

Mariia KoroliukJan 17, 2025, 6:52 PM
5 points
0 comments2 min readLW link

Do­ing a self-ran­dom­ized study of the im­pacts of glycine on sleep (Science is hard)

thedissonance.netJan 17, 2025, 6:49 PM
11 points
5 comments11 min readLW link

How sci-fi can have drama with­out dystopia or doomerism

jasoncrawfordJan 17, 2025, 3:22 PM
19 points
3 comments3 min readLW link
(newsletter.rootsofprogress.org)

[Question] What do you mean with ‘al­ign­ment is solv­able in prin­ci­ple’?

Remmelt17 Jan 2025 15:03 UTC
3 points
9 comments1 min readLW link

Meta Pivots on Con­tent Moderation

Zvi17 Jan 2025 14:20 UTC
47 points
3 comments10 min readLW link
(thezvi.wordpress.com)

Tax Price Goug­ing?

jefftk17 Jan 2025 14:10 UTC
55 points
22 comments3 min readLW link
(www.jefftk.com)