Lec­tures on AI for high school stu­dents (and oth­ers)

Radford Neal3 Jun 2025 23:54 UTC
6 points
0 comments1 min readLW link
(radfordneal.wordpress.com)

Does the Taiwan in­va­sion pre­vent mankind from ob­tain­ing the al­igned ASI?

StanislavKrym3 Jun 2025 23:35 UTC
−14 points
1 comment5 min readLW link

Self-inquiry

Vadim Golub3 Jun 2025 22:15 UTC
−3 points
0 comments5 min readLW link

Ques­tion to LW devs: does LessWrong tries to be face­booky?

Roman Malov3 Jun 2025 22:08 UTC
5 points
1 comment1 min readLW link

Your Strat­egy Roadmap: Ex­pert Tips + Live Training

Deena Englander3 Jun 2025 21:10 UTC
−4 points
0 comments4 min readLW link

Steer­ing Vec­tors Can Help LLM Judges De­tect Sub­tle Dishonesty

3 Jun 2025 20:33 UTC
12 points
1 comment5 min readLW link

Schel­ling Co­or­di­na­tion via Agen­tic Loops

Callum-Luis Kindred3 Jun 2025 20:13 UTC
10 points
1 comment9 min readLW link

Vi­sual Prompt In­jec­tions: Re­sults on test­ing AI spam-defense and AI vuln­er­a­bil­ity to de­cep­tive web ads.

Seon Gunness3 Jun 2025 20:10 UTC
4 points
0 comments12 min readLW link

Broad-Spec­trum Cancer Treatments

sarahconstantin3 Jun 2025 19:40 UTC
150 points
10 comments7 min readLW link
(sarahconstantin.substack.com)

How to work through the ARENA pro­gram on your own

Leon Lang3 Jun 2025 17:38 UTC
38 points
5 comments6 min readLW link

How the veil of ig­no­rance grounds sentientism

HoVY3 Jun 2025 17:29 UTC
−3 points
23 comments6 min readLW link
(forum.effectivealtruism.org)

In Which I Make the Mis­take of Fully Cover­ing an Epi­sode of the All-In Podcast

Zvi3 Jun 2025 15:50 UTC
42 points
2 comments28 min readLW link
(thezvi.wordpress.com)

Trans­former Mo­du­lar Ad­di­tion Through A Sig­nal Pro­cess­ing Lens

Benjamin Kelley3 Jun 2025 15:32 UTC
1 point
0 comments1 min readLW link

AXRP Epi­sode 41 - Lee Sharkey on At­tri­bu­tion-based Pa­ram­e­ter Decomposition

DanielFilan3 Jun 2025 3:40 UTC
28 points
1 comment61 min readLW link

Notes on dy­namism, power, & virtue

Lizka3 Jun 2025 1:40 UTC
19 points
0 comments12 min readLW link

Trends – Ar­tifi­cial Intelligence

Archimedes3 Jun 2025 0:48 UTC
1 point
1 comment1 min readLW link
(www.bondcap.com)

LLMs might have sub­jec­tive ex­pe­riences, but no con­cepts for them

No77e2 Jun 2025 21:18 UTC
17 points
5 comments2 min readLW link

In defense of memes (and thought-ter­mi­nat­ing clichés)

Harjas2 Jun 2025 20:18 UTC
11 points
4 comments10 min readLW link

He­donic adap­ta­tion: you should not seeks pleasure

Crazy philosopher2 Jun 2025 19:23 UTC
0 points
6 comments2 min readLW link

Un­faith­ful Rea­son­ing Can Fool Chain-of-Thought Monitoring

2 Jun 2025 19:08 UTC
78 points
17 comments3 min readLW link

Frank Her­bert’s great in­sight into hu­man agency—Muad’Dib the tool?

Nerret2 Jun 2025 18:52 UTC
2 points
1 comment1 min readLW link

Hem­ing­way Case

Martin Sustrik2 Jun 2025 18:50 UTC
19 points
2 comments1 min readLW link
(www.250bpm.com)

[Question] What AI apps are sur­pris­ingly ab­sent given cur­rent ca­pa­bil­ities?

azergante2 Jun 2025 18:46 UTC
4 points
8 comments1 min readLW link

[Be­neath Psy­chol­ogy] Chronic pain challenge part 2: the solution

jimmy2 Jun 2025 17:30 UTC
39 points
3 comments34 min readLW link

The Value Propo­si­tion of Ro­man­tic Relationships

johnswentworth2 Jun 2025 13:51 UTC
208 points
43 comments13 min readLW link

1. The challenge of un­aware­ness for im­par­tial al­tru­ist ac­tion guidance: Introduction

Anthony DiGiovanni2 Jun 2025 8:54 UTC
48 points
6 comments13 min readLW link

‘Wicked’: thoughts

KatjaGrace2 Jun 2025 6:20 UTC
25 points
3 comments3 min readLW link
(worldspiritsockpuppet.com)

Hu­man­ity needs a Ulysses Pact for AI

Lukas N.P. Egger1 Jun 2025 20:56 UTC
1 point
2 comments1 min readLW link

Text Steers Vision

Woody Gan1 Jun 2025 20:30 UTC
5 points
0 comments7 min readLW link

[Question] Pos­si­ble AI reg­u­la­tion emer­gency?

CronoDAS1 Jun 2025 20:30 UTC
19 points
1 comment1 min readLW link

Eliezer Yud­kowsky & Con­nor Leahy | AI Risk, Safety & Align­ment Q&A [4K Re­mas­ter + HQ Au­dio]

Dex Volkov1 Jun 2025 20:20 UTC
−8 points
2 comments1 min readLW link
(www.youtube.com)

Own­er­ship: the prin­ci­ple of “Deprive first, ask ques­tions later”

MillardJMelnyk1 Jun 2025 20:19 UTC
−27 points
22 comments1 min readLW link

Economists should track the speed and mag­ni­tude of AI im­ple­men­ta­tion projects

ParrotRobot1 Jun 2025 20:15 UTC
3 points
0 comments2 min readLW link

Ingroup

JenniferRM1 Jun 2025 19:47 UTC
−3 points
12 comments1 min readLW link

Ap­ply to the AI Se­cu­rity Boot­camp [Aug 4 - Aug 29]

1 Jun 2025 19:47 UTC
27 points
2 comments4 min readLW link

See­ing how well an agen­tic AI cod­ing tool can do com­pared to me us­ing an ac­tual real-world example

Massimog1 Jun 2025 19:24 UTC
32 points
2 comments1 min readLW link
(blog.massimogauthier.com)

Ni­co­tine ad­dic­tion, cloves, and need­ing to take a shit

eyesack1 Jun 2025 19:13 UTC
4 points
1 comment1 min readLW link

2nd Ger­many-wide ACX/​LW event

Fernand01 Jun 2025 13:56 UTC
1 point
0 comments1 min readLW link

An Opinionated Guide to P-Values

amitlevy491 Jun 2025 11:48 UTC
11 points
0 comments8 min readLW link
(ivy0.substack.com)

Le­gal Per­son­hood for Models: Novelli et. al & Mocanu

Stephen Martin1 Jun 2025 8:18 UTC
2 points
0 comments10 min readLW link

Is Es­ca­la­tion Inevitable?

Lennart Wijers31 May 2025 22:10 UTC
5 points
0 comments3 min readLW link

Policy En­tropy, Learn­ing, and Align­ment (Or Maybe Your LLM Needs Ther­apy)

sdeture31 May 2025 22:09 UTC
15 points
6 comments8 min readLW link

The Unseen Hand: AI’s Prob­lem Preemp­tion and the True Fu­ture of Labor

Ben Kassan31 May 2025 22:04 UTC
8 points
0 comments20 min readLW link

The 80/​20 play­book for miti­gat­ing AI schem­ing in 2025

Charbel-Raphaël31 May 2025 21:17 UTC
40 points
2 comments4 min readLW link

Col­lec­tive Ac­tion for AI Safety (June 4, NYC)

Jordan Braunstein31 May 2025 20:27 UTC
1 point
0 comments1 min readLW link

The best ap­proaches for miti­gat­ing “the in­tel­li­gence curse” (or grad­ual dis­em­pow­er­ment); my quick guesses at the best ob­ject-level interventions

ryan_greenblatt31 May 2025 18:20 UTC
78 points
19 comments5 min readLW link

Would It Be Bet­ter to Dispense with Good and Evil?

arusarda31 May 2025 16:40 UTC
−2 points
10 comments6 min readLW link

How Epistemic Col­lapse Looks from Inside

Martin Sustrik31 May 2025 16:30 UTC
9 points
11 comments1 min readLW link
(250bpm.substack.com)

When will AI au­to­mate all men­tal work, and how fast?

31 May 2025 16:18 UTC
10 points
0 comments7 min readLW link
(youtu.be)

Progress links and short notes, 2025-05-31: RPI fel­low­ship dead­line to­mor­row, Edge Es­mer­alda next week, and more

jasoncrawford31 May 2025 15:20 UTC
11 points
0 comments7 min readLW link
(newsletter.rootsofprogress.org)