[Question] how do the CEOs re­spond to our con­cerns?

KvmanThinking11 Feb 2025 23:39 UTC
−10 points
7 comments1 min readLW link

Where Would Good Fore­casts Most Help AI Gover­nance Efforts?

Violet Hour11 Feb 2025 18:15 UTC
11 points
1 comment6 min readLW link

AI Safety at the Fron­tier: Paper High­lights, Jan­uary ’25

gasteigerjo11 Feb 2025 16:14 UTC
7 points
0 comments8 min readLW link
(aisafetyfrontier.substack.com)

If Neu­ro­scien­tists Succeed

Mordechai Rorvig11 Feb 2025 15:33 UTC
9 points
6 comments18 min readLW link

The News is Never Neglected

lsusr11 Feb 2025 14:59 UTC
113 points
18 comments1 min readLW link

Re­think­ing AI Safety Ap­proach in the Era of Open-Source AI

Weibing Wang11 Feb 2025 14:01 UTC
4 points
0 comments6 min readLW link

What About The Horses?

Maxwell Tabarrok11 Feb 2025 13:59 UTC
16 points
17 comments7 min readLW link
(www.maximum-progress.com)

On De­liber­a­tive Alignment

Zvi11 Feb 2025 13:00 UTC
51 points
2 comments6 min readLW link
(thezvi.wordpress.com)

De­tect­ing AI Agent Failure Modes in Simulations

Michael Soareverix11 Feb 2025 11:10 UTC
17 points
0 comments8 min readLW link

World Ci­ti­zen Assem­bly about AI—Announcement

Camille Berger 11 Feb 2025 10:51 UTC
26 points
1 comment5 min readLW link

Vi­sual Refer­ence for Fron­tier Large Lan­guage Models

kenakofer11 Feb 2025 5:14 UTC
14 points
0 comments1 min readLW link
(kenan.schaefkofer.com)

Effec­tive Utopia & Startup Ways There: Math-Proven Safe Static mAX-In­tel­li­gence (AXI), Mul­tiver­sal Align­ment, Phys­i­cal­ized Ethics...

ank11 Feb 2025 3:21 UTC
13 points
8 comments40 min readLW link

Ar­gu­ing for the Truth? An In­fer­ence-Only Study into AI Debate

denisemester11 Feb 2025 3:04 UTC
7 points
0 comments16 min readLW link

Why Did Elon Musk Just Offer to Buy Con­trol of OpenAI for $100 Billion?

garrison11 Feb 2025 0:20 UTC
208 points
8 comments6 min readLW link
(garrisonlovely.substack.com)

Pos­i­tive Directions

G Wood11 Feb 2025 0:00 UTC
0 points
0 comments4 min readLW link

Log­i­cal Correlation

niplav10 Feb 2025 23:29 UTC
24 points
7 comments10 min readLW link

Proof idea: SLT to AIT

Lucius Bushnaq10 Feb 2025 23:14 UTC
42 points
15 comments6 min readLW link

LW/​ACX so­cial meetup

Stefan10 Feb 2025 21:12 UTC
2 points
0 comments1 min readLW link

A Bear­ish Take on AI, as a Treat

rats10 Feb 2025 19:22 UTC
11 points
0 comments4 min readLW link
(open.substack.com)

Beyond ELO: Re­think­ing Chess Skill as a Mul­tidi­men­sional Ran­dom Variable

Oliver Oswald10 Feb 2025 19:19 UTC
6 points
7 comments2 min readLW link

Claude is More Anx­ious than GPT; Per­son­al­ity is an axis of in­ter­pretabil­ity in lan­guage models

future_detective10 Feb 2025 19:19 UTC
2 points
2 comments8 min readLW link
(dhealy.substack.com)

Notes on Oc­cam via Solomonoff vs. hi­er­ar­chi­cal Bayes

JesseClifton10 Feb 2025 17:55 UTC
29 points
7 comments4 min readLW link

Sleep­ing Beauty: an Ac­cu­racy-based Approach

glauberdebona10 Feb 2025 15:40 UTC
7 points
2 comments7 min readLW link

Poli­ti­cal Idolatry

Arturo Macias10 Feb 2025 15:26 UTC
−8 points
7 comments2 min readLW link

ML4Good Colom­bia—Ap­pli­ca­tions Open to LatAm Participants

10 Feb 2025 15:03 UTC
5 points
0 comments1 min readLW link

Non­par­ti­san AI safety

Yair Halberstadt10 Feb 2025 14:55 UTC
30 points
4 comments2 min readLW link

Opinion Ar­ti­cle Scor­ing System

ciaran 10 Feb 2025 14:32 UTC
1 point
0 comments5 min readLW link

Levels of Friction

Zvi10 Feb 2025 13:10 UTC
155 points
8 comments12 min readLW link
(thezvi.wordpress.com)

Bau­mol effect vs Jevons paradox

Hzn10 Feb 2025 8:28 UTC
0 points
0 comments1 min readLW link
(hzn33.neocities.org)

[Question] A Si­mu­la­tion of Au­toma­tion eco­nomics?

qbolec10 Feb 2025 8:11 UTC
10 points
1 comment1 min readLW link

[Question] Should I Divest from AI?

Oliver Kuperman10 Feb 2025 3:29 UTC
6 points
4 comments1 min readLW link

OpenAI lied about SFT vs. RLHF

sanxiyn10 Feb 2025 3:24 UTC
10 points
2 comments1 min readLW link
(x.com)

“Self-Black­mail” and Alternatives

jessicata9 Feb 2025 23:20 UTC
20 points
12 comments7 min readLW link
(unstableontology.com)

Alt­man blog on post-AGI world

Julian Bradshaw9 Feb 2025 21:52 UTC
29 points
10 comments1 min readLW link
(blog.samaltman.com)

Fore­cast­ing newslet­ter #2/​2025: Fore­cast­ing meetup network

NunoSempere9 Feb 2025 18:07 UTC
13 points
0 comments4 min readLW link
(forecasting.substack.com)

How iden­ti­cal twin sisters feel about nieces vs their own daughters

Dave92F19 Feb 2025 17:36 UTC
4 points
19 comments1 min readLW link

Two hemi­spheres—I do not think it means what you think it means

Viliam9 Feb 2025 15:33 UTC
112 points
21 comments14 min readLW link

The Struc­ture of Pro­fes­sional Revolutions

SebastianG 9 Feb 2025 13:23 UTC
8 points
0 comments4 min readLW link

Gary Mar­cus now say­ing AI can’t do things it can already do

Benjamin_Todd9 Feb 2025 12:24 UTC
62 points
12 comments1 min readLW link
(benjamintodd.substack.com)

How do you make a 250x bet­ter vac­cine at 1/​10 the cost? Develop it in In­dia.

Abhishaike Mahajan9 Feb 2025 3:53 UTC
4 points
5 comments1 min readLW link
(www.owlposting.com)

Less Lap­top Velcro

jefftk9 Feb 2025 3:30 UTC
19 points
0 comments1 min readLW link
(www.jefftk.com)

AXRP Epi­sode 38.7 - An­thony Aguirre on the Fu­ture of Life Institute

DanielFilan9 Feb 2025 1:10 UTC
10 points
0 comments12 min readLW link

[Job ad] LISA CEO

9 Feb 2025 0:18 UTC
18 points
4 comments2 min readLW link

“Think it Faster” worksheet

Raemon8 Feb 2025 22:02 UTC
69 points
11 comments4 min readLW link

Seven sources of goals in LLM agents

Seth Herd8 Feb 2025 21:54 UTC
23 points
3 comments2 min readLW link

[Question] p(s-risks to con­tem­po­rary hu­mans)?

MattAlexander8 Feb 2025 21:19 UTC
6 points
5 comments6 min readLW link

Cross-Layer Fea­ture Align­ment and Steer­ing in Large Lan­guage Model

dlaptev8 Feb 2025 20:18 UTC
9 points
0 comments6 min readLW link

Towards build­ing blocks of ontologies

8 Feb 2025 16:03 UTC
29 points
0 comments26 min readLW link

Can Knowl­edge Hurt You? The Dangers of In­fo­haz­ards (and Exfo­haz­ards)

8 Feb 2025 15:51 UTC
19 points
0 comments5 min readLW link
(www.youtube.com)

Distill­ing the In­ter­nal Model Principle

JoseFaustino8 Feb 2025 14:59 UTC
21 points
0 comments16 min readLW link