Gothen­burg LW /​ ACX meetup

StefanJan 8, 2025, 9:39 PM
2 points
0 comments1 min readLW link

Aris­toc­racy and Hostage Capital

Arjun PanicksseryJan 8, 2025, 7:38 PM
108 points
7 comments3 min readLW link
(arjunpanickssery.substack.com)

[Question] What is the most im­pres­sive game LLMs can play well?

Cole WyethJan 8, 2025, 7:38 PM
19 points
20 comments1 min readLW link

The Type of Writ­ing that Pushes Women Away

DahliaJan 8, 2025, 6:54 PM
22 points
4 comments2 min readLW link

Ann Alt­man has filed a law­suit in US fed­eral court alleg­ing that she was sex­u­ally abused by Sam Altman

quanticleJan 8, 2025, 2:59 PM
7 points
3 comments1 min readLW link

AI Safety Outreach Sem­i­nar & So­cial (on­line)

Linda LinseforsJan 8, 2025, 1:25 PM
9 points
0 comments1 min readLW link

XX by Rian Hughes: Pre­ten­tious Bullshit

Yair HalberstadtJan 8, 2025, 1:02 PM
33 points
5 comments5 min readLW link

Ac­ti­va­tion space in­ter­pretabil­ity may be doomed

Jan 8, 2025, 12:49 PM
148 points
34 comments8 min readLW link

AI Safety as a YC Startup

Lukas PeterssonJan 8, 2025, 10:46 AM
56 points
9 comments5 min readLW link

The ab­solute ba­sics of rep­re­sen­ta­tion the­ory of finite groups

Dmitry VaintrobJan 8, 2025, 9:47 AM
21 points
1 comment10 min readLW link

Im­pli­ca­tions of the AI Se­cu­rity Gap

Dan BraunJan 8, 2025, 8:31 AM
45 points
0 comments9 min readLW link

What are poly­se­man­tic neu­rons?

Jan 8, 2025, 7:35 AM
8 points
0 comments4 min readLW link
(aisafety.info)

Tips On Em­piri­cal Re­search Slides

Jan 8, 2025, 5:06 AM
90 points
4 comments6 min readLW link

On Eat­ing the Sun

jessicataJan 8, 2025, 4:57 AM
94 points
96 comments3 min readLW link
(unstablerontology.substack.com)

Book re­view: Range by David Epstein

PatrickDFarleyJan 8, 2025, 4:27 AM
12 points
0 comments15 min readLW link

Can we have Epipha­nies and Eureka mo­ments more fre­quently?

CstineSublimeJan 8, 2025, 2:20 AM
2 points
0 comments4 min readLW link

Job Open­ing: SWE to help im­prove grant-mak­ing software

Ethan AshkieJan 8, 2025, 12:54 AM
22 points
1 comment2 min readLW link
(survivalandflourishing.com)

Markov’s Inequal­ity Explained

criticalpointsJan 8, 2025, 12:31 AM
13 points
2 comments3 min readLW link
(eregis.github.io)

Stream Entry

lsusrJan 7, 2025, 11:56 PM
76 points
11 comments4 min readLW link

Don’t fall for on­tol­ogy pyra­mid schemes

LorecJan 7, 2025, 11:29 PM
16 points
8 comments2 min readLW link

Bridge­wa­ter x Me­tac­u­lus Fore­cast­ing Con­test Goes Global — Feb 3, $25k, Opportunities

ChristianWilliamsJan 7, 2025, 9:40 PM
10 points
0 commentsLW link
(www.metaculus.com)

A Prin­ci­pled Car­toon Guide to NVC

Jan 7, 2025, 9:01 PM
39 points
5 comments5 min readLW link

Disagree­ment on AGI Suggests It’s Near

tangerineJan 7, 2025, 8:42 PM
30 points
15 comments1 min readLW link

Role em­bed­dings: mak­ing au­thor­ship more salient to LLMs

Jan 7, 2025, 8:13 PM
50 points
0 comments8 min readLW link

Will bird flu be the next Covid? “Lit­tle chance” says my dash­board.

Nathan YoungJan 7, 2025, 8:10 PM
27 points
0 comments1 min readLW link

[Fic­tion] [Comic] Effec­tive Altru­ism and Ra­tion­al­ity meet at a Sec­u­lar Sols­tice afterparty

tandemJan 7, 2025, 7:11 PM
137 points
5 comments1 min readLW link

Pre­dict­ing AI Re­leases Through Side Channels

Reworr RJan 7, 2025, 7:06 PM
16 points
1 comment1 min readLW link

Re­but­tals for ~all crit­i­cisms of AIXI

Cole WyethJan 7, 2025, 5:41 PM
25 points
17 comments14 min readLW link

OpenAI #10: Reflections

ZviJan 7, 2025, 5:00 PM
149 points
7 comments11 min readLW link
(thezvi.wordpress.com)

Some im­pli­ca­tions of rad­i­cal empathy

MichaelStJulesJan 7, 2025, 4:10 PM
3 points
0 commentsLW link

Ac­tu­al­ism, asym­me­try and extinction

MichaelStJulesJan 7, 2025, 4:02 PM
1 point
4 commentsLW link

Med­i­ta­tion in­sights as phase shifts in your self-model

Jonas HallgrenJan 7, 2025, 10:09 AM
13 points
3 comments3 min readLW link

Alle­vi­at­ing shrimp pain is im­moral.

G WoodJan 7, 2025, 7:28 AM
−7 points
6 comments4 min readLW link

D&D.Sci Dun­geon­build­ing: the Dun­geon Tour­na­ment Eval­u­a­tion & Ruleset

aphyerJan 7, 2025, 5:02 AM
33 points
8 comments5 min readLW link

Incredibow

jefftkJan 7, 2025, 3:30 AM
17 points
3 comments1 min readLW link
(www.jefftk.com)

Build­ing Big Science from the Bot­tom-Up: A Frac­tal Ap­proach to AI Safety

Lauren GreenspanJan 7, 2025, 3:08 AM
37 points
2 comments12 min readLW link

My Ex­pe­rience With A Mag­net Implant

ValeJan 7, 2025, 3:01 AM
9 points
2 comments1 min readLW link
(vale.rocks)

You should de­lay en­g­ineer­ing-heavy re­search in light of R&D automation

Daniel PalekaJan 7, 2025, 2:11 AM
36 points
3 comments5 min readLW link
(newsletter.danielpaleka.com)

Test­ing for Schem­ing with Model Deletion

GuiveJan 7, 2025, 1:54 AM
59 points
21 comments21 min readLW link
(guive.substack.com)

Guilt, Shame, and Depravity

BenquoJan 7, 2025, 1:16 AM
15 points
12 comments4 min readLW link

Turn­ing up the Heat on De­cep­tively-Misal­igned AI

J BostockJan 7, 2025, 12:13 AM
19 points
16 comments4 min readLW link

(My) self-refer­en­tial rea­son to be­lieve in free will

jacekJan 6, 2025, 11:35 PM
12 points
6 comments1 min readLW link

Defi­ni­tion of al­ign­ment sci­ence I like

quetzal_rainbowJan 6, 2025, 8:40 PM
19 points
0 comments3 min readLW link

How will we up­date about schem­ing?

ryan_greenblattJan 6, 2025, 8:21 PM
171 points
20 comments37 min readLW link

What Indi­ca­tors Should We Watch to Disam­biguate AGI Timelines?

snewmanJan 6, 2025, 7:57 PM
139 points
57 comments13 min readLW link

Gen­er­at­ing Cog­nate­ful Sen­tences with Large Lan­guage Models

vkethanaJan 6, 2025, 6:40 PM
8 points
0 comments10 min readLW link

Really rad­i­cal empathy

MichaelStJulesJan 6, 2025, 5:46 PM
19 points
0 commentsLW link

In­de­pen­dent re­search ar­ti­cle an­a­lyz­ing con­sis­tent self-re­ports of ex­pe­rience in ChatGPT and Claude

rifeJan 6, 2025, 5:34 PM
4 points
20 comments1 min readLW link
(awakenmoon.ai)

[Question] Meal Re­place­ments in 2025?

alkjashJan 6, 2025, 3:37 PM
24 points
9 comments1 min readLW link

AI safety con­tent you could create

Adam JonesJan 6, 2025, 3:35 PM
19 points
0 comments5 min readLW link
(adamjones.me)