Survey: How Do Elite Chinese Students Feel About the Risks of AI?

Nick Corvino
Sep 2, 2024, 6:11 PM
141 points
13 comments
10 min read
LW link

Passages I Highlighted in The Letters of J.R.R.Tolkien

Ivan Vendrov
Nov 25, 2024, 1:47 AM
139 points
39 comments
31 min read
LW link

Hire (or Become) a Thinking Assistant

Raemon
Dec 23, 2024, 3:58 AM
138 points
49 comments
8 min read
LW link

My experience using financial commitments to overcome akrasia

William Howard
Apr 15, 2024, 10:57 PM
137 points
33 comments
18 min read
LW link

[Question] Which things were you surprised to learn are not metaphors?

Eric Neyman
Nov 21, 2024, 6:56 PM
136 points
88 comments
1 min read
LW link

Notice When People Are Directionally Correct

Chris_Leong
Jan 14, 2024, 2:12 PM
136 points
8 comments
2 min read
LW link

An Extremely Opinionated Annotated List of My Favourite Mechanistic Interpretability Papers v2

Neel Nanda
Jul 7, 2024, 5:39 PM
136 points
16 comments
25 min read
LW link

Read the Roon

Zvi
Mar 5, 2024, 1:50 PM
136 points
6 comments
19 min read
LW link
(thezvi.wordpress.com)

On saying “Thank you” instead of “I’m Sorry”

Michael Cohn
Jul 8, 2024, 3:13 AM
136 points
16 comments
3 min read
LW link

[Completed] The 2024 Petrov Day Scenario

Sep 26, 2024, 8:08 AM
136 points
114 comments
5 min read
LW link

“Can AI Scaling Continue Through 2030?”, Epoch AI (yes)

gwern
Aug 24, 2024, 1:40 AM
135 points
4 comments
3 min read
LW link
(epochai.org)

The Worst Form Of Government (Except For Everything Else We’ve Tried)

johnswentworth
Mar 17, 2024, 6:11 PM
135 points
47 comments
4 min read
LW link

Why I don’t believe in the placebo effect

transhumanist_atom_understander
Jun 10, 2024, 2:37 AM
135 points
22 comments
9 min read
LW link

A Dozen Ways to Get More Dakka

Davidmanheim
Apr 8, 2024, 4:45 AM
135 points
11 comments
3 min read
LW link

Loving a world you don’t trust

Joe Carlsmith
Jun 18, 2024, 7:31 PM
135 points
13 comments
33 min read
LW link

Updatelessness doesn’t solve most problems

Martín Soto
Feb 8, 2024, 5:30 PM
135 points
45 comments
12 min read
LW link

How it All Went Down: The Puzzle Hunt that took us way, way Less Online

A*
Jun 2, 2024, 8:01 AM
135 points
5 comments
5 min read
LW link

Processor clock speeds are not how fast AIs think

Ege Erdil
Jan 29, 2024, 2:39 PM
135 points
55 comments
2 min read
LW link

Limitations on Formal Verification for AI Safety

Andrew Dickson
Aug 19, 2024, 11:03 PM
134 points
60 comments
23 min read
LW link

“AI achieves silver-medal standard solving International Mathematical Olympiad problems”

gjm
Jul 25, 2024, 3:58 PM
133 points
38 comments
2 min read
LW link
(deepmind.google)

Parasites (not a metaphor)

lemonhope
Aug 8, 2024, 8:07 PM
133 points
19 comments
1 min read
LW link

Simple probes can catch sleeper agents

Apr 23, 2024, 9:10 PM
133 points
21 comments
1 min read
LW link
(www.anthropic.com)

My simple AGI investment & insurance strategy

lc
Mar 31, 2024, 2:51 AM
131 points
27 comments
2 min read
LW link

Circuits in Superposition: Compressing many small neural networks into one

Oct 14, 2024, 1:06 PM
130 points
9 comments
13 min read
LW link

The case for training frontier AIs on Sumerian-only corpus

Jan 15, 2024, 4:40 PM
130 points
16 comments
3 min read
LW link

“The Solomonoff Prior is Malign” is a special case of a simpler argument

David Matolcsi
Nov 17, 2024, 9:32 PM
130 points
44 comments
12 min read
LW link

How I started believing religion might actually matter for rationality and moral philosophy

zhukeepa
Aug 23, 2024, 5:40 PM
129 points
41 comments
7 min read
LW link

Near-mode thinking on AI

Olli Järviniemi
Aug 4, 2024, 8:47 PM
128 points
9 comments
5 min read
LW link

The Pearly Gates

lsusr
May 30, 2024, 4:01 AM
127 points
6 comments
3 min read
LW link

Community Notes by X

NicholasKees
Mar 18, 2024, 5:13 PM
127 points
15 comments
7 min read
LW link

Pantheon Interface

Jul 8, 2024, 7:03 PM
127 points
22 comments
6 min read
LW link

Steering Llama-2 with contrastive activation additions

Jan 2, 2024, 12:47 AM
125 points
29 comments
8 min read
LW link
(arxiv.org)

The Standard Analogy

Zack_M_Davis
Jun 3, 2024, 5:15 PM
125 points
28 comments
12 min read
LW link

A Shutdown Problem Proposal

Jan 21, 2024, 6:12 PM
125 points
61 comments
6 min read
LW link

BIG-Bench Canary Contamination in GPT-4

Jozdien
Oct 22, 2024, 3:40 PM
125 points
14 comments
4 min read
LW link

Things I’ve Grieved

Raemon
Feb 18, 2024, 7:32 PM
125 points
6 comments
2 min read
LW link

An even deeper atheism

Joe Carlsmith
Jan 11, 2024, 5:28 PM
125 points
47 comments
15 min read
LW link

Awakening

lsusr
May 30, 2024, 7:03 AM
125 points
79 comments
9 min read
LW link

OpenAI’s CBRN tests seem unclear

LucaRighetti
Nov 21, 2024, 5:28 PM
124 points
6 comments
7 min read
LW link
(www.planned-obsolescence.org)

[Question] What do coherence arguments actually prove about agentic behavior?

sunwillrise
Jun 1, 2024, 9:37 AM
123 points
39 comments
6 min read
LW link

Investigating the Chart of the Century: Why is food so expensive?

Maxwell Tabarrok
Aug 16, 2024, 1:21 PM
123 points
26 comments
3 min read
LW link
(www.maximum-progress.com)

Do you believe in hundred dollar bills lying on the ground? Consider humming

Elizabeth
May 16, 2024, 12:00 AM
122 points
22 comments
6 min read
LW link
(acesounderglass.com)

A List of 45+ Mech Interp Project Ideas from Apollo Research’s Interpretability Team

Jul 18, 2024, 2:15 PM
122 points
18 comments
18 min read
LW link

Why I take short timelines seriously

NicholasKees
Jan 28, 2024, 10:27 PM
122 points
29 comments
4 min read
LW link

A bird’s eye view of ARC’s research

Jacob_Hilton
Oct 23, 2024, 3:50 PM
121 points
12 comments
7 min read
LW link
(www.alignment.org)

My Number 1 Epistemology Book Recommendation: Inventing Temperature

adamShimi
Sep 8, 2024, 2:30 PM
121 points
18 comments
3 min read
LW link
(epistemologicalfascinations.substack.com)

Evidence of Learned Look-Ahead in a Chess-Playing Neural Network

Erik Jenner
Jun 4, 2024, 3:50 PM
121 points
14 comments
13 min read
LW link

The Pareto Best and the Curse of Doom

Screwtape
Feb 21, 2024, 11:10 PM
120 points
21 comments
9 min read
LW link

[Question] Which skincare products are evidence-based?

Vanessa Kosoy
May 2, 2024, 3:22 PM
120 points
48 comments
1 min read
LW link

AI catastrophes and rogue deployments

Buck
Jun 3, 2024, 5:04 PM
120 points
16 comments
8 min read
LW link