What does a Gam­bler’s Ver­ity world look like?

ErioirEJul 25, 2024, 10:03 PM
7 points
6 comments1 min readLW link

Pac­ing Out­side the Box: RNNs Learn to Plan in Sokoban

Jul 25, 2024, 10:00 PM
59 points
8 comments2 min readLW link
(arxiv.org)

Sex, Death, and Complexity

Zero ContradictionsJul 25, 2024, 9:22 PM
0 points
0 comments1 min readLW link
(thewaywardaxolotl.blogspot.com)

Does ro­bust­ness im­prove with scale?

Jul 25, 2024, 8:55 PM
14 points
0 comments1 min readLW link
(far.ai)

Or­gani­sa­tion for Pro­gram Equil­ibrium read­ing group

Smaug123Jul 25, 2024, 7:11 PM
11 points
14 comments1 min readLW link

In Text

Valerii KremnevJul 25, 2024, 6:22 PM
−3 points
0 comments5 min readLW link

“AI achieves silver-medal stan­dard solv­ing In­ter­na­tional Math­e­mat­i­cal Olympiad prob­lems”

gjmJul 25, 2024, 3:58 PM
133 points
38 comments2 min readLW link
(deepmind.google)

[Talk tran­script] What “struc­ture” is and why it matters

Alex_AltairJul 25, 2024, 3:49 PM
23 points
0 comments5 min readLW link
(www.youtube.com)

AI #74: GPT-4o Mini Me and Llama 3

ZviJul 25, 2024, 1:50 PM
30 points
6 comments36 min readLW link
(thezvi.wordpress.com)

AI Con­sti­tu­tions are a tool to re­duce so­cietal scale risk

Sammy MartinJul 25, 2024, 11:18 AM
30 points
2 comments18 min readLW link

Deter­min­ing the power of in­vestors over Fron­tier AI Labs is strate­gi­cally im­por­tant to re­duce x-risk

Lucie PhilipponJul 25, 2024, 1:12 AM
18 points
7 comments2 min readLW link

FLI is hiring across Comms and Ops

beisenpressJul 25, 2024, 12:06 AM
1 point
0 comments1 min readLW link

A frame­work for think­ing about AI power-seeking

Joe CarlsmithJul 24, 2024, 10:41 PM
62 points
15 comments16 min readLW link

Llama Llama-3-405B?

ZviJul 24, 2024, 7:40 PM
51 points
9 comments30 min readLW link
(thezvi.wordpress.com)

AI Safety Memes Wiki

Jul 24, 2024, 6:53 PM
37 points
2 comments1 min readLW link
(aisafety.info)

Re­search Dis­cus­sion on PSCA with Claude Son­net 3.5

Robert KralischJul 24, 2024, 4:53 PM
−2 points
0 comments25 min readLW link

Read­ing More Each Day: A Sim­ple $35 Tool

aysajanJul 24, 2024, 1:54 PM
29 points
2 comments1 min readLW link

You should go to ML conferences

Jan_KulveitJul 24, 2024, 11:47 AM
112 points
13 comments4 min readLW link

The last era of hu­man mistakes

owencbJul 24, 2024, 9:58 AM
34 points
2 comments7 min readLW link
(strangecities.substack.com)

Longevity: A crit­i­cal look at “Loss of epi­ge­netic in­for­ma­tion as a cause of mam­malian ag­ing”

Anna CrowJul 24, 2024, 1:40 AM
14 points
2 comments10 min readLW link

The Cancer Re­s­olu­tion?

PeterMcCluskeyJul 24, 2024, 12:25 AM
34 points
28 comments6 min readLW link
(bayesianinvestor.com)

Pos­i­tive vi­sions for AI

Jul 23, 2024, 8:15 PM
27 points
4 comments18 min readLW link
(www.florencehinder.com)

How rea­son­able is tak­ing ex­tinc­tion risk?

FVeldeJul 23, 2024, 6:05 PM
2 points
4 comments4 min readLW link

Un­learn­ing via RMU is mostly shallow

Jul 23, 2024, 4:07 PM
54 points
4 comments6 min readLW link

Monthly Roundup #20: July 2024

ZviJul 23, 2024, 12:50 PM
33 points
9 comments38 min readLW link
(thezvi.wordpress.com)

Con­fus­ing the met­ric for the mean­ing: Per­haps cor­re­lated at­tributes are “nat­u­ral”

NickyPJul 23, 2024, 12:43 PM
33 points
3 comments4 min readLW link

My covid-re­lated be­liefs and questions

Severin T. SeehrichJul 23, 2024, 3:27 AM
10 points
3 comments1 min readLW link

[Question] Is there a Schel­ling point for group house room list­ings?

NoSignalNoNoiseJul 23, 2024, 3:03 AM
4 points
0 comments1 min readLW link

Room Available in Bos­ton Group House

NoSignalNoNoiseJul 23, 2024, 2:55 AM
15 points
1 comment1 min readLW link

D&D.Sci Sce­nario Index

Jul 23, 2024, 2:00 AM
73 points
0 comments2 min readLW link

How to avoid death by AI.

KrantzJul 23, 2024, 1:59 AM
−4 points
13 comments2 min readLW link

ML Safety Re­search Ad­vice—GabeM

Gabe MJul 23, 2024, 1:45 AM
31 points
2 comments14 min readLW link
(open.substack.com)

Ran­somware Pay­ments Should Re­quire a Sin Tax

Brian BienJul 22, 2024, 9:16 PM
20 points
10 comments2 min readLW link

The Elu­sive Root Cause of Schizophre­nia—Th­e­sis In­tro­duc­tion Only

kareempforbesJul 22, 2024, 8:24 PM
−9 points
0 comments2 min readLW link

Is Chi­nese AGI a valid con­cern for the USA?

sammyboizJul 22, 2024, 8:21 PM
0 points
2 comments9 min readLW link

Try­ing to un­der­stand Han­son’s Cul­tural Drift argument

KempJul 22, 2024, 8:20 PM
9 points
3 comments2 min readLW link

Effi­cient Dic­tionary Learn­ing with Switch Sparse Autoencoders

Anish MudideJul 22, 2024, 6:45 PM
118 points
20 comments12 min readLW link

An­a­lyz­ing Deep­Mind’s Prob­a­bil­is­tic Meth­ods for Eval­u­at­ing Agent Capabilities

Jul 22, 2024, 4:17 PM
69 points
0 comments16 min readLW link

The Gar­den of Eden

Alexander TurokJul 22, 2024, 4:07 PM
23 points
2 comments9 min readLW link

Car­ing about excellence

owencbJul 22, 2024, 2:24 PM
47 points
4 commentsLW link

Tim Dillon’s fake busi­ness is the most in­fluen­tial video I have watched in the last 24 months

Stuart JohnsonJul 22, 2024, 12:54 PM
−4 points
0 comments1 min readLW link
(youtu.be)

On the CrowdStrike Incident

ZviJul 22, 2024, 12:40 PM
75 points
14 comments17 min readLW link
(thezvi.wordpress.com)

Auto-En­hance: Devel­op­ing a meta-bench­mark to mea­sure LLM agents’ abil­ity to im­prove other agents

Jul 22, 2024, 12:33 PM
20 points
0 comments14 min readLW link

What does “the uni­verse is quan­tum” ac­tu­ally mean?

TahpJul 22, 2024, 11:52 AM
2 points
0 comments14 min readLW link

Ini­tial Ex­per­i­ments Us­ing SAEs to Help De­tect AI Gen­er­ated Text

Aaron_ScherJul 22, 2024, 5:16 AM
18 points
1 comment14 min readLW link

Cat­e­gories of lead­er­ship on tech­ni­cal teams

benkuhnJul 22, 2024, 4:50 AM
37 points
0 comments8 min readLW link
(www.benkuhn.net)

An ex­per­i­ment on hid­den cognition

Olli JärviniemiJul 22, 2024, 3:26 AM
25 points
2 comments7 min readLW link

OpenAI Boy­cott Revisit

Jake DennieJul 22, 2024, 1:44 AM
17 points
2 comments2 min readLW link

Coal­i­tional agency

Richard_NgoJul 22, 2024, 12:09 AM
56 points
6 comments6 min readLW link

The AI Driver’s Li­cence—A Policy Proposal

Jul 21, 2024, 8:38 PM
0 points
1 comment19 min readLW link