You’re Play­ing a Rough Game

jefftkOct 17, 2024, 7:20 PM
25 points
2 comments2 min readLW link
(www.jefftk.com)

P=NP

OnePolynomialOct 17, 2024, 5:56 PM
−25 points
0 comments8 min readLW link

Fac­tor­ing P(doom) into a bayesian network

Joseph GardiOct 17, 2024, 5:55 PM
1 point
0 comments1 min readLW link

un­der­stand­ing bureaucracy

dhruvmethiOct 17, 2024, 5:55 PM
1 point
2 comments8 min readLW link

AI #86: Just Think of the Potential

ZviOct 17, 2024, 3:10 PM
58 points
8 comments57 min readLW link
(thezvi.wordpress.com)

Con­crete benefits of mak­ing predictions

Oct 17, 2024, 2:23 PM
35 points
5 comments6 min readLW link
(fatebook.io)

Arith­metic is an un­der­rated world-mod­el­ing technology

dynomightOct 17, 2024, 2:00 PM
152 points
33 comments6 min readLW link
(dynomight.net)

The Com­pu­ta­tional Com­plex­ity of Cir­cuit Dis­cov­ery for In­ner Interpretability

Bogdan Ionut CirsteaOct 17, 2024, 1:18 PM
11 points
2 comments1 min readLW link
(arxiv.org)

[Question] is there a big dic­tio­nary some­where with all your jar­gon and acronyms and what­not?

KvmanThinkingOct 17, 2024, 11:30 AM
4 points
7 comments1 min readLW link

[Question] Is there a known method to find oth­ers who came across the same po­ten­tial in­fo­haz­ard with­out spoiling it to the pub­lic?

hiveOct 17, 2024, 10:47 AM
4 points
6 comments1 min readLW link

It is time to start war gam­ing for AGI

yanni kyriacosOct 17, 2024, 5:14 AM
4 points
1 comment1 min readLW link

[Question] Re­in­force­ment Learn­ing: Essen­tial Step Towards AGI or Ir­rele­vant?

DoubleOct 17, 2024, 3:37 AM
1 point
0 comments1 min readLW link

[Question] En­deav­orOTC le­git?

FinalFormal2Oct 17, 2024, 1:33 AM
3 points
0 comments1 min readLW link

The Cog­ni­tive Boot­camp Agreement

RaemonOct 16, 2024, 11:24 PM
36 points
0 comments8 min readLW link

Bit­ter les­sons about lu­cid dreaming

avturchinOct 16, 2024, 9:27 PM
77 points
62 comments2 min readLW link

Towards Quan­ti­ta­tive AI Risk Management

Oct 16, 2024, 7:26 PM
28 points
1 comment6 min readLW link

Why Academia is Mostly Not Truth-Seeking

Zero ContradictionsOct 16, 2024, 7:14 PM
−7 points
6 comments1 min readLW link
(thewaywardaxolotl.blogspot.com)

Launch­ing Ad­ja­cent News

Lucas KohorstOct 16, 2024, 5:58 PM
24 points
0 comments4 min readLW link

[Question] In­ter­est in Leet­code, but for Ra­tion­al­ity?

Gregory Oct 16, 2024, 5:54 PM
74 points
20 comments2 min readLW link

Re­quest for ad­vice: Re­search for Con­ver­sa­tional Game The­ory for LLMs

Rome ViharoOct 16, 2024, 5:53 PM
10 points
0 comments1 min readLW link

Why hu­mans won’t con­trol su­per­hu­man AIs.

Spiritus DeiOct 16, 2024, 4:48 PM
−11 points
1 comment6 min readLW link

Against em­pa­thy-by-default

Steven ByrnesOct 16, 2024, 4:38 PM
60 points
24 comments7 min readLW link

can­cer rates af­ter gene therapy

bhauthOct 16, 2024, 3:32 PM
53 points
2 comments3 min readLW link
(bhauth.com)

Monthly Roundup #23: Oc­to­ber 2024

ZviOct 16, 2024, 1:50 PM
39 points
13 comments50 min readLW link
(thezvi.wordpress.com)

[Question] Change My Mind: Thirders in “Sleep­ing Beauty” are Just Do­ing Episte­mol­ogy Wrong

DragonGodOct 16, 2024, 10:20 AM
8 points
67 comments6 min readLW link

[Question] After up­load­ing your con­scious­ness...

Jinge WangOct 16, 2024, 3:52 AM
−2 points
0 comments1 min readLW link

The ELYSIUM Pro­posal - Ex­trap­o­lated voLi­tions Yield­ing Separate In­di­vi­d­u­al­ized Utopias for Mankind

RokoOct 16, 2024, 1:24 AM
9 points
18 comments1 min readLW link
(transhumanaxiology.substack.com)

Bel­le­vue Meetup

CedarOct 16, 2024, 1:07 AM
3 points
0 comments1 min readLW link

Sin­gu­lar Learn­ing The­ory for Dummies

Rahul ChandOct 15, 2024, 9:13 PM
1 point
0 comments8 min readLW link

Distil­la­tion Of Deep­Seek-Prover V1.5

IvanLinOct 15, 2024, 6:53 PM
4 points
1 comment3 min readLW link

Im­prov­ing Model-Writ­ten Evals for AI Safety Benchmarking

Oct 15, 2024, 6:25 PM
30 points
0 comments18 min readLW link

Tak­ing non­log­i­cal con­cepts seriously

Kris BrownOct 15, 2024, 6:16 PM
7 points
5 comments18 min readLW link
(topos.site)

Rashomon—A news­bet­ting site

ideastheteOct 15, 2024, 6:15 PM
23 points
8 comments1 min readLW link

On the Prac­ti­cal Ap­pli­ca­tions of Interpretability

Nick JiangOct 15, 2024, 5:18 PM
4 points
1 comment7 min readLW link

An­thropic’s up­dated Re­spon­si­ble Scal­ing Policy

Zac Hatfield-DoddsOct 15, 2024, 4:46 PM
38 points
3 comments3 min readLW link
(www.anthropic.com)

[Question] When is re­ward ever the op­ti­miza­tion tar­get?

Noosphere89Oct 15, 2024, 3:09 PM
37 points
17 comments1 min readLW link

An Opinionated Evals Read­ing List

Oct 15, 2024, 2:38 PM
65 points
0 comments13 min readLW link
(www.apolloresearch.ai)

An­thropic rewrote its RSP

Zach Stein-PerlmanOct 15, 2024, 2:25 PM
46 points
19 comments6 min readLW link

[In­tu­itive self-mod­els] 5. Dis­so­ci­a­tive Iden­tity (Mul­ti­ple Per­son­al­ity) Disorder

Steven ByrnesOct 15, 2024, 1:31 PM
59 points
7 comments11 min readLW link

Eco­nomics Roundup #4

ZviOct 15, 2024, 1:20 PM
19 points
4 comments25 min readLW link
(thezvi.wordpress.com)

[Question] Is School of Thought re­lated to the Ra­tion­al­ity Com­mu­nity?

Shoshannah TekofskyOct 15, 2024, 12:41 PM
7 points
12 comments1 min readLW link

In­verse Prob­lems In Every­day Life

silentbobOct 15, 2024, 11:42 AM
14 points
2 comments8 min readLW link

Think­ing LLMs: Gen­eral In­struc­tion Fol­low­ing with Thought Generation

Bogdan Ionut CirsteaOct 15, 2024, 9:21 AM
7 points
0 comments1 min readLW link
(arxiv.org)

Thoughts On the Na­ture of Ca­pa­bil­ity Elic­i­ta­tion via Fine-tuning

Theodore ChapmanOct 15, 2024, 8:39 AM
8 points
0 comments8 min readLW link

Min­i­mal Mo­ti­va­tion of Nat­u­ral Latents

Oct 14, 2024, 10:51 PM
46 points
14 comments3 min readLW link

How long should poli­ti­cal (and other) terms be?

ohmurphyOct 14, 2024, 9:38 PM
5 points
0 comments1 min readLW link
(ohmurphy.substack.com)

Ex­am­ples of How I Use LLMs

jefftkOct 14, 2024, 5:10 PM
31 points
2 comments2 min readLW link
(www.jefftk.com)

It’s im­por­tant to know when to stop: Mechanis­tic Ex­plo­ra­tion of Gemma 2 List Generation

Gerard BoxoOct 14, 2024, 5:04 PM
9 points
0 comments6 min readLW link
(gboxo.github.io)

[Question] LW re­sources on child­hood ex­pe­riences?

nahir91595Oct 14, 2024, 5:04 PM
10 points
7 comments1 min readLW link

Free Will, Neu­rotyp­i­cal Dom­i­nance, and the Path to ASI and Neu­ral­inks: Evolv­ing Beyond Scarcity

j_passeriOct 14, 2024, 4:54 PM
−2 points
3 comments3 min readLW link