NYT on the Man­i­fest fore­cast­ing conference

Austin Chen9 Oct 2023 21:40 UTC
45 points
14 comments1 min readLW link
(www.nytimes.com)

Fore­cast­ing and pre­dic­tion markets

CarlJ9 Oct 2023 20:43 UTC
3 points
0 comments1 min readLW link

Com­par­ing Two Fore­cast­ers in an Ideal World

nikos9 Oct 2023 19:52 UTC
5 points
0 comments6 min readLW link

The case for af­ter­mar­ket blind spot mirrors

Brendan Long9 Oct 2023 19:30 UTC
57 points
14 comments2 min readLW link
(www.brendanlong.com)

New con­trac­tor role: Web se­cu­rity task force con­trac­tor for AI safety announcements

9 Oct 2023 18:36 UTC
11 points
0 comments2 min readLW link
(survivalandflourishing.com)

[Question] Any­one work­ing on D. Amodei’s Bartlett show tran­script?

Leopard9 Oct 2023 18:17 UTC
10 points
0 comments1 min readLW link

AGI Align­ment is iso­mor­phic to Un­con­di­tional Love

Raghuvar Nadig9 Oct 2023 15:58 UTC
−11 points
0 comments11 min readLW link

Knowl­edge Base 3: Shop­ping ad­vi­sor and other uses of knowl­edge base about products

iwis9 Oct 2023 11:53 UTC
0 points
0 comments4 min readLW link

Knowl­edge Base 2: The struc­ture and the method of building

iwis9 Oct 2023 11:53 UTC
2 points
4 comments8 min readLW link

We don’t un­der­stand what hap­pened with cul­ture enough

Jan_Kulveit9 Oct 2023 9:54 UTC
86 points
21 comments6 min readLW link

Lev­er­ag­ing Bayes’ The­o­rem to Su­per­charge Me­mory Techniques

disoha9 Oct 2023 3:34 UTC
−15 points
1 comment4 min readLW link

Paper: Iden­ti­fy­ing the Risks of LM Agents with an LM-Emu­lated Sand­box—Univer­sity of Toronto 2023 - Bench­mark con­sist­ing of 36 high-stakes tools and 144 test cases!

Singularian25019 Oct 2023 0:00 UTC
5 points
0 comments1 min readLW link

AI Align­ment Break­throughs this week (10/​08/​23)

Logan Zoellner8 Oct 2023 23:30 UTC
30 points
14 comments6 min readLW link

“The Heart of Gam­ing is the Power Fan­tasy”, and Co­hab­itive Games

Raemon8 Oct 2023 21:02 UTC
81 points
49 comments4 min readLW link
(bottomfeeder.substack.com)

FAQ: What the heck is goal ag­nos­ti­cism?

porby8 Oct 2023 19:11 UTC
66 points
36 comments28 min readLW link

Time is ho­mo­ge­neous se­quen­tially-com­pos­able determination

TsviBT8 Oct 2023 14:58 UTC
14 points
0 comments21 min readLW link

Linkpost: Are Emer­gent Abil­ities in Large Lan­guage Models just In-Con­text Learn­ing?

Erich_Grunewald8 Oct 2023 12:14 UTC
12 points
6 comments2 min readLW link
(arxiv.org)

Bird-eye view vi­su­al­iza­tion of LLM activations

Sergii8 Oct 2023 12:12 UTC
11 points
2 comments1 min readLW link
(grgv.xyz)

Per­spec­tive Based Rea­son­ing Could Ab­solve CDT

dadadarren8 Oct 2023 11:22 UTC
4 points
5 comments5 min readLW link

The Gra­di­ent – The Ar­tifi­cial­ity of Alignment

mic8 Oct 2023 4:06 UTC
12 points
1 comment5 min readLW link
(thegradient.pub)

Com­par­ing An­thropic’s Dic­tionary Learn­ing to Ours

Robert_AIZI7 Oct 2023 23:30 UTC
136 points
8 comments4 min readLW link

A thought about the con­straints of debtless­ness in on­line communities

mako yass7 Oct 2023 21:26 UTC
57 points
23 comments1 min readLW link

Ar­gu­ments for util­i­tar­i­anism are im­pos­si­bil­ity ar­gu­ments un­der un­bounded prospects

MichaelStJules7 Oct 2023 21:08 UTC
7 points
7 comments21 min readLW link

Sam Alt­man’s sister, An­nie Alt­man, claims Sam has severely abused her

pl50157 Oct 2023 21:06 UTC
86 points
105 comments28 min readLW link

Griffin Island

jefftk7 Oct 2023 18:40 UTC
14 points
3 comments1 min readLW link
(www.jefftk.com)

Every Men­tion of EA in “Go­ing In­finite”

KirstenH7 Oct 2023 14:42 UTC
48 points
0 comments8 min readLW link
(open.substack.com)

Fix­ing In­sider Threats in the AI Sup­ply Chain

Madhav Malhotra7 Oct 2023 13:19 UTC
20 points
2 comments5 min readLW link

Con­tra Nora Belrose on Orthog­o­nal­ity Th­e­sis Be­ing Trivial

tailcalled7 Oct 2023 11:47 UTC
18 points
21 comments1 min readLW link

Re­lated Dis­cus­sion from Thomas Kwa’s MIRI Re­search Experience

Raemon7 Oct 2023 6:25 UTC
71 points
140 comments1 min readLW link

[Question] Cur­rent State of Prob­a­bil­is­tic Logic

lunatic_at_large7 Oct 2023 5:06 UTC
3 points
2 comments1 min readLW link

On the Re­la­tion­ship Between Vari­abil­ity and the Evolu­tion­ary Out­comes of Sys­tems in Nature

Artyom Shaposhnikov7 Oct 2023 3:06 UTC
2 points
0 comments1 min readLW link

An­nounc­ing Dialogues

Ben Pace7 Oct 2023 2:57 UTC
154 points
51 comments4 min readLW link

Don’t Dis­miss Sim­ple Align­ment Approaches

Chris_Leong7 Oct 2023 0:35 UTC
128 points
9 comments4 min readLW link

Link­ing Alt Accounts

jefftk6 Oct 2023 17:00 UTC
70 points
33 comments1 min readLW link
(www.jefftk.com)

Su­per-Ex­po­nen­tial ver­sus Ex­po­nen­tial Growth in Com­pute Price-Performance

moridinamael6 Oct 2023 16:23 UTC
37 points
21 comments2 min readLW link

A per­sonal ex­pla­na­tion of ELK con­cept and task.

Zeyu Qin6 Oct 2023 3:55 UTC
1 point
0 comments1 min readLW link

The Long-Term Fu­ture Fund is look­ing for a full-time fund chair

5 Oct 2023 22:18 UTC
52 points
0 comments7 min readLW link
(forum.effectivealtruism.org)

Prov­ably Safe AI

PeterMcCluskey5 Oct 2023 22:18 UTC
31 points
15 comments4 min readLW link
(bayesianinvestor.com)

Stampy’s AI Safety Info soft launch

5 Oct 2023 22:13 UTC
120 points
9 comments2 min readLW link

Im­pacts of AI on the hous­ing markets

PottedRosePetal5 Oct 2023 21:24 UTC
8 points
0 comments5 min readLW link

Towards Monose­man­tic­ity: De­com­pos­ing Lan­guage Models With Dic­tionary Learning

Zac Hatfield-Dodds5 Oct 2023 21:01 UTC
286 points
21 comments2 min readLW link
(transformer-circuits.pub)

Ideation and Tra­jec­tory Model­ling in Lan­guage Models

NickyP5 Oct 2023 19:21 UTC
15 points
2 comments10 min readLW link

A well-defined his­tory in mea­surable fac­tor spaces

Matthias G. Mayer5 Oct 2023 18:36 UTC
22 points
0 comments2 min readLW link

Eval­u­at­ing the his­tor­i­cal value mis­speci­fi­ca­tion argument

Matthew Barnett5 Oct 2023 18:34 UTC
162 points
140 comments7 min readLW link

Trans­la­tions Should Invert

abramdemski5 Oct 2023 17:44 UTC
46 points
19 comments3 min readLW link

Cen­sor­ship in LLMs is here to stay be­cause it mir­rors how our own in­tel­li­gence is structured

mnvr5 Oct 2023 17:37 UTC
3 points
0 comments1 min readLW link

Twin Cities ACX Meetup Oc­to­ber 2023

Timothy M.5 Oct 2023 16:29 UTC
1 point
2 comments1 min readLW link

This anime sto­ry­board doesn’t ex­ist: a graphic novel writ­ten and illus­trated by GPT4

RomanS5 Oct 2023 14:01 UTC
12 points
7 comments55 min readLW link

AI #32: Lie Detector

Zvi5 Oct 2023 13:50 UTC
45 points
19 comments44 min readLW link
(thezvi.wordpress.com)

Can the House Leg­is­late?

jefftk5 Oct 2023 13:40 UTC
26 points
6 comments2 min readLW link
(www.jefftk.com)