Headphones hook

philh · Sep 29, 2023, 10:50 PM
21 points
1 comment · 3 min read · LW link
(reasonableapproximation.net)

Paul Christiano’s views on “doom” (video explainer)

Michaël Trazzi · Sep 29, 2023, 9:56 PM
15 points
0 comments · 1 min read · LW link
(youtu.be)

The Retroactive Funding Landscape: Innovations for Donors and Grantmakers

Dawn Drescher · Sep 29, 2023, 5:39 PM
13 points
0 comments · LW link
(impactmarkets.substack.com)

Bids To Defer On Value Judgements

johnswentworth · Sep 29, 2023, 5:07 PM
58 points
6 comments · 3 min read · LW link

Announcing FAR Labs, an AI safety coworking space

Ben Goldhaber · Sep 29, 2023, 4:52 PM
95 points
0 comments · 1 min read · LW link

A tool for searching rationalist & EA webs

Daniel_Friedrich · Sep 29, 2023, 3:23 PM
4 points
0 comments · 1 min read · LW link
(ratsearch.blogspot.com)

Basic Mathematics of Predictive Coding

Adam Shai · Sep 29, 2023, 2:38 PM
49 points
6 comments · 9 min read · LW link

“Diamondoid bacteria” nanobots: deadly threat or dead-end? A nanotech investigation

titotal · Sep 29, 2023, 2:01 PM
160 points
79 comments · LW link
(titotal.substack.com)

Steering subsystems: capabilities, agency, and alignment

Seth Herd · Sep 29, 2023, 1:45 PM
31 points
0 comments · 8 min read · LW link

Apply to Usable Security Prize by September 30

Allison Duettmann · Sep 29, 2023, 1:39 PM
4 points
0 comments · 1 min read · LW link

List of how people have become more hard-working

Chi Nguyen · Sep 29, 2023, 11:30 AM
69 points
7 comments · LW link

Resolving moral uncertainty with randomization

Sep 29, 2023, 11:23 AM
7 points
1 comment · 11 min read · LW link

EA Vegan Advocacy is not truthseeking, and it’s everyone’s problem

Elizabeth · Sep 28, 2023, 11:30 PM
323 points
250 comments · 22 min read · LW link · 2 reviews
(acesounderglass.com)

Competitive, Cooperative, and Cohabitive

Screwtape · Sep 28, 2023, 11:25 PM
49 points
13 comments · 5 min read · LW link · 1 review

The Coming Wave

PeterMcCluskey · Sep 28, 2023, 10:59 PM
27 points
1 comment · 6 min read · LW link
(bayesianinvestor.com)

High-level interpretability: detecting an AI’s objectives

Sep 28, 2023, 7:30 PM
72 points
4 comments · 21 min read · LW link

How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

Sep 28, 2023, 6:53 PM
187 points
39 comments · 3 min read · LW link · 1 review

Responsible scaling policy TLDR

lemonhope · Sep 28, 2023, 6:51 PM
9 points
0 comments · 1 min read · LW link

Alignment Workshop talks

Richard_Ngo · Sep 28, 2023, 6:26 PM
37 points
1 comment · 1 min read · LW link
(www.alignment-workshop.com)

My Current Thoughts on the AI Strategic Landscape

Jeffrey Heninger · Sep 28, 2023, 5:59 PM
11 points
28 comments · 14 min read · LW link

My Arrogant Plan for Alignment

MrArrogant · Sep 28, 2023, 5:51 PM
2 points
6 comments · 6 min read · LW link

Discursive Competence in ChatGPT, Part 2: Memory for Texts

Bill Benzon · Sep 28, 2023, 4:34 PM
1 point
0 comments · 3 min read · LW link

Different views of alignment have different consequences for imperfect methods

Stuart_Armstrong · Sep 28, 2023, 4:31 PM
31 points
0 comments · 1 min read · LW link

AI #31: It Can Do What Now?

Zvi · Sep 28, 2023, 4:00 PM
90 points
6 comments · 40 min read · LW link
(thezvi.wordpress.com)

The point of a game is not to win, and you shouldn’t even pretend that it is

mako yass · Sep 28, 2023, 3:54 PM
51 points
27 comments · 4 min read · LW link
(makopool.com)

Cohabitive Games so Far

mako yass · Sep 28, 2023, 3:41 PM
131 points
146 comments · 19 min read · LW link · 2 reviews
(makopool.com)

Wobbly Table Theorem in Practice

Morpheus · Sep 28, 2023, 2:33 PM
24 points
0 comments · 2 min read · LW link

Weighing Animal Worth

jefftk · Sep 28, 2023, 1:50 PM
25 points
11 comments · 2 min read · LW link
(www.jefftk.com)

ARC Evals: Responsible Scaling Policies

Zach Stein-Perlman · Sep 28, 2023, 4:30 AM
40 points
10 comments · 2 min read · LW link · 1 review
(evals.alignment.org)

Petrov Day Retrospective, 2023 (re: the most important virtue of Petrov Day & unilaterally promoting it)

Ruby · Sep 28, 2023, 2:48 AM
66 points
73 comments · 6 min read · LW link

Jimmy Apples, source of the rumor that OpenAI has achieved AGI internally, is a credible insider.

Jorterder · Sep 28, 2023, 1:20 AM
−6 points
2 comments · 1 min read · LW link
(twitter.com)

Investigating the rumors of OpenAI achieving AGI

Jorterder · Sep 28, 2023, 1:17 AM
−4 points
1 comment · 1 min read · LW link

Alibaba Group releases Qwen, 14B parameter LLM

Nikola Jurkovic · Sep 28, 2023, 12:12 AM
5 points
1 comment · 1 min read · LW link
(qianwen-res.oss-cn-beijing.aliyuncs.com)

Metaculus Launches 2023/2024 FluSight Challenge Supporting CDC, $5K in Prizes

ChristianWilliams · Sep 27, 2023, 9:35 PM
5 points
0 comments · LW link
(www.metaculus.com)

Projects I would like to see (possibly at AI Safety Camp)

Linda Linsefors · Sep 27, 2023, 9:27 PM
22 points
12 comments · 4 min read · LW link

Towards Better Milestones for Monitoring AI Capabilities

snewman · Sep 27, 2023, 9:18 PM
11 points
0 comments · 14 min read · LW link

[Question] Is Bjorn Lomborg roughly right about climate change policy?

yhoiseth · Sep 27, 2023, 8:06 PM
29 points
14 comments · 2 min read · LW link
(www.sciencedirect.com)

Commonsense Good, Creative Good

jefftk · Sep 27, 2023, 7:50 PM
44 points
11 comments · 3 min read · LW link
(www.jefftk.com)

Petrov Day [Spoiler Warning]

lsusr · Sep 27, 2023, 7:20 PM
6 points
6 comments · 1 min read · LW link

The Hidden Complexity of Wishes—The Animation

Writer · Sep 27, 2023, 5:59 PM
33 points
0 comments · 1 min read · LW link
(youtu.be)

MMLU’s Moral Scenarios Benchmark Doesn’t Measure What You Think it Measures

corey morris · Sep 27, 2023, 5:54 PM
18 points
3 comments · 4 min read · LW link
(medium.com)

[Question] What’s your standard for good work performance?

Chi Nguyen · Sep 27, 2023, 4:58 PM
30 points
3 comments · 1 min read · LW link

The Role of Groups in the Progression of Human Understanding

Chris_Leong · Sep 27, 2023, 3:09 PM
11 points
0 comments · 2 min read · LW link

The Great Disembedding

rogersbacon · Sep 27, 2023, 2:53 PM
16 points
4 comments · 16 min read · LW link
(www.secretorum.life)

[Question] how do short-timeliners reason about the differences between brain and AI?

JavierCC · Sep 27, 2023, 8:13 AM
2 points
11 comments · 1 min read · LW link

[Question] Is there a widely accepted metric for ‘genuineness’ in interpersonal communication?

M. Y. Zuo · Sep 27, 2023, 5:30 AM
6 points
3 comments · 1 min read · LW link

Bariatric surgery seems like a no-brainer for most morbidly obese people

lc · Sep 27, 2023, 1:05 AM
12 points
12 comments · 3 min read · LW link

Jacob on the Precipice

Richard_Ngo · Sep 26, 2023, 9:16 PM
45 points
8 comments · 11 min read · LW link
(narrativeark.substack.com)

Text Posts from the Kids Group: 2022

jefftk · Sep 26, 2023, 8:40 PM
33 points
2 comments · 7 min read · LW link
(www.jefftk.com)

GPT-4 for personal productivity: online distraction blocker

Sergii · Sep 26, 2023, 5:41 PM
65 points
13 comments · 2 min read · LW link
(grgv.xyz)