SIA Is Just Being a Bayesian About the Fact That One Exists

omnizoid Nov 14, 2023, 10:55 PM
3 points
5 comments 4 min read LW link

AI Alignment [progress] this Week (11/12/2023)

Logan Zoellner Nov 14, 2023, 10:21 PM
6 points
0 comments 2 min read LW link
(midwitalignment.substack.com)

[Question] When did Eliezer Yudkowsky change his mind about neural networks?

[deactivated] Nov 14, 2023, 9:24 PM
31 points
15 comments 1 min read LW link

Betting on what is un-falsifiable and un-verifiable

Abhimanyu Pallavi Sudhir Nov 14, 2023, 9:11 PM
13 points
0 comments 15 min read LW link

Facebook is Paying Me to Post

jefftk Nov 14, 2023, 7:10 PM
26 points
5 comments 1 min read LW link
(www.jefftk.com)

Feelings, Nothing More than Feelings, About AI

PaulBecon Nov 14, 2023, 6:50 PM
7 points
0 comments 3 min read LW link

Kids or No kids

Kids or no kids Nov 14, 2023, 6:37 PM
98 points
10 comments 13 min read LW link

Raemon’s Deliberate (“Purposeful?”) Practice Club

Nov 14, 2023, 6:24 PM
61 points
11 comments 22 min read LW link

More metal less ore

Logan Kieller Nov 14, 2023, 4:59 PM
6 points
3 comments 2 min read LW link
(logankieller.substack.com)

Monthly Roundup #12: November 2023

Zvi Nov 14, 2023, 3:20 PM
34 points
5 comments 33 min read LW link
(thezvi.wordpress.com)

Do you want a first-principled preparedness guide to prepare yourself and loved ones for potential catastrophes?

Ulrik Horn Nov 14, 2023, 12:13 PM
16 points
5 comments 15 min read LW link

[Question] Is there Work on Embedded Agency in Cellular Automata Toy Models?

Johannes C. Mayer Nov 14, 2023, 9:08 AM
10 points
0 comments 1 min read LW link

[Question] Would this be Progress in Solving Embedded Agency?

Johannes C. Mayer Nov 14, 2023, 9:08 AM
9 points
2 comments 2 min read LW link

Is Interpretability All We Need?

RogerDearnaley Nov 14, 2023, 5:31 AM
1 point
1 comment 1 min read LW link

What is wisdom?

TsviBT Nov 14, 2023, 2:13 AM
39 points
3 comments 13 min read LW link

Festival Stats 2023

jefftk Nov 14, 2023, 1:20 AM
9 points
0 comments 1 min read LW link
(www.jefftk.com)

Out of the Box

jesseduffield Nov 13, 2023, 11:43 PM
5 points
1 comment 7 min read LW link

Loudly Give Up, Don’t Quietly Fade

Screwtape Nov 13, 2023, 11:30 PM
165 points
12 comments 6 min read LW link 1 review

Great Empathy and Great Response Ability

positivesum Nov 13, 2023, 11:04 PM
16 points
0 comments 3 min read LW link
(tryingtruly.substack.com)

Theories of Change for AI Auditing

Nov 13, 2023, 7:33 PM
54 points
0 comments 18 min read LW link
(www.apolloresearch.ai)

They are made of repeating patterns

quetzal_rainbow Nov 13, 2023, 6:17 PM
53 points
4 comments 2 min read LW link

How to Upload a Mind (In Three Not-So-Easy Steps)

Nov 13, 2023, 6:13 PM
26 points
0 comments 7 min read LW link
(youtu.be)

Non-myopia stories

lberglund Nov 13, 2023, 5:52 PM
29 points
10 comments 7 min read LW link

It’s OK to eat shrimp: EAs Make Invalid Inferences About Fish Qualia and Moral Patienthood

Mikhail Samin Nov 13, 2023, 4:51 PM
0 points
17 comments LW link

Suggestions for chess puzzles

Zane Nov 13, 2023, 3:39 PM
13 points
1 comment 1 min read LW link

Why small phenomenons are relevant to morality

Ryo Nov 13, 2023, 3:25 PM
1 point
0 comments 3 min read LW link

Optionality approach to ethics

Ryo Nov 13, 2023, 3:23 PM
7 points
2 comments 3 min read LW link

Redirecting one’s own taxes as an effective altruism method

David Gross Nov 13, 2023, 3:17 PM
−5 points
34 comments 16 min read LW link

AISC Project: Benchmarks for Stable Reflectivity

jacquesthibs Nov 13, 2023, 2:51 PM
17 points
0 comments 8 min read LW link

Research Agenda: Modelling Trajectories of Language Models

NickyP Nov 13, 2023, 2:33 PM
28 points
0 comments 12 min read LW link

Bostrom Goes Unheard

Zvi Nov 13, 2023, 2:11 PM
81 points
9 comments 18 min read LW link

November hangout in Warsaw

ntoxeg Nov 13, 2023, 1:20 PM
1 point
1 comment 1 min read LW link

The Science Algorithm AISC Project

Johannes C. Mayer Nov 13, 2023, 12:52 PM
12 points
0 comments 1 min read LW link
(docs.google.com)

You can just spontaneously call people you haven’t met in years

lc Nov 13, 2023, 5:21 AM
167 points
21 comments 1 min read LW link

Zvi’s Manifold Markets House Rules

Zvi Nov 13, 2023, 12:28 AM
53 points
6 comments 3 min read LW link

[Question] What’s your best utilitarian model for risking your best kidneys?

Ilio Nov 12, 2023, 11:01 PM
−3 points
4 comments 1 min read LW link

Helpful examples to get a sense of modern automated manipulation

trevor Nov 12, 2023, 8:49 PM
33 points
4 comments 9 min read LW link

The Snuggle/Date/Slap Protocol

MadHatter Nov 12, 2023, 8:44 PM
−21 points
4 comments 2 min read LW link

Two children’s stories

Optimization Process Nov 12, 2023, 8:29 PM
10 points
1 comment 7 min read LW link

The Fundamental Theorem for measurable factor spaces

Matthias G. Mayer Nov 12, 2023, 7:25 PM
38 points
2 comments 2 min read LW link

How accurate are standard Dark Triad personality scales?

jamesbill Nov 12, 2023, 8:21 AM
0 points
2 comments 2 min read LW link

[Question] What ML gears do you like?

Ulisse Mini Nov 11, 2023, 7:10 PM
25 points
4 comments 1 min read LW link

Smart Sessions—Finally a (kinda) window-centric session manager

Eli Tyre Nov 11, 2023, 6:54 PM
14 points
3 comments 5 min read LW link

AISC project: SatisfIA – AI that satisfies without overdoing it

Jobst Heitzig Nov 11, 2023, 6:22 PM
12 points
0 comments 1 min read LW link
(docs.google.com)

Control Symmetry: why we might want to start investigating asymmetric alignment interventions

domenicrosati Nov 11, 2023, 5:27 PM
25 points
1 comment 2 min read LW link

Game Theory without Argmax [Part 2]

Cleo Nardo Nov 11, 2023, 4:02 PM
31 points
14 comments 13 min read LW link

Game Theory without Argmax [Part 1]

Cleo Nardo Nov 11, 2023, 3:59 PM
70 points
18 comments 19 min read LW link

It’s OK to be biased towards humans

dr_s Nov 11, 2023, 11:59 AM
54 points
69 comments 6 min read LW link

The Top AI Safety Bets for 2023: GiveWiki’s Latest Recommendations

Dawn Drescher Nov 11, 2023, 9:04 AM
3 points
2 comments LW link

Artificial General Horsiness

robotelvis Nov 11, 2023, 5:15 AM
4 points
0 comments 5 min read LW link
(messyprogress.substack.com)