SIA Is Just Be­ing a Bayesian About the Fact That One Ex­ists

Bentham's BulldogNov 14, 2023, 10:55 PM
3 points

2 votes

Overall karma indicates overall quality.

5 comments4 min readLW link

AI Align­ment [progress] this Week (11/​12/​2023)

Logan ZoellnerNov 14, 2023, 10:21 PM
6 points

1 vote

Overall karma indicates overall quality.

0 comments2 min readLW link
(midwitalignment.substack.com)

[Question] When did Eliezer Yud­kowsky change his mind about neu­ral net­works?

[deactivated]Nov 14, 2023, 9:24 PM
31 points

19 votes

Overall karma indicates overall quality.

15 comments1 min readLW link

Bet­ting on what is un-falsifi­able and un-verifiable

Abhimanyu Pallavi SudhirNov 14, 2023, 9:11 PM
13 points

7 votes

Overall karma indicates overall quality.

0 comments15 min readLW link

Face­book is Pay­ing Me to Post

jefftkNov 14, 2023, 7:10 PM
26 points

10 votes

Overall karma indicates overall quality.

5 comments1 min readLW link
(www.jefftk.com)

Feel­ings, Noth­ing More than Feel­ings, About AI

PaulBeconNov 14, 2023, 6:50 PM
7 points

5 votes

Overall karma indicates overall quality.

0 comments3 min readLW link

Kids or No kids

Kids or no kidsNov 14, 2023, 6:37 PM
98 points

49 votes

Overall karma indicates overall quality.

10 comments13 min readLW link

Rae­mon’s De­liber­ate (“Pur­pose­ful?”) Prac­tice Club

Nov 14, 2023, 6:24 PM
61 points

22 votes

Overall karma indicates overall quality.

11 comments22 min readLW link

More metal less ore

Logan KiellerNov 14, 2023, 4:59 PM
10 points

7 votes

Overall karma indicates overall quality.

3 comments2 min readLW link
(logankieller.substack.com)

Monthly Roundup #12: Novem­ber 2023

ZviNov 14, 2023, 3:20 PM
34 points

13 votes

Overall karma indicates overall quality.

5 comments33 min readLW link
(thezvi.wordpress.com)

Do you want a first-prin­ci­pled pre­pared­ness guide to pre­pare your­self and loved ones for po­ten­tial catas­tro­phes?

Ulrik HornNov 14, 2023, 12:13 PM
16 points

9 votes

Overall karma indicates overall quality.

5 comments15 min readLW link

[Question] Is there Work on Embed­ded Agency in Cel­lu­lar Au­tomata Toy Models?

Johannes C. MayerNov 14, 2023, 9:08 AM
10 points

5 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

[Question] Would this be Progress in Solv­ing Embed­ded Agency?

Johannes C. MayerNov 14, 2023, 9:08 AM
9 points

5 votes

Overall karma indicates overall quality.

2 comments2 min readLW link

Is In­ter­pretabil­ity All We Need?

RogerDearnaleyNov 14, 2023, 5:31 AM
1 point

1 vote

Overall karma indicates overall quality.

1 comment1 min readLW link

What is wis­dom?

TsviBTNov 14, 2023, 2:13 AM
39 points

15 votes

Overall karma indicates overall quality.

3 comments13 min readLW link

Fes­ti­val Stats 2023

jefftkNov 14, 2023, 1:20 AM
9 points

1 vote

Overall karma indicates overall quality.

0 comments1 min readLW link
(www.jefftk.com)

Out of the Box

jesseduffieldNov 13, 2023, 11:43 PM
5 points

3 votes

Overall karma indicates overall quality.

1 comment7 min readLW link

Loudly Give Up, Don’t Quietly Fade

ScrewtapeNov 13, 2023, 11:30 PM
180 points

103 votes

Overall karma indicates overall quality.

12 comments6 min readLW link1 review

Great Em­pa­thy and Great Re­sponse Ability

positivesumNov 13, 2023, 11:04 PM
16 points

8 votes

Overall karma indicates overall quality.

0 comments3 min readLW link
(tryingtruly.substack.com)

The­o­ries of Change for AI Auditing

Nov 13, 2023, 7:33 PM
54 points

27 votes

Overall karma indicates overall quality.

0 comments18 min readLW link
(www.apolloresearch.ai)

They are made of re­peat­ing patterns

quetzal_rainbowNov 13, 2023, 6:17 PM
61 points

37 votes

Overall karma indicates overall quality.

4 comments2 min readLW link

How to Upload a Mind (In Three Not-So-Easy Steps)

Nov 13, 2023, 6:13 PM
26 points

12 votes

Overall karma indicates overall quality.

0 comments7 min readLW link
(youtu.be)

Non-my­opia stories

lberglundNov 13, 2023, 5:52 PM
29 points

15 votes

Overall karma indicates overall quality.

10 comments7 min readLW link

It’s OK to eat shrimp: EAs Make In­valid In­fer­ences About Fish Qualia and Mo­ral Patienthood

Mikhail SaminNov 13, 2023, 4:51 PM
0 points

37 votes

Overall karma indicates overall quality.

17 comments7 min readLW link

Sugges­tions for chess puzzles

ZaneNov 13, 2023, 3:39 PM
13 points

4 votes

Overall karma indicates overall quality.

1 comment1 min readLW link

Why small phe­nomenons are rele­vant to moral­ity ​

Ryo Nov 13, 2023, 3:25 PM
1 point

1 vote

Overall karma indicates overall quality.

0 comments3 min readLW link

Op­tion­al­ity ap­proach to ethics

Ryo Nov 13, 2023, 3:23 PM
7 points

3 votes

Overall karma indicates overall quality.

2 comments3 min readLW link

Redi­rect­ing one’s own taxes as an effec­tive al­tru­ism method

David GrossNov 13, 2023, 3:17 PM
−11 points

95 votes

Overall karma indicates overall quality.

34 comments16 min readLW link

AISC Pro­ject: Bench­marks for Stable Reflectivity

jacquesthibsNov 13, 2023, 2:51 PM
17 points

5 votes

Overall karma indicates overall quality.

0 comments8 min readLW link

Re­search Adenda: Model­ling Tra­jec­to­ries of Lan­guage Models

NickyPNov 13, 2023, 2:33 PM
28 points

13 votes

Overall karma indicates overall quality.

0 comments12 min readLW link

Bostrom Goes Unheard

ZviNov 13, 2023, 2:11 PM
81 points

38 votes

Overall karma indicates overall quality.

9 comments18 min readLW link

Novem­ber hang­out in Warsaw

ntoxegNov 13, 2023, 1:20 PM
1 point

1 vote

Overall karma indicates overall quality.

1 comment1 min readLW link

The Science Al­gorithm AISC Project

Johannes C. MayerNov 13, 2023, 12:52 PM
12 points

7 votes

Overall karma indicates overall quality.

0 comments1 min readLW link
(docs.google.com)

You can just spon­ta­neously call peo­ple you haven’t met in years

lcNov 13, 2023, 5:21 AM
169 points

105 votes

Overall karma indicates overall quality.

21 comments1 min readLW link

Zvi’s Man­i­fold Mar­kets House Rules

ZviNov 13, 2023, 12:28 AM
53 points

23 votes

Overall karma indicates overall quality.

6 comments3 min readLW link

[Question] What’s your best util­i­tar­ian model for risk­ing your best kid­neys?

IlioNov 12, 2023, 11:01 PM
−3 points

6 votes

Overall karma indicates overall quality.

4 comments1 min readLW link

Helpful ex­am­ples to get a sense of mod­ern au­to­mated manipulation

trevorNov 12, 2023, 8:49 PM
33 points

18 votes

Overall karma indicates overall quality.

4 comments9 min readLW link

The Snug­gle/​Date/​Slap Protocol

MadHatterNov 12, 2023, 8:44 PM
−21 points

17 votes

Overall karma indicates overall quality.

4 comments2 min readLW link

Two chil­dren’s stories

Optimization ProcessNov 12, 2023, 8:29 PM
10 points

8 votes

Overall karma indicates overall quality.

1 comment7 min readLW link

The Fun­da­men­tal The­o­rem for mea­surable fac­tor spaces

Matthias G. MayerNov 12, 2023, 7:25 PM
41 points

12 votes

Overall karma indicates overall quality.

2 comments2 min readLW link

How ac­cu­rate are stan­dard Dark Triad per­son­al­ity scales?

jamesbillNov 12, 2023, 8:21 AM
0 points

5 votes

Overall karma indicates overall quality.

2 comments2 min readLW link

[Question] What ML gears do you like?

Ulisse MiniNov 11, 2023, 7:10 PM
25 points

13 votes

Overall karma indicates overall quality.

4 comments1 min readLW link

Smart Ses­sions—Fi­nally a (kinda) win­dow-cen­tric ses­sion manager

Eli TyreNov 11, 2023, 6:54 PM
14 points

5 votes

Overall karma indicates overall quality.

3 comments5 min readLW link

AISC pro­ject: Satis­fIA – AI that satis­fies with­out over­do­ing it

Jobst HeitzigNov 11, 2023, 6:22 PM
12 points

7 votes

Overall karma indicates overall quality.

0 comments1 min readLW link
(docs.google.com)

Con­trol Sym­me­try: why we might want to start in­ves­ti­gat­ing asym­met­ric al­ign­ment interventions

domenicrosatiNov 11, 2023, 5:27 PM
25 points

13 votes

Overall karma indicates overall quality.

1 comment2 min readLW link

Game The­ory with­out Argmax [Part 2]

Cleo NardoNov 11, 2023, 4:02 PM
31 points

12 votes

Overall karma indicates overall quality.

14 comments13 min readLW link

Game The­ory with­out Argmax [Part 1]

Cleo NardoNov 11, 2023, 3:59 PM
70 points

28 votes

Overall karma indicates overall quality.

18 comments19 min readLW link

It’s OK to be bi­ased to­wards humans

dr_sNov 11, 2023, 11:59 AM
54 points

30 votes

Overall karma indicates overall quality.

69 comments6 min readLW link

The Top AI Safety Bets for 2023: GiveWiki’s Lat­est Recommendations

Dawn DrescherNov 11, 2023, 9:04 AM
3 points

6 votes

Overall karma indicates overall quality.

2 comments8 min readLW link

Ar­tifi­cial Gen­eral Horsiness

robotelvisNov 11, 2023, 5:15 AM
4 points

9 votes

Overall karma indicates overall quality.

0 comments5 min readLW link
(messyprogress.substack.com)