The ‘Ne­glected Ap­proaches’ Ap­proach: AE Stu­dio’s Align­ment Agenda

Dec 18, 2023, 8:35 PM
187 points

74 votes

Overall karma indicates overall quality.

23 comments12 min readLW link1 review

The Short­est Path Between Scylla and Charybdis

Thane RuthenisDec 18, 2023, 8:08 PM
50 points

21 votes

Overall karma indicates overall quality.

8 comments5 min readLW link

OpenAI: Pre­pared­ness framework

Zach Stein-PerlmanDec 18, 2023, 6:30 PM
70 points

30 votes

Overall karma indicates overall quality.

23 comments4 min readLW link
(openai.com)

[Valence se­ries] 5. “Valence Di­sor­ders” in Men­tal Health & Personality

Steven ByrnesDec 18, 2023, 3:26 PM
45 points

15 votes

Overall karma indicates overall quality.

13 comments13 min readLW link

Dis­cus­sion: Challenges with Un­su­per­vised LLM Knowl­edge Discovery

Dec 18, 2023, 11:58 AM
149 points

57 votes

Overall karma indicates overall quality.

21 comments10 min readLW link

In­ter­pret­ing the Learn­ing of Deceit

RogerDearnaleyDec 18, 2023, 8:12 AM
30 points

11 votes

Overall karma indicates overall quality.

14 comments9 min readLW link

Talk: “AI Would Be A Lot Less Alarm­ing If We Un­der­stood Agents”

johnswentworthDec 17, 2023, 11:46 PM
58 points

21 votes

Overall karma indicates overall quality.

3 comments1 min readLW link
(www.youtube.com)

∀: a story

Richard_NgoDec 17, 2023, 10:42 PM
40 points

30 votes

Overall karma indicates overall quality.

1 comment8 min readLW link
(www.narrativeark.xyz)

Re­viv­ing a 2015 MacBook

jefftkDec 17, 2023, 9:00 PM
11 points

3 votes

Overall karma indicates overall quality.

0 comments1 min readLW link
(www.jefftk.com)

A Com­mon-Sense Case For Mu­tu­ally-Misal­igned AGIs Ally­ing Against Humans

Thane RuthenisDec 17, 2023, 8:28 PM
29 points

16 votes

Overall karma indicates overall quality.

7 comments11 min readLW link

The Limits of Ar­tifi­cial Con­scious­ness: A Biol­ogy-Based Cri­tique of Chalmers’ Fad­ing Qualia Argument

Štěpán LosDec 17, 2023, 7:11 PM
−6 points

5 votes

Overall karma indicates overall quality.

9 comments17 min readLW link

What makes teach­ing math special

ViliamDec 17, 2023, 2:15 PM
45 points

20 votes

Overall karma indicates overall quality.

27 comments11 min readLW link

The pre­dic­tive power of dis­si­pa­tive adaptation

dr_sDec 17, 2023, 2:01 PM
56 points

25 votes

Overall karma indicates overall quality.

14 comments19 min readLW link

Linkpost: Francesca v Harvard

LinchDec 17, 2023, 6:18 AM
5 points

11 votes

Overall karma indicates overall quality.

5 comments2 min readLW link
(www.francesca-v-harvard.org)

The Serendipity of Density

jefftkDec 17, 2023, 3:50 AM
40 points

19 votes

Overall karma indicates overall quality.

4 comments1 min readLW link
(www.jefftk.com)

Bounty: Di­verse hard tasks for LLM agents

Dec 17, 2023, 1:04 AM
49 points

27 votes

Overall karma indicates overall quality.

31 comments16 min readLW link

2022 (and All Time) Posts by Ping­back Count

RaemonDec 16, 2023, 9:17 PM
53 points

21 votes

Overall karma indicates overall quality.

14 comments6 min readLW link

“Hu­man­ity vs. AGI” Will Never Look Like “Hu­man­ity vs. AGI” to Humanity

Thane RuthenisDec 16, 2023, 8:08 PM
192 points

97 votes

Overall karma indicates overall quality.

34 comments5 min readLW link

A vi­sual anal­ogy for text gen­er­a­tion by LLMs?

Bill BenzonDec 16, 2023, 5:58 PM
3 points

2 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

cold alu­minum for medicine

bhauthDec 16, 2023, 2:38 PM
42 points

19 votes

Overall karma indicates overall quality.

4 comments4 min readLW link
(www.bhauth.com)

Scal­able Over­sight and Weak-to-Strong Gen­er­al­iza­tion: Com­pat­i­ble ap­proaches to the same problem

Dec 16, 2023, 5:49 AM
76 points

27 votes

Overall karma indicates overall quality.

4 comments6 min readLW link1 review

Weak-to-Strong Gen­er­al­iza­tion: Elic­it­ing Strong Ca­pa­bil­ities With Weak Supervision

leogaoDec 16, 2023, 5:39 AM
55 points

19 votes

Overall karma indicates overall quality.

5 comments1 min readLW link

Pope Fran­cis shares thoughts on re­spon­si­ble AI development

corruptedCatapillarDec 16, 2023, 3:49 AM
15 points

8 votes

Overall karma indicates overall quality.

4 comments1 min readLW link
(www.vatican.va)

Cur­rent AIs Provide Nearly No Data Rele­vant to AGI Alignment

Thane RuthenisDec 15, 2023, 8:16 PM
132 points

89 votes

Overall karma indicates overall quality.

157 comments8 min readLW link1 review

Ag­glomer­a­tion of ‘Ought’

DavidAndresBloomDec 15, 2023, 7:07 PM
1 point

1 vote

Overall karma indicates overall quality.

1 comment11 min readLW link

Pre­dict­ing the fu­ture with the power of the In­ter­net (and piss­ing off Rob Miles)

WriterDec 15, 2023, 5:37 PM
23 points

12 votes

Overall karma indicates overall quality.

9 comments4 min readLW link
(youtu.be)

Progress links di­gest, 2023-12-15: Vi­talik on d/​acc, $100M+ in prizes, and more

jasoncrawfordDec 15, 2023, 3:52 PM
20 points

7 votes

Overall karma indicates overall quality.

0 comments12 min readLW link
(rootsofprogress.org)

“AI Align­ment” is a Danger­ously Over­loaded Term

RokoDec 15, 2023, 2:34 PM
108 points

69 votes

Overall karma indicates overall quality.

100 comments3 min readLW link

[Valence se­ries] 4. Valence & So­cial Sta­tus (de­p­re­cated)

Steven ByrnesDec 15, 2023, 2:24 PM
35 points

11 votes

Overall karma indicates overall quality.

19 comments11 min readLW link

Con­tra Scott on Abol­ish­ing the FDA

Maxwell TabarrokDec 15, 2023, 2:00 PM
46 points

25 votes

Overall karma indicates overall quality.

3 comments6 min readLW link
(maximumprogress.substack.com)

[Paper] Tra­jec­to­ries through se­man­tic spaces in schizophre­nia and the re­la­tion­ship to rip­ple bursts

bvbvbvbvbvbvbvbvbvbvbvDec 15, 2023, 1:37 PM
3 points

1 vote

Overall karma indicates overall quality.

0 comments1 min readLW link
(www.pnas.org)

Take­aways from a Mechanis­tic In­ter­pretabil­ity pro­ject on “For­bid­den Facts”

Dec 15, 2023, 11:05 AM
34 points

17 votes

Overall karma indicates overall quality.

8 comments10 min readLW link

Refine­ment of Ac­tive In­fer­ence agency ontology

Roman LeventovDec 15, 2023, 9:31 AM
16 points

7 votes

Overall karma indicates overall quality.

0 comments5 min readLW link
(arxiv.org)

EU poli­cy­mak­ers reach an agree­ment on the AI Act

tlevinDec 15, 2023, 6:02 AM
78 points

39 votes

Overall karma indicates overall quality.

7 comments7 min readLW link

Where Does Ad­ver­sar­ial Pres­sure Come From?

quetzal_rainbowDec 14, 2023, 10:31 PM
17 points

9 votes

Overall karma indicates overall quality.

1 comment2 min readLW link

Epoch wise crit­i­cal pe­ri­ods, and sin­gu­lar learn­ing theory

Garrett BakerDec 14, 2023, 8:55 PM
16 points

3 votes

Overall karma indicates overall quality.

1 comment5 min readLW link

OpenAI Su­per­al­ign­ment: Weak-to-strong generalization

DalmertDec 14, 2023, 7:47 PM
25 points

10 votes

Overall karma indicates overall quality.

3 comments1 min readLW link
(openai.com)

Ap­pli­ca­tions for EA Global are still open!

Eli_NathanDec 14, 2023, 7:10 PM
1 point

1 vote

Overall karma indicates overall quality.

0 comments1 min readLW link

Per­sonal Devel­op­ment Sys­tem: Win­ning Re­peat­edly and Grow­ing Effec­tively With The BIG4

Paul RohdeDec 14, 2023, 6:49 PM
13 points

7 votes

Overall karma indicates overall quality.

0 comments33 min readLW link
(blog.paul-rohde.com)

In­tro­duc­ing The ‘From Big Ideas To Real-World Re­sults’: A Series for Effec­tive Per­sonal Development

Paul RohdeDec 14, 2023, 6:49 PM
13 points

8 votes

Overall karma indicates overall quality.

1 comment8 min readLW link
(blog.paul-rohde.com)

Talk­ing With Peo­ple Who Speak to Con­gres­sional Staffers about AI risk

EneaszDec 14, 2023, 5:55 PM
32 points

10 votes

Overall karma indicates overall quality.

0 comments1 min readLW link
(www.thebayesianconspiracy.com)

Bayesian Injustice

Kevin DorstDec 14, 2023, 3:44 PM
124 points

59 votes

Overall karma indicates overall quality.

10 comments6 min readLW link
(kevindorst.substack.com)

AI #42: The Wrong Answer

ZviDec 14, 2023, 2:50 PM
67 points

23 votes

Overall karma indicates overall quality.

6 comments54 min readLW link
(thezvi.wordpress.com)

Some for-profit AI al­ign­ment org ideas

Eric HoDec 14, 2023, 2:23 PM
87 points

53 votes

Overall karma indicates overall quality.

19 comments9 min readLW link

Map­ping the se­man­tic void: Strange go­ings-on in GPT em­bed­ding spaces

mwatkinsDec 14, 2023, 1:10 PM
115 points

57 votes

Overall karma indicates overall quality.

31 comments14 min readLW link

Cat­e­gor­i­cal Or­ga­ni­za­tion in Me­mory: ChatGPT Or­ga­nizes the 665 Topic Tags from My New Sa­vanna Blog

Bill BenzonDec 14, 2023, 1:02 PM
0 points

6 votes

Overall karma indicates overall quality.

6 comments2 min readLW link

Mo­ral Mountains

Adam ZernerDec 14, 2023, 10:40 AM
8 points

3 votes

Overall karma indicates overall quality.

10 comments2 min readLW link

Up­date on Chi­nese IQ-re­lated gene panels

Lao MeinDec 14, 2023, 10:12 AM
70 points

28 votes

Overall karma indicates overall quality.

7 comments1 min readLW link

Red Line Ash­mont Train is Now Approaching

jefftkDec 14, 2023, 2:50 AM
23 points

13 votes

Overall karma indicates overall quality.

2 comments1 min readLW link
(www.jefftk.com)

Var­i­ous AI doom path­ways (and how likely they are)

Logan ZoellnerDec 14, 2023, 12:54 AM
1 point

8 votes

Overall karma indicates overall quality.

1 comment4 min readLW link
(midwitalignment.substack.com)