27 Sep 2022 23:13 UTC

180 points

10 comments4 min readLW link

Failure modes in a shard theory alignment plan

Thomas Kwa27 Sep 2022 22:34 UTC

26 points

2 comments7 min readLW link

[Question] Is a PhD necessary to contribute meaningfully to a field?

TrudosKudos27 Sep 2022 21:27 UTC

4 points

7 comments1 min readLW link

Why we’re not founding a human-data-for-alignment org

L Rudolf L and Matt Putz

27 Sep 2022 20:14 UTC

88 points

6 comments29 min readLW link

(forum.effectivealtruism.org)

A Poorly Planned Loft Bed

jefftk27 Sep 2022 17:50 UTC

9 points

2 comments1 min readLW link

(www.jefftk.com)

Wise Crowd & Democratic Spirit

Hristo Zaykov27 Sep 2022 17:45 UTC

1 point

0 comments2 min readLW link

(www.hristo.blog)

Soft skills for meetups

mingyuan27 Sep 2022 17:26 UTC

51 points

3 comments5 min readLW link

[Question] Enriching Youtube content recommendations

Martín Soto27 Sep 2022 16:54 UTC

9 points

4 comments1 min readLW link

The Onion Test for Personal and Institutional Honesty

chanamessinger and Andrew_Critch

27 Sep 2022 15:26 UTC

173 points

32 comments3 min readLW link 3 reviews

Book review: “The Heart of the Brain: The Hypothalamus and Its Hormones”

Steven Byrnes27 Sep 2022 13:20 UTC

66 points

3 comments18 min readLW link

My Thoughts on the ML Safety Course

zeshen27 Sep 2022 13:15 UTC

50 points

3 comments17 min readLW link

Summary of ML Safety Course

zeshen27 Sep 2022 13:05 UTC

7 points

0 comments6 min readLW link

Probabilistic reasoning for description and experience

Q Home27 Sep 2022 10:57 UTC

0 points

0 comments26 min readLW link

A Prince, a Pauper, Power, Panama

Alok Singh27 Sep 2022 7:10 UTC

10 points

0 comments1 min readLW link

(alok.github.io)

Double Asteroid Redirection Test succeeds

sanxiyn27 Sep 2022 6:37 UTC

19 points

5 comments1 min readLW link

(twitter.com)

[Question] How would I know if a PhD is the right career path?

Bob Guran27 Sep 2022 5:49 UTC

4 points

4 comments1 min readLW link

Review of Examine.com’s vitamin write-ups

Elizabeth and Martin Bernstorff

26 Sep 2022 23:40 UTC

60 points

1 comment5 min readLW link

(acesounderglass.com)

D&D.Sci September 2022 Evaluation and Ruleset

abstractapplic26 Sep 2022 22:19 UTC

30 points

5 comments3 min readLW link

[MLSN #5]: Prize Compilation

Dan H26 Sep 2022 21:55 UTC

15 points

1 comment2 min readLW link

Loss of Alignment is not the High-Order Bit for AI Risk

yieldthought26 Sep 2022 21:16 UTC

14 points

18 comments2 min readLW link

Inverse Scaling Prize: Round 1 Winners

Ethan Perez and Ian McKenzie

26 Sep 2022 19:57 UTC

93 points

16 comments4 min readLW link

(irmckenzie.co.uk)

[Question] Does the existence of shared human values imply alignment is “easy”?

Morpheus26 Sep 2022 18:01 UTC

7 points

15 comments1 min readLW link

Meetup: Madison, WI (Oct 8)

svfritz26 Sep 2022 17:55 UTC

1 point

0 comments1 min readLW link

Ambiguity in Prediction Market Resolution is Harmful

aphyer26 Sep 2022 16:22 UTC

69 points

17 comments5 min readLW link

Framery Phone Booth CO2 Accumulation

jefftk26 Sep 2022 16:10 UTC

25 points

0 comments1 min readLW link

(www.jefftk.com)

[Question] How can I remove the launch button from my LW home page?

sudo26 Sep 2022 15:15 UTC

8 points

4 comments1 min readLW link

Brief Notes on Transformers

Adam Jermyn26 Sep 2022 14:46 UTC

48 points

3 comments2 min readLW link

You are Underestimating The Likelihood That Convergent Instrumental Subgoals Lead to Aligned AGI

Mark Neyer26 Sep 2022 14:22 UTC

3 points

6 comments3 min readLW link

Climate-contingent Finance, and A Generalized Mechanism for X-Risk Reduction Financing

John Nay26 Sep 2022 13:23 UTC

0 points

2 comments26 min readLW link

Self-Control Secrets of the Puritan Masters

David Hugh-Jones26 Sep 2022 9:04 UTC

68 points

3 comments5 min readLW link

(wyclif.substack.com)

How I buy things when Lightcone wants them fast

Bird Concept26 Sep 2022 5:02 UTC

240 points

21 comments8 min readLW link

Oren’s Field Guide of Bad AGI Outcomes

Eris Discordia26 Sep 2022 4:06 UTC

0 points

0 comments1 min readLW link

On Generality

Eris Discordia26 Sep 2022 4:06 UTC

2 points

0 comments5 min readLW link

Planning a Loft Bed

jefftk26 Sep 2022 0:10 UTC

15 points

15 comments2 min readLW link

(www.jefftk.com)

Becoming Black Boxish

vitaliya25 Sep 2022 23:35 UTC

16 points

0 comments2 min readLW link

Announcing Balsa Research

Zvi25 Sep 2022 22:50 UTC

235 points

64 comments2 min readLW link 1 review

(thezvi.wordpress.com)

An Unexpected GPT-3 Decision in a Simple Gamble

casualphysicsenjoyer25 Sep 2022 16:46 UTC

8 points

4 comments1 min readLW link

“Agency” needs nuance

Evie Cottrell25 Sep 2022 7:40 UTC

23 points

1 comment14 min readLW link

Acceptance and Commitment Therapy (ACT) 101

Evie Cottrell25 Sep 2022 7:25 UTC

8 points

2 comments8 min readLW link

Bathroom Construction Cost Comparison

jefftk25 Sep 2022 2:30 UTC

11 points

0 comments2 min readLW link

(www.jefftk.com)

Prioritizing the Arts in response to AI automation

Casey25 Sep 2022 2:25 UTC

18 points

11 comments2 min readLW link

UI/UX From the Dark Ages

Shmi25 Sep 2022 1:53 UTC

25 points

15 comments2 min readLW link

P(misalignment x-risk|AGI) is small #[Future Fund worldview prize]

Dibbu Dibbu24 Sep 2022 23:54 UTC

−18 points

0 comments4 min readLW link

[Question] Papers to start getting into NLP-focused alignment research

Feraidoon24 Sep 2022 23:53 UTC

6 points

0 comments1 min readLW link

Whose Fault?

Markovia24 Sep 2022 23:53 UTC

1 point

0 comments1 min readLW link

Brain-over-body biases, and the embodied value problem in AI alignment

geoffreymiller24 Sep 2022 22:24 UTC

10 points

6 comments25 min readLW link

Opt out from the Funni

Coafos24 Sep 2022 22:07 UTC

8 points

1 comment2 min readLW link

AI coöperation is more possible than you think

42317524 Sep 2022 21:26 UTC

7 points

0 comments2 min readLW link

“Cotton Gin” AI Risk

42317524 Sep 2022 21:26 UTC

7 points

3 comments2 min readLW link

Two reasons we might be closer to solving alignment than it seems

KatWoods and AmberDawn

24 Sep 2022 20:00 UTC

57 points

9 comments4 min readLW link