All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 3 4 5 678 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Some Adventures of a Curious Richard Feynman

Dalton Mabery6 Jul 2022 23:11 UTC

10 points

0 comments3 min readLW link

Cognitive Dissonance on Cognitive Capability

niederman6 Jul 2022 22:53 UTC

6 points

0 comments1 min readLW link

(maxniederman.com)

Outer vs inner misalignment: three framings

Richard_Ngo6 Jul 2022 19:46 UTC

49 points

5 comments9 min readLW link

Tarnished Guy who Puts a Num on it

Jacob Falkovich6 Jul 2022 18:05 UTC

44 points

11 comments4 min readLW link

Deep neural networks are not opaque.

jem-mosig6 Jul 2022 18:03 UTC

22 points

14 comments3 min readLW link

How humanity would respond to slow takeoff, with takeaways from the entire COVID-19 pandemic

Noosphere896 Jul 2022 17:52 UTC

4 points

1 comment2 min readLW link

[Question] Should you write under a blog or your own name?

Dalton Mabery6 Jul 2022 15:26 UTC

2 points

2 comments1 min readLW link

Carrying the Torch: A Response to Anna Salamon by the Guild of the Rose

moridinamael6 Jul 2022 14:20 UTC

133 points

16 comments6 min readLW link

Predicting Parental Emotional Changes?

jefftk6 Jul 2022 13:50 UTC

39 points

11 comments2 min readLW link

(www.jefftk.com)

Berlin AI Safety Open Meetup July 2022

pranomostro6 Jul 2022 12:41 UTC

6 points

0 comments1 min readLW link

Forecasting Through Fiction

Yitz6 Jul 2022 5:03 UTC

5 points

2 comments8 min readLW link

Introducing the Fund for Alignment Research (We’re Hiring!)

AdamGleave, Scott Emmons, Ethan Perez and Claudia Shi

6 Jul 2022 2:07 UTC

62 points

0 comments4 min readLW link

My vision of a good future, part I

Jeffrey Ladish6 Jul 2022 1:23 UTC

66 points

18 comments9 min readLW link

Imperial Russia was doing fine without the Soviets

Davis Kedrosky5 Jul 2022 22:24 UTC

6 points

3 comments14 min readLW link

(daviskedrosky.substack.com)

A Pattern Language For Rationality

Vaniver5 Jul 2022 19:08 UTC

75 points

14 comments15 min readLW link

How to destroy the universe with a hypercomputer

Trevor Cappallo5 Jul 2022 19:05 UTC

2 points

3 comments1 min readLW link

The curious case of Pretty Good human inner/outer alignment

PavleMiha5 Jul 2022 19:04 UTC

41 points

45 comments4 min readLW link

When is it appropriate to use statistical models and probabilities for decision making ?

Younes Kamel5 Jul 2022 12:34 UTC

10 points

7 comments4 min readLW link

(youneskamel.substack.com)

Goal Factoring

CFAR!Duncan5 Jul 2022 7:10 UTC

80 points

2 comments8 min readLW link

Assorted thoughts about abstraction

Adam Zerner5 Jul 2022 6:40 UTC

16 points

9 comments7 min readLW link

[AN #172] Sorry for the long hiatus!

Rohin Shah5 Jul 2022 6:20 UTC

54 points

0 comments3 min readLW link

(mailchi.mp)

Outline: The Rectifying of Maps

hamnox5 Jul 2022 5:14 UTC

7 points

0 comments2 min readLW link

[Question] Seeking opinions on the current and forward state of cryptocurrencies.

jmh5 Jul 2022 5:01 UTC

7 points

6 comments1 min readLW link

ITT-passing and civility are good; “charity” is bad; steelmanning is niche

Rob Bensinger5 Jul 2022 0:15 UTC

161 points

36 comments6 min readLW link 1 review

Please help us communicate AI xrisk. It could save the world.

otto.barten4 Jul 2022 21:47 UTC

4 points

7 comments2 min readLW link

Benchmark for successful concept extrapolation/avoiding goal misgeneralization

Stuart_Armstrong4 Jul 2022 20:48 UTC

82 points

12 comments4 min readLW link

Procedural Executive Function, Part 1

DaystarEld4 Jul 2022 18:51 UTC

33 points

2 comments13 min readLW link

(daystareld.com)

Anthropic’s SoLU (Softmax Linear Unit)

Joel Burget4 Jul 2022 18:38 UTC

21 points

1 comment4 min readLW link

(transformer-circuits.pub)

Book Review: The Righteous Mind

ErnestScribbler4 Jul 2022 17:45 UTC

33 points

8 comments35 min readLW link

My Most Likely Reason to Die Young is AI X-Risk

AISafetyIsNotLongtermist4 Jul 2022 17:08 UTC

61 points

24 comments4 min readLW link

(forum.effectivealtruism.org)

Is General Intelligence “Compact”?

DragonGod4 Jul 2022 13:27 UTC

27 points

6 comments22 min readLW link

Remaking EfficientZero (as best I can)

Hoagy4 Jul 2022 11:03 UTC

36 points

9 comments22 min readLW link

We Need a Consolidated List of Bad AI Alignment Solutions

Double4 Jul 2022 6:54 UTC

9 points

14 comments1 min readLW link

AI Forecasting: One Year In

jsteinhardt4 Jul 2022 5:10 UTC

132 points

12 comments6 min readLW link

(bounded-regret.ghost.io)

A compressed take on recent disagreements

kman4 Jul 2022 4:39 UTC

33 points

9 comments1 min readLW link

New US Senate Bill on X-Risk Mitigation [Linkpost]

Evan R. Murphy4 Jul 2022 1:25 UTC

35 points

12 comments1 min readLW link

(www.hsgac.senate.gov)

Monthly Shorts 6/22

Celer3 Jul 2022 23:40 UTC

5 points

2 comments5 min readLW link

(keller.substack.com)

Decision theory and dynamic inconsistency

paulfchristiano3 Jul 2022 22:20 UTC

79 points

33 comments10 min readLW link

(sideways-view.com)

Five routes of access to scientific literature

DirectedEvolution3 Jul 2022 20:53 UTC

13 points

4 comments6 min readLW link

Toni Kurz and the Insanity of Climbing Mountains

GeneSmith3 Jul 2022 20:51 UTC

268 points

67 comments11 min readLW link 2 reviews

Wonder and The Golden AI Rule

JeffreyK3 Jul 2022 18:21 UTC

0 points

4 comments6 min readLW link

Evolution Doesn’t Have Feelings

UtilityMonster3 Jul 2022 17:13 UTC

−1 points

0 comments1 min readLW link

Nature abhors an immutable replicator… usually

MSRayne3 Jul 2022 15:08 UTC

28 points

10 comments3 min readLW link

Post hoc justifications as Compression Algorithm

Johannes C. Mayer3 Jul 2022 5:02 UTC

8 points

0 comments1 min readLW link

SOMA—A story about Consciousness

Johannes C. Mayer3 Jul 2022 4:46 UTC

10 points

0 comments1 min readLW link

(www.youtube.com)

Sexual self-acceptance

Johannes C. Mayer3 Jul 2022 4:26 UTC

11 points

6 comments1 min readLW link

Donohue, Levitt, Roe, and Wade: T-minus 20 years to a massive crime wave?

Paul Logan3 Jul 2022 3:03 UTC

−24 points

6 comments3 min readLW link

(laulpogan.substack.com)

Can we achieve AGI Alignment by balancing multiple human objectives?

Ben Smith3 Jul 2022 2:51 UTC

11 points

1 comment4 min readLW link

Trigger-Action Planning

CFAR!Duncan3 Jul 2022 1:42 UTC

81 points

14 comments13 min readLW link 2 reviews

[Question] Which one of these two academic routes should I take to end up in AI Safety?

Martín Soto3 Jul 2022 1:05 UTC

5 points

2 comments1 min readLW link