Research Notes: What are we aligning for?

Shoshannah Tekofsky 8 Jul 2022 22:13 UTC

19 points

8 comments 2 min read LW link

[Question] What New Desktop Should I Buy?

Zvi 8 Jul 2022 15:04 UTC

15 points

19 comments 1 min read LW link

Being a donor for Fecal Microbiota Transplants (FMT): Do good & earn easy money (up to 180k/y)

EternallyBlissful 8 Jul 2022 6:17 UTC

36 points

26 comments 8 min read LW link

(forum.effectivealtruism.org)

User research as a barometer of software design

Biff Wiff 8 Jul 2022 6:02 UTC

31 points

13 comments 3 min read LW link

Reinforcement Learner Wireheading

Nate Showell 8 Jul 2022 5:32 UTC

8 points

2 comments 3 min read LW link

Exposition as science: some ideas for how to make progress

riceissa 8 Jul 2022 1:29 UTC

21 points

1 comment 8 min read LW link

In Search of Strategic Clarity

james.lucassen 8 Jul 2022 0:52 UTC

11 points

1 comment 5 min read LW link

(jlucassen.com)

Unbounded Intelligence Lottery

kman 7 Jul 2022 23:28 UTC

4 points

11 comments 1 min read LW link

How to Become a World Historical Figure (Péladan’s Dream)

rogersbacon 7 Jul 2022 22:39 UTC

21 points

3 comments 30 min read LW link

(www.secretorum.life)

Safety considerations for online generative modeling

Sam Marks 7 Jul 2022 18:31 UTC

42 points

9 comments 14 min read LW link

Human values & biases are inaccessible to the genome

TurnTrout 7 Jul 2022 17:29 UTC

95 points

54 comments 6 min read LW link 1 review

Cooperation with and between AGI’s

PeterMcCluskey 7 Jul 2022 16:45 UTC

10 points

3 comments 10 min read LW link

(www.bayesianinvestor.com)

Aversion Factoring

CFAR!Duncan 7 Jul 2022 16:09 UTC

88 points

1 comment 8 min read LW link

Genders Discrimination

Jacob Falkovich 7 Jul 2022 15:20 UTC

10 points

16 comments 4 min read LW link

Consider Multiclassing

JustisMills 7 Jul 2022 14:54 UTC

24 points

1 comment 3 min read LW link

Covid 7/7/22: Paxlovid at the Pharmacy

Zvi 7 Jul 2022 14:30 UTC

34 points

11 comments 12 min read LW link

(thezvi.wordpress.com)

Babysitting as Parenting Trial?

jefftk 7 Jul 2022 13:20 UTC

52 points

24 comments 3 min read LW link

(www.jefftk.com)

When Giving People Money Doesn’t Help

Zvi 7 Jul 2022 13:00 UTC

58 points

12 comments 10 min read LW link

(thezvi.wordpress.com)

Wealth as a source of technological stagnation?

alyssavance 7 Jul 2022 5:46 UTC

21 points

1 comment 3 min read LW link

Race Along Rashomon Ridge

Stephen Fowler, Peter S. Park and MichaelEinhorn

7 Jul 2022 3:20 UTC

52 points

16 comments 9 min read LW link

[Question] What one paper would you show to someone to get them excited about your field?

oh54321 7 Jul 2022 2:55 UTC

10 points

1 comment 1 min read LW link

Principles for Alignment/Agency Projects

johnswentworth 7 Jul 2022 2:07 UTC

122 points

20 comments 4 min read LW link

Confusions in My Model of AI Risk

peterbarnett 7 Jul 2022 1:05 UTC

22 points

9 comments 5 min read LW link

Some Adventures of a Curious Richard Feynman

Dalton Mabery 6 Jul 2022 23:11 UTC

10 points

0 comments 3 min read LW link

Outer vs inner misalignment: three framings

Richard_Ngo 6 Jul 2022 19:46 UTC

53 points

5 comments 9 min read LW link

Tarnished Guy who Puts a Num on it

Jacob Falkovich 6 Jul 2022 18:05 UTC

44 points

11 comments 4 min read LW link

Deep neural networks are not opaque.

jem-mosig 6 Jul 2022 18:03 UTC

22 points

14 comments 3 min read LW link

How humanity would respond to slow takeoff, with takeaways from the entire COVID-19 pandemic

Noosphere89 6 Jul 2022 17:52 UTC

4 points

1 comment 2 min read LW link

[Question] Should you write under a blog or your own name?

Dalton Mabery 6 Jul 2022 15:26 UTC

2 points

2 comments 1 min read LW link

Carrying the Torch: A Response to Anna Salamon by the Guild of the Rose

moridinamael 6 Jul 2022 14:20 UTC

137 points

16 comments 6 min read LW link

Predicting Parental Emotional Changes?

jefftk 6 Jul 2022 13:50 UTC

55 points

19 comments 2 min read LW link

(www.jefftk.com)

Berlin AI Safety Open Meetup July 2022

pranomostro 6 Jul 2022 12:41 UTC

6 points

0 comments 1 min read LW link

Forecasting Through Fiction

Yitz 6 Jul 2022 5:03 UTC

5 points

2 comments 8 min read LW link

Introducing the Fund for Alignment Research (We’re Hiring!)

AdamGleave, Scott Emmons, Ethan Perez and Claudia Shi

6 Jul 2022 2:07 UTC

62 points

0 comments 4 min read LW link

My vision of a good future, part I

Jeffrey Ladish 6 Jul 2022 1:23 UTC

66 points

18 comments 9 min read LW link

Imperial Russia was doing fine without the Soviets

Davis Kedrosky 5 Jul 2022 22:24 UTC

6 points

3 comments 14 min read LW link

(daviskedrosky.substack.com)

A Pattern Language For Rationality

Vaniver 5 Jul 2022 19:08 UTC

75 points

14 comments 15 min read LW link

How to destroy the universe with a hypercomputer

Trevor Cappallo 5 Jul 2022 19:05 UTC

2 points

3 comments 1 min read LW link

The curious case of Pretty Good human inner/outer alignment

PavleMiha 5 Jul 2022 19:04 UTC

41 points

45 comments 4 min read LW link

When is it appropriate to use statistical models and probabilities for decision making?

Younes Kamel 5 Jul 2022 12:34 UTC

10 points

7 comments 4 min read LW link

(youneskamel.substack.com)

Goal Factoring

CFAR!Duncan 5 Jul 2022 7:10 UTC

103 points

2 comments 8 min read LW link

Assorted thoughts about abstraction

Biff Wiff 5 Jul 2022 6:40 UTC

16 points

9 comments 7 min read LW link

[AN #172] Sorry for the long hiatus!

Rohin Shah 5 Jul 2022 6:20 UTC

54 points

0 comments 3 min read LW link

(mailchi.mp)

Outline: The Rectifying of Maps

hamnox 5 Jul 2022 5:14 UTC

7 points

0 comments 2 min read LW link

[Question] Seeking opinions on the current and forward state of cryptocurrencies.

jmh 5 Jul 2022 5:01 UTC

6 points

6 comments 1 min read LW link

ITT-passing and civility are good; “charity” is bad; steelmanning is niche

Rob Bensinger 5 Jul 2022 0:15 UTC

165 points

37 comments 6 min read LW link 1 review

Please help us communicate AI xrisk. It could save the world.

otto.barten 4 Jul 2022 21:47 UTC

4 points

7 comments 2 min read LW link

Benchmark for successful concept extrapolation/avoiding goal misgeneralization

Stuart_Armstrong 4 Jul 2022 20:48 UTC

83 points

12 comments 4 min read LW link

Procedural Executive Function, Part 1

DaystarEld 4 Jul 2022 18:51 UTC

59 points

10 comments 14 min read LW link

(daystareld.com)

Anthropic’s SoLU (Softmax Linear Unit)

Joel Burget 4 Jul 2022 18:38 UTC

21 points

1 comment 4 min read LW link

(transformer-circuits.pub)