[Linkpost] The Story Of VaccinateCA

hath · 9 Dec 2022 23:54 UTC
103 points
4 comments · 10 min read · LW link
(www.worksinprogress.co)

Prosaic misalignment from the Solomonoff Predictor

Cleo Nardo · 9 Dec 2022 17:53 UTC
40 points
2 comments · 5 min read · LW link

Take 8: Queer the inner/outer alignment dichotomy.

Charlie Steiner · 9 Dec 2022 17:46 UTC
28 points
2 comments · 2 min read · LW link

[Question] Does a LLM have a utility function?

Dagon · 9 Dec 2022 17:19 UTC
17 points
11 comments · 1 min read · LW link

Monthly Roundup #1

Zvi · 9 Dec 2022 17:10 UTC
31 points
6 comments · 21 min read · LW link
(thezvi.wordpress.com)

Working towards AI alignment is better

Johannes C. Mayer · 9 Dec 2022 15:39 UTC
8 points
2 comments · 2 min read · LW link

You can still fetch the coffee today if you’re dead tomorrow

davidad · 9 Dec 2022 14:06 UTC
84 points
19 comments · 5 min read · LW link

ChatGPT’s Misalignment Isn’t What You Think

stavros · 9 Dec 2022 11:11 UTC
3 points
12 comments · 1 min read · LW link

ML Safety at NeurIPS & Paradigmatic AI Safety? MLAISU W49

9 Dec 2022 10:38 UTC
19 points
0 comments · 4 min read · LW link
(newsletter.apartresearch.com)

[Question] What are your thoughts on the future of AI-assisted software development?

RomanHauksson · 9 Dec 2022 10:04 UTC
4 points
4 comments · 1 min read · LW link

Fear mitigated the nuclear threat, can it do the same to AGI risks?

Igor Ivanov · 9 Dec 2022 10:04 UTC
6 points
8 comments · 5 min read · LW link

Setting the Zero Point

[DEACTIVATED] Duncan Sabien · 9 Dec 2022 6:06 UTC
90 points
43 comments · 20 min read · LW link · 1 review

Systems of Survival

Vaniver · 9 Dec 2022 5:13 UTC
63 points
5 comments · 5 min read · LW link

[Question] Do You Have an Internal Monologue?

belkarx · 9 Dec 2022 3:04 UTC
23 points
7 comments · 1 min read · LW link

[Question] How is the “sharp left turn” defined?

Chris_Leong · 9 Dec 2022 0:04 UTC
14 points
4 comments · 1 min read · LW link

Linkpost for a generalist algorithmic learner: capable of carrying out sorting, shortest paths, string matching, convex hull finding in one network

lovetheusers · 9 Dec 2022 0:02 UTC
7 points
1 comment · 1 min read · LW link
(twitter.com)

[Question] Where’s the economic incentive for wokism coming from?

Valentine · 8 Dec 2022 23:28 UTC
12 points
105 comments · 1 min read · LW link

I Believe we are in a Hardware Overhang

nem · 8 Dec 2022 23:18 UTC
8 points
0 comments · 1 min read · LW link

Of pumpkins, the Falcon Heavy, and Groucho Marx: High-Level discourse structure in ChatGPT

Bill Benzon · 8 Dec 2022 22:25 UTC
2 points
0 comments · 8 min read · LW link

How Many Lives Does X-Risk Work Save From Nonexistence On Average?

Jordan Arel · 8 Dec 2022 21:57 UTC
4 points
5 comments · 14 min read · LW link

AI Safety Seems Hard to Measure

HoldenKarnofsky · 8 Dec 2022 19:50 UTC
71 points
6 comments · 14 min read · LW link
(www.cold-takes.com)

Playing shell games with definitions

weverka · 8 Dec 2022 19:35 UTC
9 points
3 comments · 1 min read · LW link

Notes on OpenAI’s alignment plan

Alex Flint · 8 Dec 2022 19:13 UTC
40 points
5 comments · 7 min read · LW link

Relevant to natural abstractions: Euclidean Symmetry Equivariant Machine Learning—Overview, Applications, and Open Questions

the gears to ascension · 8 Dec 2022 18:01 UTC
8 points
0 comments · 1 min read · LW link
(youtu.be)

I’ve started publishing the novel I wrote to promote EA

Timothy Underwood · 8 Dec 2022 17:30 UTC
10 points
3 comments · 1 min read · LW link

Neural networks biased towards geometrically simple functions?

DavidHolmes · 8 Dec 2022 16:16 UTC
16 points
2 comments · 3 min read · LW link

If Wentworth is right about natural abstractions, it would be bad for alignment

Wuschel Schulz · 8 Dec 2022 15:19 UTC
28 points
5 comments · 4 min read · LW link

Covid 12/8/22: Another Winter Wave

Zvi · 8 Dec 2022 14:40 UTC
23 points
8 comments · 11 min read · LW link
(thezvi.wordpress.com)

Why I’m Sceptical of Foom

DragonGod · 8 Dec 2022 10:01 UTC
20 points
36 comments · 3 min read · LW link

Take 7: You should talk about “the human’s utility function” less.

Charlie Steiner · 8 Dec 2022 8:14 UTC
50 points
22 comments · 2 min read · LW link

Machine Learning Consent

jefftk · 8 Dec 2022 3:50 UTC
38 points
14 comments · 3 min read · LW link
(www.jefftk.com)

Riffing on the agent type

Quinn · 8 Dec 2022 0:19 UTC
21 points
3 comments · 4 min read · LW link

[Question] Looking for ideas of public assets (stocks, funds, ETFs) that I can invest in to have a chance at profiting from the mass adoption and commercialization of AI technology

Annapurna · 7 Dec 2022 22:35 UTC
15 points
9 comments · 1 min read · LW link

A Fallibilist Wordview

Toni MUENDEL · 7 Dec 2022 20:59 UTC
−13 points
2 comments · 13 min read · LW link

Thoughts on AGI organizations and capabilities work

7 Dec 2022 19:46 UTC
102 points
17 comments · 5 min read · LW link

How to Think About Climate Models and How to Improve Them

clans · 7 Dec 2022 19:37 UTC
7 points
0 comments · 2 min read · LW link
(locationtbd.home.blog)

The novelty quotient

River Lewis · 7 Dec 2022 17:16 UTC
4 points
7 comments · 2 min read · LW link
(heytraveler.substack.com)

ChatGPT: “An error occurred. If this issue persists...”

Bill Benzon · 7 Dec 2022 15:41 UTC
5 points
11 comments · 3 min read · LW link

Take 6: CAIS is actually Orwellian.

Charlie Steiner · 7 Dec 2022 13:50 UTC
14 points
8 comments · 2 min read · LW link

Peter Thiel on Technological Stagnation and Out of Touch Rationalists

Matt Goldenberg · 7 Dec 2022 13:15 UTC
9 points
26 comments · 1 min read · LW link
(youtu.be)

[Link] Wavefunctions: from Linear Algebra to Spinors

sen · 7 Dec 2022 12:44 UTC
11 points
12 comments · 1 min read · LW link
(paperclip.substack.com)

Why I like Zulip instead of Slack or Discord

Alok Singh · 7 Dec 2022 9:28 UTC
31 points
10 comments · 1 min read · LW link

Bioweapons, and ChatGPT (another vulnerability story)

joshuatanderson · 7 Dec 2022 7:27 UTC
−5 points
0 comments · 2 min read · LW link

Where to be an AI Safety Professor

scasper · 7 Dec 2022 7:09 UTC
30 points
12 comments · 2 min read · LW link

[Question] Are there any tools to convert LW sequences to PDF or any other file format?

quetzal_rainbow · 7 Dec 2022 5:28 UTC
2 points
2 comments · 1 min read · LW link

Manifold Markets community meetup

Sinclair Chen · 7 Dec 2022 3:25 UTC
4 points
0 comments · 1 min read · LW link

“Attention Passengers”: not for Signs

jefftk · 7 Dec 2022 2:00 UTC
27 points
10 comments · 1 min read · LW link
(www.jefftk.com)

[ASoT] Probability Infects Concepts it Touches

Ulisse Mini · 7 Dec 2022 1:48 UTC
10 points
4 comments · 1 min read · LW link

Simple Way to Prevent Power-Seeking AI

research_prime_space · 7 Dec 2022 0:26 UTC
12 points
1 comment · 1 min read · LW link

In defense of probably wrong mechanistic models

evhub · 6 Dec 2022 23:24 UTC
53 points
10 comments · 2 min read · LW link