a rough sketch of formal aligned AI using QACI

Tamsin Leake

11 Dec 2022 23:40 UTC

14 points

0 comments, 4 min read, LW link

(carado.moe)

Benchmarks for Comparing Human and AI Intelligence

MrThink

11 Dec 2022 22:06 UTC

8 points

4 comments, 2 min read, LW link

Reflections on the PIBBSS Fellowship 2022

Nora_Ammann and particlemania

11 Dec 2022 21:53 UTC

32 points

0 comments, 18 min read, LW link

A crisis for online communication: bots and bot users will overrun the Internet?

Mitchell_Porter

11 Dec 2022 21:11 UTC

15 points

11 comments, 1 min read, LW link

Finite Factored Sets in Pictures

Magdalena Wache

11 Dec 2022 18:49 UTC

174 points

35 comments, 12 min read, LW link

Formalization as suspension of intuition

adamShimi

11 Dec 2022 15:16 UTC

54 points

18 comments, 1 min read, LW link

(epistemologicalvigilance.substack.com)

An argument on animal consciousness (soliciting criticism)

SciHamster

11 Dec 2022 15:12 UTC

−3 points

2 comments, 1 min read, LW link

ChatGPT’s new novel rationality technique of fact checking

ChristianKl

11 Dec 2022 13:54 UTC

−14 points

7 comments, 1 min read, LW link

Reframing inner alignment

davidad

11 Dec 2022 13:53 UTC

53 points

13 comments, 4 min read, LW link

A poem about applied rationality by ChatGPT

ChristianKl

11 Dec 2022 13:43 UTC

4 points

0 comments, 1 min read, LW link

ChatGPT goes through a wormhole hole in our Shandyesque universe [virtual wacky weed]

Bill Benzon

11 Dec 2022 11:59 UTC

−1 points

2 comments, 3 min read, LW link

Using Obsidian if you’re used to using Roam

Solenoid_Entity

11 Dec 2022 8:59 UTC

19 points

4 comments, 2 min read, LW link

[fiction] Our Final Hour

Mati_Roy

11 Dec 2022 5:49 UTC

17 points

5 comments, 3 min read, LW link

Consider using reversible automata for alignment research

Alex_Altair

11 Dec 2022 1:00 UTC

88 points

30 comments, 2 min read, LW link

High level discourse structure in ChatGPT: Part 2 [Quasi-symbolic?]

Bill Benzon

10 Dec 2022 22:26 UTC

7 points

0 comments, 6 min read, LW link

Poll Results on AGI

Niclas Kupper

10 Dec 2022 21:25 UTC

18 points

0 comments, 2 min read, LW link

Reflecting on the 2022 Guild of the Rose Workshops

moridinamael

10 Dec 2022 21:21 UTC

26 points

7 comments, 8 min read, LW link

[Question] Reversing a quantum simulation on the planetary scale

Mythopoeist

10 Dec 2022 20:26 UTC

2 points

3 comments, 1 min read, LW link

ACX Zurich December Meetup

MB

10 Dec 2022 19:23 UTC

1 point

0 comments, 1 min read, LW link

FMT: a great opportunity for soon-to-be parents

Anton Rodenhauser

10 Dec 2022 17:56 UTC

7 points

0 comments, 6 min read, LW link

[ASoT] Natural abstractions and AlphaZero

Ulisse Mini

10 Dec 2022 17:53 UTC

33 points

1 comment, 1 min read, LW link

(arxiv.org)

[Question] How promising are legal avenues to restrict AI training data?

thehalliard

10 Dec 2022 16:31 UTC

9 points

2 comments, 1 min read, LW link

Inspiration as a Scarce Resource

zenbu zenbu zenbu zenbu

10 Dec 2022 15:23 UTC

7 points

0 comments, 4 min read, LW link

(inflorescence.substack.com)

Will Manifold Markets/Metaculus have built-in support for reflective latent variables by 2025?

tailcalled

10 Dec 2022 13:55 UTC

34 points

0 comments, 1 min read, LW link

My thoughts on OpenAI’s Alignment plan

Donald Hobson

10 Dec 2022 10:35 UTC

25 points

1 comment, 6 min read, LW link

[Question] How would you improve ChatGPT’s filtering?

Noah Scales

10 Dec 2022 8:05 UTC

9 points

6 comments, 1 min read, LW link

[Question] A thought experiment

sisyphus

10 Dec 2022 5:23 UTC

3 points

12 comments, 1 min read, LW link

patio11′s “Observations from an EA-adjacent (?) charitable effort”

RobertM

10 Dec 2022 0:27 UTC

43 points

0 comments, 1 min read, LW link

(forum.effectivealtruism.org)

A dynamical systems primer for entropy and optimization

Alex_Altair

10 Dec 2022 0:13 UTC

47 points

3 comments, 7 min read, LW link

[Linkpost] The Story Of VaccinateCA

hath

9 Dec 2022 23:54 UTC

103 points

4 comments, 10 min read, LW link

(www.worksinprogress.co)

Prosaic misalignment from the Solomonoff Predictor

Cleo Nardo

9 Dec 2022 17:53 UTC

40 points

2 comments, 5 min read, LW link

Take 8: Queer the inner/outer alignment dichotomy.

Charlie Steiner

9 Dec 2022 17:46 UTC

28 points

2 comments, 2 min read, LW link

[Question] Does a LLM have a utility function?

Dagon

9 Dec 2022 17:19 UTC

17 points

11 comments, 1 min read, LW link

Monthly Roundup #1

Zvi

9 Dec 2022 17:10 UTC

31 points

6 comments, 21 min read, LW link

(thezvi.wordpress.com)

Working towards AI alignment is better

Johannes C. Mayer

9 Dec 2022 15:39 UTC

8 points

2 comments, 2 min read, LW link

You can still fetch the coffee today if you’re dead tomorrow

davidad

9 Dec 2022 14:06 UTC

84 points

19 comments, 5 min read, LW link

ChatGPT’s Misalignment Isn’t What You Think

stavros

9 Dec 2022 11:11 UTC

3 points

12 comments, 1 min read, LW link

ML Safety at NeurIPS & Paradigmatic AI Safety? MLAISU W49

Esben Kran and Steinthal

9 Dec 2022 10:38 UTC

19 points

0 comments, 4 min read, LW link

(newsletter.apartresearch.com)

[Question] What are your thoughts on the future of AI-assisted software development?

RomanHauksson

9 Dec 2022 10:04 UTC

4 points

4 comments, 1 min read, LW link

Fear mitigated the nuclear threat, can it do the same to AGI risks?

Igor Ivanov

9 Dec 2022 10:04 UTC

6 points

8 comments, 5 min read, LW link

Setting the Zero Point

[DEACTIVATED] Duncan Sabien

9 Dec 2022 6:06 UTC

90 points

43 comments, 20 min read, LW link, 1 review

Systems of Survival

Vaniver

9 Dec 2022 5:13 UTC

63 points

5 comments, 5 min read, LW link

[Question] Do You Have an Internal Monologue?

belkarx

9 Dec 2022 3:04 UTC

23 points

7 comments, 1 min read, LW link

[Question] How is the “sharp left turn defined”?

Chris_Leong

9 Dec 2022 0:04 UTC

14 points

4 comments, 1 min read, LW link

Linkpost for a generalist algorithmic learner: capable of carrying out sorting, shortest paths, string matching, convex hull finding in one network

lovetheusers

9 Dec 2022 0:02 UTC

7 points

1 comment, 1 min read, LW link

(twitter.com)

[Question] Where’s the economic incentive for wokism coming from?

Valentine

8 Dec 2022 23:28 UTC

12 points

105 comments, 1 min read, LW link

I Believe we are in a Hardware Overhang

nem

8 Dec 2022 23:18 UTC

8 points

0 comments, 1 min read, LW link

Of pumpkins, the Falcon Heavy, and Groucho Marx: High-Level discourse structure in ChatGPT

Bill Benzon

8 Dec 2022 22:25 UTC

2 points

0 comments, 8 min read, LW link

How Many Lives Does X-Risk Work Save From Nonexistence On Average?

Jordan Arel

8 Dec 2022 21:57 UTC

4 points

5 comments, 14 min read, LW link

AI Safety Seems Hard to Measure

HoldenKarnofsky

8 Dec 2022 19:50 UTC

71 points

6 comments, 14 min read, LW link

(www.cold-takes.com)