All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May Jun Jul Aug Sep Oct NovDec

All 1 2 3 4 5 6 7 8 9 10 11 121314 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

12 career-related questions that may (or may not) be helpful for people interested in alignment research

Orpheus1612 Dec 2022 22:36 UTC

20 points

0 comments2 min readLW link

Concept extrapolation for hypothesis generation

Stuart_Armstrong, Patrick Leask and rgorman

12 Dec 2022 22:09 UTC

20 points

2 comments3 min readLW link

Let’s go meta: Grammatical knowledge and self-referential sentences [ChatGPT]

Bill Benzon12 Dec 2022 21:50 UTC

5 points

0 comments9 min readLW link

D&D.Sci December 2022 Evaluation and Ruleset

abstractapplic12 Dec 2022 21:21 UTC

17 points

8 comments2 min readLW link

Log-odds are better than Probabilities

Robert_AIZI12 Dec 2022 20:10 UTC

22 points

4 comments4 min readLW link

(aizi.substack.com)

Bengaluru LW/ACX Social Meetup—December 2022

faiz12 Dec 2022 19:30 UTC

4 points

0 comments1 min readLW link

Psychological Disorders and Problems

adamShimi and Gabriel Alfour

12 Dec 2022 18:15 UTC

39 points

6 comments1 min readLW link

Confusing the goal and the path

adamShimi12 Dec 2022 16:42 UTC

44 points

7 comments1 min readLW link

(epistemologicalvigilance.substack.com)

Meaningful things are those the universe possesses a semantics for

Abhimanyu Pallavi Sudhir12 Dec 2022 16:03 UTC

30 points

15 comments14 min readLW link

Tradeoffs in complexity, abstraction, and generality

remember and Gabriel Alfour

12 Dec 2022 15:55 UTC

32 points

0 comments2 min readLW link

Green Line Extension Opening Dates

jefftk12 Dec 2022 14:40 UTC

12 points

0 comments1 min readLW link

(www.jefftk.com)

Join the AI Testing Hackathon this Friday

Esben Kran12 Dec 2022 14:24 UTC

10 points

0 comments8 min readLW link

(alignmentjam.com)

Side-channels: input versus output

davidad12 Dec 2022 12:32 UTC

44 points

16 comments2 min readLW link

Take 9: No, RLHF/IDA/debate doesn’t solve outer alignment.

Charlie Steiner12 Dec 2022 11:51 UTC

33 points

13 comments2 min readLW link

Creating a database for base rates

nikos12 Dec 2022 10:09 UTC

2 points

1 comment3 min readLW link

(forum.effectivealtruism.org)

Trivial GPT-3.5 limitation workaround

Dave92F112 Dec 2022 8:42 UTC

5 points

4 comments1 min readLW link

Ponzi schemes can be highly profitable if your timing is good

GeneSmith12 Dec 2022 6:42 UTC

10 points

18 comments5 min readLW link

Prodding ChatGPT to solve a basic algebra problem

Shmi12 Dec 2022 4:09 UTC

14 points

6 comments1 min readLW link

(twitter.com)

Wider Default Audio Player in Chrome?

jefftk12 Dec 2022 3:30 UTC

11 points

2 comments1 min readLW link

(www.jefftk.com)

A brainteaser for language models

Adam Scherlis12 Dec 2022 2:43 UTC

47 points

3 comments2 min readLW link

Benchmarks for Comparing Human and AI Intelligence

MrThink11 Dec 2022 22:06 UTC

9 points

4 comments2 min readLW link

Reflections on the PIBBSS Fellowship 2022

Nora_Ammann and particlemania

11 Dec 2022 21:53 UTC

32 points

0 comments18 min readLW link

A crisis for online communication: bots and bot users will overrun the Internet?

Mitchell_Porter11 Dec 2022 21:11 UTC

15 points

11 comments1 min readLW link

Finite Factored Sets in Pictures

Magdalena Wache11 Dec 2022 18:49 UTC

189 points

35 comments12 min readLW link

Formalization as suspension of intuition

adamShimi11 Dec 2022 15:16 UTC

54 points

18 comments1 min readLW link

(epistemologicalvigilance.substack.com)

An argument on animal consciousness (soliciting criticism)

SciHamster11 Dec 2022 15:12 UTC

1 point

2 comments1 min readLW link

ChatGPT’s new novel rationality technique of fact checking

ChristianKl11 Dec 2022 13:54 UTC

−14 points

7 comments1 min readLW link

Reframing inner alignment

davidad11 Dec 2022 13:53 UTC

53 points

13 comments4 min readLW link

A poem about applied rationality by ChatGPT

ChristianKl11 Dec 2022 13:43 UTC

4 points

0 comments1 min readLW link

ChatGPT goes through a wormhole hole in our Shandyesque universe [virtual wacky weed]

Bill Benzon11 Dec 2022 11:59 UTC

−1 points

2 comments3 min readLW link

Using Obsidian if you’re used to using Roam

Solenoid_Entity11 Dec 2022 8:59 UTC

19 points

4 comments2 min readLW link

[fiction] Our Final Hour

Mati_Roy11 Dec 2022 5:49 UTC

33 points

6 comments3 min readLW link

Consider using reversible automata for alignment research

Alex_Altair11 Dec 2022 1:00 UTC

89 points

30 comments2 min readLW link

High level discourse structure in ChatGPT: Part 2 [Quasi-symbolic?]

Bill Benzon10 Dec 2022 22:26 UTC

7 points

0 comments6 min readLW link

Poll Results on AGI

Niclas Kupper10 Dec 2022 21:25 UTC

18 points

0 comments2 min readLW link

Reflecting on the 2022 Guild of the Rose Workshops

moridinamael10 Dec 2022 21:21 UTC

25 points

7 comments8 min readLW link

[Question] Reversing a quantum simulation on the planetary scale

Mythopoeist10 Dec 2022 20:26 UTC

2 points

3 comments1 min readLW link

ACX Zurich December Meetup

MB10 Dec 2022 19:23 UTC

1 point

0 comments1 min readLW link

[ASoT] Natural abstractions and AlphaZero

Ulisse Mini10 Dec 2022 17:53 UTC

33 points

1 comment1 min readLW link

(arxiv.org)

[Question] How promising are legal avenues to restrict AI training data?

thehalliard10 Dec 2022 16:31 UTC

9 points

2 comments1 min readLW link

Inspiration as a Scarce Resource

zenbu zenbu zenbu zenbu10 Dec 2022 15:23 UTC

7 points

0 comments4 min readLW link

(inflorescence.substack.com)

Will Manifold Markets/Metaculus have built-in support for reflective latent variables by 2025?

tailcalled10 Dec 2022 13:55 UTC

35 points

0 comments1 min readLW link

My thoughts on OpenAI’s Alignment plan

Donald Hobson10 Dec 2022 10:35 UTC

25 points

1 comment6 min readLW link

[Question] How would you improve ChatGPT’s filtering?

Noah Scales10 Dec 2022 8:05 UTC

9 points

6 comments1 min readLW link

[Question] A thought experiment

sisyphus10 Dec 2022 5:23 UTC

3 points

12 comments1 min readLW link

patio11′s “Observations from an EA-adjacent (?) charitable effort”

RobertM10 Dec 2022 0:27 UTC

43 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

A dynamical systems primer for entropy and optimization

Alex_Altair10 Dec 2022 0:13 UTC

45 points

3 comments7 min readLW link

[Linkpost] The Story Of VaccinateCA

hath9 Dec 2022 23:54 UTC

104 points

4 comments10 min readLW link

(www.worksinprogress.co)

Prosaic misalignment from the Solomonoff Predictor

Cleo Nardo9 Dec 2022 17:53 UTC

43 points

3 comments5 min readLW link

Take 8: Queer the inner/outer alignment dichotomy.

Charlie Steiner9 Dec 2022 17:46 UTC

31 points

2 comments2 min readLW link