Great to hear, thanks :-)
EA & LW Forums Weekly Summary (21 Aug – 27 Aug 22’)
Super interesting, thanks!
If you were running it again, you might want to standardize the wording of the questions: it varies from ‘will / is’ to ‘is likely’ to ‘plausible’, which can make it hard to compare across questions. ‘Plausible’ in particular is quite a fuzzy word: for some it might mean a probability of 1% or more; for others it might just mean it’s not completely impossible (i.e. if a movie had that storyline, they’d be okay with it).
EA & LW Forums Weekly Summary (28 Aug – 3 Sep 22’)
EA & LW Forums Weekly Summary (5 – 11 Sep 22’)
EA & LW Forums Weekly Summary (12 – 18 Sep ’22)
EA & LW Forums Weekly Summary (19 – 25 Sep 22’)
Thanks for the info—added to post
EA & LW Forums Weekly Summary (26 Sep – 9 Oct 22’)
Thanks, great to hear!
EA & LW Forums Weekly Summary (10 – 16 Oct 22’)
Thanks, realized I forgot to add the description of the top / curated section; fixed now. Everything in there appears in its own section too.
EA & LW Forums Weekly Summary (17 – 23 Oct 22’)
Thanks for the feedback! I’ve passed it on.
It’s mainly because we wanted to keep the episodes to ~20 minutes, so they’re easy for people to keep up with week to week, and the LW posts tended toward the more technical side, which doesn’t translate as easily to podcast form (it can be hard to take in without the writing in front of you). We may do something for the LW posts in future, though we’re unsure at this point.
EA & LW Forums Weekly Summary (24th – 30th Oct 22’)
EA & LW Forums Weekly Summary (31st Oct – 6th Nov 22’)
EA & LW Forums Weekly Summary (7th Nov – 13th Nov 22’)
EA & LW Forums Weekly Summary (14th Nov – 27th Nov 22’)
Post summary (feel free to suggest edits!):
Last year, the author wrote up a plan they gave a “better than 50/50 chance” of working before AGI kills us all. It predicted that in 4-5 years, the alignment field would progress from preparadigmatic (unsure of the right questions or tools) to having a general roadmap and toolset. They believe this is on track, and give a 40% likelihood that over the next 1-2 years the field will converge toward primarily working on decoding the internal language of neural nets: interpretability on the experimental side, in addition to theoretical work. This could help identify which potential alignment targets (like human values, corrigibility, or Do What I Mean) are likely to be naturally expressible in the internal language of neural nets, and how to express them. They think we should then focus on those.
In their personal work, they’ve found theory work faster than expected, and crossing the theory-practice gap mildly slower. In 2022, most of their time went into theory work (e.g. the Basic Foundations sequence), workshops and conferences, training others, and writing up intro-level arguments on alignment strategies.
(If you’d like to see more summaries of top EA and LW forum posts, check out the Weekly Summaries series.)
Good point, thank you! I’ve re-read the conclusion and replaced the sentence with: “Due to this, he concludes that climate change is still an important LT area, though not as important as some other global catastrophic risks (e.g. biorisk), which score higher on both neglectedness and scale.”
Originally I think I’d slightly mistaken his position, based on this sentence: “Overall, because other global catastrophic risks are so much more neglected than climate change, I think they are more pressing to work on, on the margin.” (In addition, I hadn’t used the clearest phrasing.) But the wider conclusion fits the new sentence better.