All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All JanFebMar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8910 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28

A (EtA: quick) note on terminology: AI Alignment != AI x-safety

David Scott Krueger8 Feb 2023 22:33 UTC

46 points

20 comments1 min readLW link

GPT-175bee

Adam Scherlis and LawrenceC

8 Feb 2023 18:58 UTC

126 points

14 comments1 min readLW link

EigenKarma: trust at scale

Henrik Karlsson8 Feb 2023 18:52 UTC

186 points

52 comments5 min readLW link

Conditioning Predictive Models: Interactions with other approaches

evhub, Adam Jermyn, Johannes Treutlein, Rubi J. Hudson and kcwoolverton

8 Feb 2023 18:19 UTC

32 points

2 comments11 min readLW link

Wanted: Technical animator and/or front-end developer for interactive diagrams of invention

jasoncrawford8 Feb 2023 17:14 UTC

30 points

3 comments1 min readLW link

(rootsofprogress.org)

A multi-disciplinary view on AI safety research

Roman Leventov8 Feb 2023 16:50 UTC

47 points

4 comments26 min readLW link

Community building: Lessons from ten years of facilitation experience

Severin T. Seehrich8 Feb 2023 16:26 UTC

17 points

0 comments21 min readLW link

Progress links and tweets, 2023-02-08

jasoncrawford8 Feb 2023 15:52 UTC

10 points

0 comments1 min readLW link

(rootsofprogress.org)

A Particular Equilibrium

Algon8 Feb 2023 15:16 UTC

13 points

0 comments2 min readLW link

(algon-33.github.io)

Self-Awareness (and possible mode collapse around it) in ChatGPT

Yitz8 Feb 2023 9:57 UTC

18 points

2 comments2 min readLW link

Drugs are Sometimes Good, Actually

Gordon Seidoh Worley8 Feb 2023 2:24 UTC

13 points

8 comments4 min readLW link

House Covid Infection Retrospective

jefftk8 Feb 2023 2:20 UTC

25 points

1 comment2 min readLW link

(www.jefftk.com)

Noting an error in Inadequate Equilibria

Matthew Barnett8 Feb 2023 1:33 UTC

379 points

60 comments2 min readLW link 2 reviews

Living Nomadically: My 80/20 Guide

KatWoods8 Feb 2023 1:31 UTC

38 points

18 comments1 min readLW link

OpenAI/Microsoft announce “next generation language model” integrated into Bing/Edge

LawrenceC7 Feb 2023 20:38 UTC

79 points

4 comments1 min readLW link

(blogs.microsoft.com)

How evals might (or might not) prevent catastrophic risks from AI

Orpheus167 Feb 2023 20:16 UTC

45 points

0 comments9 min readLW link

Conditioning Predictive Models: Making inner alignment as easy as possible

evhub, Adam Jermyn, Johannes Treutlein, Rubi J. Hudson and kcwoolverton

7 Feb 2023 20:04 UTC

33 points

2 comments19 min readLW link

On The Current Status Of AI Dating

Nikita Brancatisano7 Feb 2023 20:00 UTC

53 points

8 comments6 min readLW link

Framing AI strategy

Zach Stein-Perlman7 Feb 2023 19:20 UTC

34 points

1 comment18 min readLW link

(aiimpacts.org)

Review of AI Alignment Progress

PeterMcCluskey7 Feb 2023 18:57 UTC

72 points

32 comments7 min readLW link

(bayesianinvestor.com)

The Economics of Contracts

Edward P. Könings7 Feb 2023 13:52 UTC

21 points

3 comments8 min readLW link

(edwardknings.substack.com)

Two very different experiences with ChatGPT

Sherrinford7 Feb 2023 13:09 UTC

38 points

15 comments5 min readLW link

[About Me] Cinera’s Home Page

DragonGod7 Feb 2023 12:56 UTC

30 points

2 comments9 min readLW link

Stuff I Recommend You Use

Arjun Panickssery7 Feb 2023 12:18 UTC

17 points

2 comments2 min readLW link

(arjunpanickssery.substack.com)

AXRP: Store, Patreon, Video

DanielFilan7 Feb 2023 4:50 UTC

12 points

0 comments1 min readLW link

Duckbill Masks Are Great

jefftk7 Feb 2023 3:00 UTC

22 points

14 comments1 min readLW link

(www.jefftk.com)

EA & LW Forum Weekly Summary (30th Jan − 5th Feb 2023)

Zoe Williams7 Feb 2023 2:13 UTC

3 points

3 comments14 min readLW link

[ASoT] Policy Trajectory Visualization

Ulisse Mini7 Feb 2023 0:13 UTC

9 points

2 comments1 min readLW link

English is a Terrible Programming Language—And other reasons AI won’t displace programmers

dawsoneliasen6 Feb 2023 22:12 UTC

22 points

8 comments8 min readLW link

(orbistertius.substack.com)

African Wild Dogs Vote By Sneezing—Can AI Help Us Do Better?

Augmented Assembly6 Feb 2023 21:09 UTC

10 points

6 comments4 min readLW link

In defense of the MBTI

ZZZZZZ6 Feb 2023 21:08 UTC

−14 points

22 comments4 min readLW link

Early situational awareness and its implications, a story

Jacob Pfau6 Feb 2023 20:45 UTC

29 points

6 comments3 min readLW link

Conditioning Predictive Models: The case for competitiveness

evhub, Adam Jermyn, Johannes Treutlein, Rubi J. Hudson and kcwoolverton

6 Feb 2023 20:08 UTC

20 points

3 comments11 min readLW link

Google announces ‘Bard’ powered by LaMDA

M. Y. Zuo6 Feb 2023 19:40 UTC

31 points

3 comments2 min readLW link

SolidGoldMagikarp II: technical details and more recent findings

mwatkins and Jessica Rumbelow

6 Feb 2023 19:09 UTC

114 points

45 comments13 min readLW link

Addendum: More Efficient FFNs via Attention

Robert_AIZI6 Feb 2023 18:55 UTC

10 points

2 comments5 min readLW link

(aizi.substack.com)

Here’s Why I’m Hesitant To Respond In More Depth

DirectedEvolution6 Feb 2023 18:36 UTC

56 points

10 comments4 min readLW link 1 review

Childhoods of exceptional people

Henrik Karlsson6 Feb 2023 17:27 UTC

353 points

62 comments15 min readLW link 1 review

(escapingflatland.substack.com)

Foodpairing and Embeddings

jurabrazdil6 Feb 2023 15:09 UTC

14 points

2 comments5 min readLW link

Monthly Roundup #3

Zvi6 Feb 2023 13:00 UTC

41 points

9 comments27 min readLW link

(thezvi.wordpress.com)

Project Idea: Lots of Cause-area-specific Online Unconferences

Linda Linsefors6 Feb 2023 11:05 UTC

27 points

1 comment5 min readLW link

(docs.google.com)

Oxford Essay Writing

Flourish Journal, RP and Jemima

6 Feb 2023 8:24 UTC

5 points

0 comments1 min readLW link

Decision Transformer Interpretability

Joseph Bloom and Paul Colognese

6 Feb 2023 7:29 UTC

87 points

13 comments24 min readLW link

Why is Everyone So Boring? By Robin Hanson

trevor6 Feb 2023 4:17 UTC

59 points

11 comments1 min readLW link

(www.overcomingbias.com)

Gradient surfing: the hidden role of regularization

Jesse Hoogland6 Feb 2023 3:50 UTC

38 points

9 comments14 min readLW link

(www.jessehoogland.com)

Why Are Bacteria So Simple?

aysja6 Feb 2023 3:00 UTC

172 points

33 comments10 min readLW link

The Law of Identity

Chris_Leong6 Feb 2023 2:59 UTC

5 points

5 comments4 min readLW link

Robin Hanson on “Explaining the Sacred”

Raemon6 Feb 2023 0:50 UTC

13 points

3 comments3 min readLW link

(www.overcomingbias.com)

Interview Daniel Murfet on Universal Phenomena in Learning Machines

Alexander Gietelink Oldenziel6 Feb 2023 0:00 UTC

61 points

2 comments16 min readLW link

SolidGoldMagikarp (plus, prompt generation)

Jessica Rumbelow and mwatkins

5 Feb 2023 22:02 UTC

679 points

208 comments12 min readLW link 1 review