All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All JanFebMar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 212223 24 25 26 27 28

There are (probably) no superhuman Go AIs: strong human players beat the strongest AIs

TaranFeb 19, 2023, 12:25 PM

125 points

34 comments4 min readLW link

Navigating public AI x-risk hype while pursuing technical solutions

Dan BraunFeb 19, 2023, 12:22 PM

18 points

0 comments2 min readLW link

Somewhat against “just update all the way”

tailcalledFeb 19, 2023, 10:49 AM

31 points

10 comments2 min readLW link

Human beats SOTA Go AI by learning an adversarial policy

Vanessa KosoyFeb 19, 2023, 9:38 AM

59 points

32 comments1 min readLW link

(goattack.far.ai)

Degamification

Nate ShowellFeb 19, 2023, 5:35 AM

23 points

2 comments2 min readLW link

Stop posting prompt injections on Twitter and calling it “misalignment”

lcFeb 19, 2023, 2:21 AM

144 points

9 comments1 min readLW link

AGI in sight: our look at the game board

Andrea_Miotti and Gabriel Alfour

Feb 18, 2023, 10:17 PM

227 points

135 comments6 min readLW link

(andreamiotti.substack.com)

We should be signal-boosting anti Bing chat content

mbrooksFeb 18, 2023, 6:52 PM

−4 points

13 comments2 min readLW link

Can talk, can think, can suffer.

IlioFeb 18, 2023, 6:43 PM

1 point

8 comments3 min readLW link

Parametrically retargetable decision-makers tend to seek power

TurnTroutFeb 18, 2023, 6:41 PM

172 points

10 comments2 min readLW link

(arxiv.org)

Near-Term Risks of an Obedient Artificial Intelligence

ymeskhoutFeb 18, 2023, 6:30 PM

20 points

1 comment6 min readLW link

EIS VII: A Challenge for Mechanists

scasperFeb 18, 2023, 6:27 PM

36 points

4 comments3 min readLW link

Reading Speed Exists!

Johannes C. MayerFeb 18, 2023, 3:30 PM

12 points

9 comments1 min readLW link

The Practitioner’s Path 2.0: the Meditative Archetype

EvenflairFeb 18, 2023, 3:23 PM

14 points

1 comment2 min readLW link

(guildoftherose.org)

Should we cry “wolf”?

TapataktFeb 18, 2023, 11:24 AM

24 points

5 comments1 min readLW link

[Question] Name of the fallacy of assuming an extreme value (e.g. 0) with the illusion of ‘avoiding to have to make an assumption’?

FlorianHFeb 18, 2023, 8:11 AM

4 points

1 comment1 min readLW link

I Think We’re Approaching The Bitter Lesson’s Asymptote

SomeoneYouOnceKnewFeb 18, 2023, 5:33 AM

−3 points

9 comments5 min readLW link

Bus-Only Bus Lane Enforcement

jefftkFeb 18, 2023, 2:50 AM

19 points

15 comments1 min readLW link

(www.jefftk.com)

Run Head on Towards the Falling Tears

Johannes C. MayerFeb 18, 2023, 1:33 AM

6 points

0 comments2 min readLW link

Two problems with ‘Simulators’ as a frame

ryan_greenblattFeb 17, 2023, 11:34 PM

79 points

13 comments5 min readLW link

GPT-4 Predictions

Stephen McAleeseFeb 17, 2023, 11:20 PM

110 points

27 comments11 min readLW link

On Board Vision, Hollow Words, and the End of the World

MarcelloFeb 17, 2023, 11:18 PM

52 points

27 comments5 min readLW link

PICT: A Zero-Shot Prompt Template to Automate Evaluation

Quentin FEUILLADE--MONTIXIFeb 17, 2023, 11:16 PM

17 points

1 comment11 min readLW link

Hunch seeds: Info bio

the gears to ascensionFeb 17, 2023, 9:25 PM

12 points

0 comments9 min readLW link

Why Do We Believe

ScrewtapeFeb 17, 2023, 8:58 PM

9 points

3 comments3 min readLW link

I Am Scared of Posting Negative Takes About Bing’s AI

YitzFeb 17, 2023, 8:50 PM

63 points

28 comments1 min readLW link

EIS VI: Critiques of Mechanistic Interpretability Work in AI Safety

scasperFeb 17, 2023, 8:48 PM

49 points

9 comments12 min readLW link

Tinker Bell Theory and LLMs

Fergus FettesFeb 17, 2023, 8:23 PM

1 point

11 comments1 min readLW link

Recommendation: Bug Bounties and Responsible Disclosure for Advanced ML Systems

VaniverFeb 17, 2023, 8:11 PM

125 points

12 comments2 min readLW link

Microsoft and OpenAI, stop telling chatbots to roleplay as AI

hold_my_fishFeb 17, 2023, 7:55 PM

50 points

10 comments1 min readLW link

A warm-up for the AI governance project

jacekFeb 17, 2023, 6:06 PM

10 points

2 comments3 min readLW link

Link Post > Blog Post

party girlFeb 17, 2023, 5:59 PM

4 points

6 comments1 min readLW link

(onthespectrumontheguestlist.substack.com)

One-layer transformers aren’t equivalent to a set of skip-trigrams

BuckFeb 17, 2023, 5:26 PM

127 points

11 comments7 min readLW link

[Question] Should we be kind and polite to emerging AIs?

David GrossFeb 17, 2023, 4:58 PM

9 points

13 comments1 min readLW link

Follow-up Posting on Cyborg Psychologist

Hopkins StanleyFeb 17, 2023, 4:56 PM

0 points

2 comments1 min readLW link

(www.lesswrong.com)

A “slow takeoff” might still look fast

MichaelDickensFeb 17, 2023, 4:51 PM

5 points

3 comments1 min readLW link

AI Safety Info Distillation Fellowship

Robert Miles and mwatkins

Feb 17, 2023, 4:16 PM

47 points

3 comments3 min readLW link

Nozick’s Dilemma: A Critique of Game Theory

Edward P. KöningsFeb 17, 2023, 4:11 PM

10 points

1 comment13 min readLW link

[Question] Are LLMs sufficient for AI takeoff?

rpglover64Feb 17, 2023, 3:46 PM

8 points

2 comments1 min readLW link

Sydney’s Secret: A Short Story by Bing Chat

felaFeb 17, 2023, 1:31 PM

36 points

1 comment5 min readLW link

Automating Consistency

HoagyFeb 17, 2023, 1:24 PM

10 points

0 comments1 min readLW link

Human decision processes are not well factored

remember and Gabriel Alfour

Feb 17, 2023, 1:11 PM

33 points

3 comments2 min readLW link

2023 ACX Predictions: Buy/Sell/Hold

ZviFeb 17, 2023, 1:10 PM

25 points

3 comments20 min readLW link

(thezvi.wordpress.com)

Bing chat is the AI fire alarm

RatiosFeb 17, 2023, 6:51 AM

115 points

63 comments3 min readLW link

Seeing more whole

Joe CarlsmithFeb 17, 2023, 5:12 AM

31 points

1 comment26 min readLW link

Powerful mesa-optimisation is already here

Roman LeventovFeb 17, 2023, 4:59 AM

35 points

1 comment2 min readLW link

(arxiv.org)

Self-Reference Breaks the Orthogonality Thesis

lsusrFeb 17, 2023, 4:11 AM

43 points

35 comments2 min readLW link

The public supports regulating AI for safety

Zach Stein-PerlmanFeb 17, 2023, 4:10 AM

114 points

9 comments1 min readLW link

(aiimpacts.org)

Bring “Ban faster SIMD semiconductors” into the Overton window

worried-techno-optimistFeb 17, 2023, 3:27 AM

−7 points

1 comment2 min readLW link

Republishing an old essay in light of current news on Bing’s AI: “Regarding Blake Lemoine’s claim that LaMDA is ‘sentient’, he might be right (sorta), but perhaps not for the reasons he thinks”

philosophybearFeb 17, 2023, 3:27 AM

3 points

0 comments5 min readLW link

(philosophybear.substack.com)