AI #55: Keep Clauding Along

Zvi

Mar 14, 2024, 3:40 PM

62 points

16 comments, 70 min read, LW link

(thezvi.wordpress.com)

To the average human, controlled AI is just as lethal as ‘misaligned’ AI

YonatanK

Mar 14, 2024, 2:52 PM

6 points

20 comments, 5 min read, LW link

Claude vs GPT

Maxwell Tabarrok

Mar 14, 2024, 12:41 PM

12 points

2 comments, 2 min read, LW link

(www.maximum-progress.com)

A brief review of China’s AI industry and regulations

Elliot Mckernon

Mar 14, 2024, 12:19 PM

24 points

0 comments, 16 min read, LW link

[Question] Can any LLM be represented as an Equation?

Valentin Baltadzhiev

Mar 14, 2024, 9:51 AM

1 point

2 comments, 1 min read, LW link

‘Empiricism!’ as Anti-Epistemology

Eliezer Yudkowsky

Mar 14, 2024, 2:02 AM

171 points

92 comments, 25 min read, LW link

Opportunistic Time-Management

Richard Henage

Mar 13, 2024, 9:38 PM

13 points

2 comments, 1 min read, LW link

AI governance and strategy: a list of research agendas and work that could be done.

NathanBarnard and Erin Robertson

Mar 13, 2024, 9:23 PM

7 points

1 comment, 17 min read, LW link

Highlights from Lex Fridman’s interview of Yann LeCun

Joel Burget

Mar 13, 2024, 8:58 PM

48 points

15 comments, 41 min read, LW link

On the Latest TikTok Bill

Zvi

Mar 13, 2024, 6:50 PM

58 points

7 comments, 29 min read, LW link

(thezvi.wordpress.com)

[Question] Recommended book for a balanced take and lessons learned from covid pandemic response

Martin Hare Robertson

Mar 13, 2024, 6:14 PM

4 points

0 comments, 1 min read, LW link

ACX/LW Seattle spring meetup 2024

nsokolsky

Mar 13, 2024, 5:24 PM

12 points

3 comments, 1 min read, LW link

Laying the Foundations for Vision and Multimodal Mechanistic Interpretability & Open Problems

Sonia Joseph and Neel Nanda

Mar 13, 2024, 5:09 PM

44 points

13 comments, 14 min read, LW link

I was raised by devout Mormons, AMA [&|] Soliciting Advice

ErioirE

Mar 13, 2024, 4:52 PM

32 points

41 comments, 2 min read, LW link

Relational Agency: Consistently Reaching Out

Jonathan Moregård

Mar 13, 2024, 2:34 PM

16 points

0 comments, 5 min read, LW link

(open.substack.com)

[Question] What could a policy banning AGI look like?

TsviBT

Mar 13, 2024, 2:19 PM

78 points

23 comments, 3 min read, LW link

Clickbait Soapboxing

DaystarEld

Mar 13, 2024, 2:09 PM

24 points

16 comments, 3 min read, LW link

(daystareld.com)

Virtual AI Safety Unconference 2024

Orpheus, Linda Linsefors, Joe Rogero, Arjun Yadav and Manuela García

Mar 13, 2024, 1:54 PM

14 points

0 comments, 1 min read, LW link

Jobs, Relationships, and Other Cults

Ruby and Elizabeth

Mar 13, 2024, 5:58 AM

40 points

9 comments, 35 min read, LW link

How do you improve the quality of your drinking water?

Alex K. Chen (parrot)

Mar 13, 2024, 12:37 AM

11 points

2 comments, 1 min read, LW link

The Parable Of The Fallen Pendulum—Part 2

johnswentworth

Mar 12, 2024, 9:41 PM

78 points

8 comments, 4 min read, LW link

Open consultancy: Letting untrusted AIs choose what answer to argue for

Fabien Roger

Mar 12, 2024, 8:38 PM

35 points

5 comments, 5 min read, LW link

[Question] Is anyone working on formally verified AI toolchains?

metachirality

Mar 12, 2024, 7:36 PM

17 points

4 comments, 1 min read, LW link

Transformer Debugger

Henk Tillman

Mar 12, 2024, 7:08 PM

26 points

0 comments, 1 min read, LW link

(github.com)

Superforecasting the Origins of the Covid-19 Pandemic

DanielFilan

Mar 12, 2024, 7:01 PM

64 points

0 comments, 1 min read, LW link

(goodjudgment.substack.com)

minimum viable action

Sindhu Prasad

Mar 12, 2024, 4:06 PM

1 point

0 comments, 3 min read, LW link

Hardball questions for the Gemini Congressional Hearing

Michael Thiessen

Mar 12, 2024, 3:27 PM

−11 points

2 comments, 1 min read, LW link

OpenAI: The Board Expands

Zvi

Mar 12, 2024, 2:00 PM

92 points

1 comment, 30 min read, LW link

(thezvi.wordpress.com)

Update on Developing an Ethics Calculator to Align an AGI to

sweenesm

Mar 12, 2024, 12:33 PM

4 points

2 comments, 8 min read, LW link

[Question] How do you identify and counteract your biases in decision-making?

warrenjordan

Mar 12, 2024, 5:01 AM

2 points

1 comment, 1 min read, LW link

How Much Have I Been Playing?

jefftk

Mar 12, 2024, 2:10 AM

9 points

0 comments, 1 min read, LW link

(www.jefftk.com)

Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought

Miles Turpin

Mar 11, 2024, 11:46 PM

16 points

0 comments, 1 min read, LW link

(arxiv.org)

AI Safety Action Plan—A report commissioned by the US State Department

agucova

Mar 11, 2024, 10:14 PM

22 points

1 comment, 1 min read, LW link

(www.gladstone.ai)

A discussion of AI risk and the cost/benefit calculation of stopping or pausing AI development

DuncanFowler

Mar 11, 2024, 9:41 PM

1 point

0 comments, 1 min read, LW link

Among the A.I. Doomsayers—The New Yorker

agucova

Mar 11, 2024, 9:35 PM

12 points

1 comment, 1 min read, LW link

(www.newyorker.com)

Be More Katja

Nathan Young

Mar 11, 2024, 9:12 PM

53 points

0 comments, 3 min read, LW link

AI Incident Reporting: A Regulatory Review

Deric Cheng and Elliot Mckernon

Mar 11, 2024, 9:03 PM

16 points

0 comments, 6 min read, LW link

Results from an Adversarial Collaboration on AI Risk (FRI)

Josh Rosenberg, AvitalM, Molly and rosehadshar

Mar 11, 2024, 8:00 PM

61 points

3 comments, 9 min read, LW link

(forecastingresearch.org)

The Astronomical Sacrifice Dilemma

Matthew McRedmond

Mar 11, 2024, 7:58 PM

15 points

3 comments, 4 min read, LW link

Epiphenomenalism leads to eliminativism about qualia

Clément L

Mar 11, 2024, 7:53 PM

4 points

0 comments, 7 min read, LW link

The Best Essay (Paul Graham)

Chris_Leong

Mar 11, 2024, 7:25 PM

25 points

2 comments, 1 min read, LW link

(paulgraham.com)

Open Thread Spring 2024

habryka

Mar 11, 2024, 7:17 PM

22 points

160 comments, 1 min read, LW link

New social credit formalizations

KatjaGrace

Mar 11, 2024, 7:00 PM

23 points

3 comments, 2 min read, LW link

(worldspiritsockpuppet.com)

How disagreements about Evidential Correlations could be settled

Martín Soto

Mar 11, 2024, 6:28 PM

12 points

3 comments, 4 min read, LW link

“Artificial General Intelligence”: an extremely brief FAQ

Steven Byrnes

Mar 11, 2024, 5:49 PM

75 points

6 comments, 2 min read, LW link

Some (problematic) aesthetics of what constitutes good work in academia

Steven Byrnes

Mar 11, 2024, 5:47 PM

148 points

12 comments, 12 min read, LW link

Storable Votes with a Pay as you win mechanism: a contribution for institutional design

Arturo Macias

Mar 11, 2024, 3:58 PM

17 points

19 comments, 2 min read, LW link

Tend to your clarity, not your confusion

Severin T. Seehrich

Mar 11, 2024, 3:09 PM

23 points

1 comment, 6 min read, LW link

[Question] What do we know about the AI knowledge and views, especially about existential risk, of the new OpenAI board members?

Zvi

Mar 11, 2024, 2:55 PM

60 points

2 comments, 2 min read, LW link

“How could I have thought that faster?”

mesaoptimizer

Mar 11, 2024, 10:56 AM

237 points

32 comments, 2 min read, LW link

(twitter.com)