All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 131415 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Opportunistic Time-Management

Richard Henage13 Mar 2024 21:38 UTC

13 points

2 comments1 min readLW link

AI governance and strategy: a list of research agendas and work that could be done.

NathanBarnard and Erin Robertson

13 Mar 2024 21:23 UTC

9 points

2 comments17 min readLW link

Highlights from Lex Fridman’s interview of Yann LeCun

Joel Burget13 Mar 2024 20:58 UTC

48 points

15 comments41 min readLW link

On the Latest TikTok Bill

Zvi13 Mar 2024 18:50 UTC

58 points

7 comments29 min readLW link

(thezvi.wordpress.com)

[Question] Recommended book for a balanced take and lessons learned from covid pandemic response

Martin Hare Robertson13 Mar 2024 18:14 UTC

4 points

0 comments1 min readLW link

ACX/LW Seattle spring meetup 2024

nsokolsky13 Mar 2024 17:24 UTC

12 points

3 comments1 min readLW link

Laying the Foundations for Vision and Multimodal Mechanistic Interpretability & Open Problems

Sonia Joseph and Neel Nanda

13 Mar 2024 17:09 UTC

44 points

13 comments14 min readLW link

I was raised by devout Mormons, AMA [&|] Soliciting Advice

ErioirE13 Mar 2024 16:52 UTC

32 points

41 comments2 min readLW link

Relational Agency: Consistently Reaching Out

Jonathan Moregård13 Mar 2024 14:34 UTC

16 points

0 comments5 min readLW link

(open.substack.com)

[Question] What could a policy banning AGI look like?

TsviBT13 Mar 2024 14:19 UTC

80 points

23 comments3 min readLW link

Clickbait Soapboxing

DaystarEld13 Mar 2024 14:09 UTC

24 points

16 comments3 min readLW link

(daystareld.com)

Virtual AI Safety Unconference 2024

Orpheus, Linda Linsefors, Joe Rogero, Arjun Yadav and Manuela García

13 Mar 2024 13:54 UTC

14 points

0 comments1 min readLW link

Jobs, Relationships, and Other Cults

Ruby and Elizabeth

13 Mar 2024 5:58 UTC

49 points

9 comments35 min readLW link

How do you improve the quality of your drinking water?

Alex K. Chen (StochasticCockatoo)13 Mar 2024 0:37 UTC

11 points

2 comments1 min readLW link

The Parable Of The Fallen Pendulum—Part 2

johnswentworth12 Mar 2024 21:41 UTC

79 points

8 comments4 min readLW link

Open consultancy: Letting untrusted AIs choose what answer to argue for

Fabien Roger12 Mar 2024 20:38 UTC

35 points

5 comments5 min readLW link

[Question] Is anyone working on formally verified AI toolchains?

metachirality12 Mar 2024 19:36 UTC

17 points

4 comments1 min readLW link

Transformer Debugger

Henk Tillman12 Mar 2024 19:08 UTC

26 points

0 comments1 min readLW link

(github.com)

Superforecasting the Origins of the Covid-19 Pandemic

DanielFilan12 Mar 2024 19:01 UTC

64 points

0 comments1 min readLW link

(goodjudgment.substack.com)

minimum viable action

Sindhu Prasad12 Mar 2024 16:06 UTC

1 point

0 comments3 min readLW link

Hardball questions for the Gemini Congressional Hearing

Michael Thiessen12 Mar 2024 15:27 UTC

−11 points

2 comments1 min readLW link

OpenAI: The Board Expands

Zvi12 Mar 2024 14:00 UTC

92 points

1 comment30 min readLW link

(thezvi.wordpress.com)

Update on Developing an Ethics Calculator to Align an AGI to

sweenesm12 Mar 2024 12:33 UTC

4 points

2 comments8 min readLW link

[Question] How do you identify and counteract your biases in decision-making?

warrenjordan12 Mar 2024 5:01 UTC

2 points

1 comment1 min readLW link

How Much Have I Been Playing?

jefftk12 Mar 2024 2:10 UTC

9 points

0 comments1 min readLW link

(www.jefftk.com)

Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought

Miles Turpin11 Mar 2024 23:46 UTC

16 points

0 comments1 min readLW link

(arxiv.org)

AI Safety Action Plan—A report commissioned by the US State Department

agucova11 Mar 2024 22:14 UTC

22 points

1 comment1 min readLW link

(www.gladstone.ai)

A discussion of AI risk and the cost/benefit calculation of stopping or pausing AI development

DuncanFowler11 Mar 2024 21:41 UTC

1 point

0 comments1 min readLW link

Among the A.I. Doomsayers—The New Yorker

agucova11 Mar 2024 21:35 UTC

12 points

1 comment1 min readLW link

(www.newyorker.com)

Be More Katja

Nathan Young11 Mar 2024 21:12 UTC

53 points

0 comments3 min readLW link

AI Incident Reporting: A Regulatory Review

Deric Cheng and Elliot Mckernon

11 Mar 2024 21:03 UTC

16 points

0 comments6 min readLW link

Results from an Adversarial Collaboration on AI Risk (FRI)

Josh Rosenberg, AvitalM, Molly and rosehadshar

11 Mar 2024 20:00 UTC

61 points

3 comments9 min readLW link

(forecastingresearch.org)

The Astronomical Sacrifice Dilemma

Matthew McRedmond11 Mar 2024 19:58 UTC

15 points

3 comments4 min readLW link

Epiphenomenalism leads to eliminativism about qualia

Clément L11 Mar 2024 19:53 UTC

4 points

0 comments7 min readLW link

The Best Essay (Paul Graham)

Chris_Leong11 Mar 2024 19:25 UTC

25 points

2 comments1 min readLW link

(paulgraham.com)

Open Thread Spring 2024

habryka11 Mar 2024 19:17 UTC

22 points

162 comments1 min readLW link

New social credit formalizations

KatjaGrace11 Mar 2024 19:00 UTC

23 points

3 comments2 min readLW link

(worldspiritsockpuppet.com)

How disagreements about Evidential Correlations could be settled

Martín Soto11 Mar 2024 18:28 UTC

12 points

3 comments4 min readLW link

“Artificial General Intelligence”: an extremely brief FAQ

Steven Byrnes11 Mar 2024 17:49 UTC

75 points

6 comments2 min readLW link

Some (problematic) aesthetics of what constitutes good work in academia

Steven Byrnes11 Mar 2024 17:47 UTC

157 points

12 comments12 min readLW link

Storable Votes with a Pay as you win mechanism: a contribution for institutional design

Arturo Macias11 Mar 2024 15:58 UTC

17 points

19 comments2 min readLW link

Tend to your clarity, not your confusion

Severin T. Seehrich11 Mar 2024 15:09 UTC

23 points

1 comment6 min readLW link

[Question] What do we know about the AI knowledge and views, especially about existential risk, of the new OpenAI board members?

Zvi11 Mar 2024 14:55 UTC

60 points

2 comments2 min readLW link

“How could I have thought that faster?”

mesaoptimizer11 Mar 2024 10:56 UTC

256 points

37 comments2 min readLW link 4 reviews

(twitter.com)

Simple versus Short: Higher-order degeneracy and error-correction

Daniel Murfet11 Mar 2024 7:52 UTC

115 points

12 comments13 min readLW link 3 reviews

Deconstructing Bostrom’s Classic Argument for AI Doom

Nora Belrose11 Mar 2024 5:58 UTC

16 points

14 comments1 min readLW link

(www.youtube.com)

Advice Needed: Does Using a LLM Compomise My Personal Epistemic Security?

Naomi11 Mar 2024 5:57 UTC

17 points

7 comments2 min readLW link

Some Thoughts on Concept Formation and Use in Agents

CatGoddess11 Mar 2024 5:03 UTC

12 points

0 comments8 min readLW link

Steelmanning as an especially insidious form of strawmanning

Cornelius Dybdahl11 Mar 2024 2:25 UTC

10 points

13 comments5 min readLW link

One-shot strategy games?

Raemon11 Mar 2024 0:19 UTC

41 points

42 comments1 min readLW link