All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All 1 2 3 4 5 6 7 8910 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Caring only about your child may increase human suffering

Cipolla8 Aug 2024 23:47 UTC

−15 points

2 comments8 min readLW link

The Hessian rank bounds the learning coefficient

Lucius Bushnaq8 Aug 2024 20:55 UTC

68 points

11 comments4 min readLW link

GPT-4o System Card

Zach Stein-Perlman8 Aug 2024 20:30 UTC

68 points

11 comments2 min readLW link

(openai.com)

Parasites (not a metaphor)

lemonhope8 Aug 2024 20:07 UTC

137 points

19 comments1 min readLW link

Some Unorthodox Ways To Achieve High GDP Growth

johnswentworth and David Lorell

8 Aug 2024 18:58 UTC

58 points

6 comments6 min readLW link

You can remove GPT2’s LayerNorm by fine-tuning for an hour

StefanHex8 Aug 2024 18:33 UTC

166 points

11 comments8 min readLW link

Leaving MIRI, Seeking Funding

abramdemski8 Aug 2024 18:32 UTC

266 points

19 comments2 min readLW link

[Question] Does VETLM solve AI superalignment?

Oleg Trott8 Aug 2024 18:22 UTC

−1 points

10 comments1 min readLW link

Toy Models of Superposition: what about BitNets?

Alejandro Tlaie8 Aug 2024 16:29 UTC

5 points

1 comment5 min readLW link

[LDSL#1] Performance optimization as a metaphor for life

tailcalled8 Aug 2024 16:16 UTC

33 points

6 comments5 min readLW link

Four Randomized Control Trials In Economics

Maxwell Tabarrok8 Aug 2024 15:59 UTC

20 points

1 comment4 min readLW link

(www.maximum-progress.com)

Cheap Whiteboards!

Johannes C. Mayer8 Aug 2024 13:52 UTC

20 points

2 comments1 min readLW link

AI #76: Six Shorts Stories About OpenAI

Zvi8 Aug 2024 13:50 UTC

53 points

10 comments48 min readLW link

(thezvi.wordpress.com)

[Question] What the cost difference in processing input vs. output tokens with LLMs?

kotrfa8 Aug 2024 10:43 UTC

3 points

10 comments1 min readLW link

Meno’s Paradox

Sifr8 Aug 2024 5:59 UTC

0 points

10 comments1 min readLW link

Case Story: Lack of Consumer Protection Procedures AI Manipulation and the Threat of Fund Concentration in Crypto Seeking Assistance to Fund a Civil Case to Establish Facts and Protect Vulnerable Consumers from Damage Caused by Automated Systems

Petr 'Margot' Andreev8 Aug 2024 5:55 UTC

−9 points

0 comments9 min readLW link

Motivation Theory

Zero Contradictions8 Aug 2024 5:05 UTC

3 points

0 comments1 min readLW link

(thewaywardaxolotl.blogspot.com)

It’s time for a self-reproducing machine

Carl Feynman7 Aug 2024 21:52 UTC

114 points

74 comments9 min readLW link

[LDSL#0] Some epistemological conundrums

tailcalled7 Aug 2024 19:52 UTC

56 points

11 comments10 min readLW link

Help us seed AI Safety Brussels

gergogaspar and ENAIS

7 Aug 2024 6:32 UTC

3 points

2 comments3 min readLW link

Adaptive Coherence

Zero Contradictions7 Aug 2024 6:17 UTC

2 points

0 comments2 min readLW link

(thewaywardaxolotl.blogspot.com)

Individual Utilities Shift Continuously as Geometric Weights Shift

StrivingForLegibility7 Aug 2024 1:41 UTC

2 points

0 comments17 min readLW link

Gradient Ascenders Reach the Harsanyi Hyperplane

StrivingForLegibility7 Aug 2024 1:40 UTC

4 points

0 comments6 min readLW link

Deriving the Geometric Utilitarian Weights

StrivingForLegibility7 Aug 2024 1:39 UTC

2 points

0 comments11 min readLW link

Proving the Geometric Utilitarian Theorem

StrivingForLegibility7 Aug 2024 1:39 UTC

25 points

0 comments8 min readLW link

The Geometric Importance of Side Payments

StrivingForLegibility7 Aug 2024 1:38 UTC

8 points

4 comments3 min readLW link

Attention-Feature Tables in Gemma 2 Residual Streams

J Bostock6 Aug 2024 22:56 UTC

2 points

0 comments14 min readLW link

[Question] What are the strategic implications if aliens and Earth civilizations produce similar utilities?

Maxime Riché6 Aug 2024 21:16 UTC

4 points

1 comment1 min readLW link

WTH is Cerebrolysin, actually?

gsfitzgerald and delton137

6 Aug 2024 20:40 UTC

184 points

23 comments17 min readLW link

FHE Can’t Save Us: The Case Against Cryptographic AI Boxing

Bart Jaworski6 Aug 2024 17:46 UTC

6 points

1 comment6 min readLW link

Inference-Only Debate Experiments Using Math Problems

Arjun Panickssery, Abhimanyu Pallavi Sudhir and JacksonKaunismaa

6 Aug 2024 17:44 UTC

32 points

0 comments2 min readLW link

[Question] Is an AI religion justified?

p4rziv4l6 Aug 2024 15:42 UTC

−35 points

11 comments1 min readLW link

Startup Roundup #2

Zvi6 Aug 2024 13:30 UTC

45 points

0 comments32 min readLW link

(thezvi.wordpress.com)

Mechanistic Anomaly Detection Research Update

Nora Belrose and David Johnston

6 Aug 2024 10:33 UTC

12 points

0 comments1 min readLW link

(blog.eleuther.ai)

Reasoning is not search—a chess example

p.b.6 Aug 2024 9:29 UTC

5 points

3 comments2 min readLW link

Broadly human level, cognitively complete AGI

p.b.6 Aug 2024 9:26 UTC

9 points

0 comments1 min readLW link

Does Evolutionary Theory Imply Genetic Tribalism?

Zero Contradictions6 Aug 2024 5:43 UTC

0 points

1 comment1 min readLW link

(thewaywardaxolotl.blogspot.com)

How I Learned To Stop Trusting Prediction Markets and Love the Arbitrage

orthonormal6 Aug 2024 2:32 UTC

201 points

33 comments3 min readLW link 3 reviews

John Schulman leaves OpenAI for Anthropic [and then left Anthropic again for Thinking Machines]

Sodium6 Aug 2024 1:23 UTC

57 points

0 comments1 min readLW link

Self-explaining SAE features

Dmitrii Kharlapenko, neverix, Neel Nanda and Arthur Conmy

5 Aug 2024 22:20 UTC

62 points

13 comments10 min readLW link

Value fragility and AI takeover

Joe Carlsmith5 Aug 2024 21:28 UTC

76 points

5 comments30 min readLW link

Madrid—ACX Meetups Everywhere Fall 2024

Pablo Villalobos5 Aug 2024 18:36 UTC

4 points

0 comments1 min readLW link

LLMs stifle creativity, eliminate opportunities for serendipitous discovery and disrupt intergenerational transfer of wisdom

Ghdz5 Aug 2024 18:27 UTC

7 points

3 comments7 min readLW link

Circular Reasoning

abramdemski5 Aug 2024 18:10 UTC

113 points

44 comments8 min readLW link 2 reviews

Fear of centralized power vs. fear of misaligned AGI: Vitalik Buterin on 80,000 Hours

Seth Herd5 Aug 2024 15:38 UTC

70 points

22 comments5 min readLW link

AI Safety at the Frontier: Paper Highlights, July ’24

gasteigerjo5 Aug 2024 13:00 UTC

8 points

0 comments7 min readLW link

(aisafetyfrontier.substack.com)

Game Theory and Society

Zero Contradictions5 Aug 2024 4:27 UTC

4 points

0 comments1 min readLW link

(thewaywardaxolotl.blogspot.com)

Near-mode thinking on AI

Olli Järviniemi4 Aug 2024 20:47 UTC

126 points

10 comments5 min readLW link 1 review

Watermarks: Signing, Branding, and Boobytrapping

Shankar Sivarajan4 Aug 2024 20:41 UTC

4 points

0 comments1 min readLW link

Modelling Social Exchange: A Systematised Method to Judge Friendship Quality

Wynn Walker4 Aug 2024 18:49 UTC

6 points

0 comments5 min readLW link