All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 252627 28 29 30 31

What does a Gambler’s Verity world look like?

ErioirE25 Jul 2024 22:03 UTC

7 points

6 comments1 min readLW link

Pacing Outside the Box: RNNs Learn to Plan in Sokoban

Adrià Garriga-alonso, taufeeque, AdamGleave and ChengCheng

25 Jul 2024 22:00 UTC

59 points

8 comments2 min readLW link

(arxiv.org)

Sex, Death, and Complexity

Zero Contradictions25 Jul 2024 21:22 UTC

0 points

0 comments1 min readLW link

(thewaywardaxolotl.blogspot.com)

Does robustness improve with scale?

ChengCheng, niki.h, Ian McKenzie, Oskar Hollinsworth, Tom Tseng and AdamGleave

25 Jul 2024 20:55 UTC

14 points

0 comments1 min readLW link

(far.ai)

Organisation for Program Equilibrium reading group

Smaug12325 Jul 2024 19:11 UTC

11 points

14 comments1 min readLW link

In Text

Valerii Kremnev25 Jul 2024 18:22 UTC

−3 points

0 comments5 min readLW link

“AI achieves silver-medal standard solving International Mathematical Olympiad problems”

gjm25 Jul 2024 15:58 UTC

133 points

38 comments2 min readLW link

(deepmind.google)

[Talk transcript] What “structure” is and why it matters

Alex_Altair25 Jul 2024 15:49 UTC

23 points

0 comments5 min readLW link

(www.youtube.com)

AI #74: GPT-4o Mini Me and Llama 3

Zvi25 Jul 2024 13:50 UTC

30 points

6 comments36 min readLW link

(thezvi.wordpress.com)

AI Constitutions are a tool to reduce societal scale risk

Sammy Martin25 Jul 2024 11:18 UTC

30 points

2 comments18 min readLW link

Determining the power of investors over Frontier AI Labs is strategically important to reduce x-risk

Lucie Philippon25 Jul 2024 1:12 UTC

18 points

7 comments2 min readLW link

FLI is hiring across Comms and Ops

beisenpress25 Jul 2024 0:06 UTC

1 point

0 comments1 min readLW link

A framework for thinking about AI power-seeking

Joe Carlsmith24 Jul 2024 22:41 UTC

70 points

15 comments16 min readLW link

Llama Llama-3-405B?

Zvi24 Jul 2024 19:40 UTC

51 points

9 comments30 min readLW link

(thezvi.wordpress.com)

AI Safety Memes Wiki

plex and Vishakha

24 Jul 2024 18:53 UTC

37 points

2 comments1 min readLW link

(aisafety.info)

Research Discussion on PSCA with Claude Sonnet 3.5

Robert Kralisch24 Jul 2024 16:53 UTC

−2 points

0 comments25 min readLW link

Reading More Each Day: A Simple $35 Tool

aysajan24 Jul 2024 13:54 UTC

30 points

2 comments1 min readLW link

You should go to ML conferences

Jan_Kulveit24 Jul 2024 11:47 UTC

112 points

13 comments4 min readLW link

The last era of human mistakes

owencb24 Jul 2024 9:58 UTC

34 points

2 comments7 min readLW link

(strangecities.substack.com)

Longevity: A critical look at “Loss of epigenetic information as a cause of mammalian aging”

Anna Crow24 Jul 2024 1:40 UTC

16 points

2 comments10 min readLW link

The Cancer Resolution?

PeterMcCluskey24 Jul 2024 0:25 UTC

34 points

25 comments6 min readLW link

(bayesianinvestor.com)

Positive visions for AI

L Rudolf L and Florence Hinder

23 Jul 2024 20:15 UTC

27 points

4 comments18 min readLW link

(www.florencehinder.com)

How reasonable is taking extinction risk?

FVelde23 Jul 2024 18:05 UTC

2 points

4 comments4 min readLW link

Unlearning via RMU is mostly shallow

Andy Arditi and bilalchughtai

23 Jul 2024 16:07 UTC

58 points

4 comments6 min readLW link

Monthly Roundup #20: July 2024

Zvi23 Jul 2024 12:50 UTC

33 points

9 comments38 min readLW link

(thezvi.wordpress.com)

Confusing the metric for the meaning: Perhaps correlated attributes are “natural”

NickyP23 Jul 2024 12:43 UTC

33 points

3 comments4 min readLW link

My covid-related beliefs and questions

Severin T. Seehrich23 Jul 2024 3:27 UTC

10 points

3 comments1 min readLW link

[Question] Is there a Schelling point for group house room listings?

NoSignalNoNoise23 Jul 2024 3:03 UTC

4 points

0 comments1 min readLW link

Room Available in Boston Group House

NoSignalNoNoise23 Jul 2024 2:55 UTC

15 points

1 comment1 min readLW link

D&D.Sci Scenario Index

aphyer and abstractapplic

23 Jul 2024 2:00 UTC

78 points

1 comment3 min readLW link 1 review

How to avoid death by AI.

Krantz23 Jul 2024 1:59 UTC

−3 points

13 comments2 min readLW link

Ransomware Payments Should Require a Sin Tax

Brian Bien22 Jul 2024 21:16 UTC

20 points

10 comments2 min readLW link

The Elusive Root Cause of Schizophrenia—Thesis Introduction Only

kareempforbes22 Jul 2024 20:24 UTC

−8 points

0 comments2 min readLW link

Is Chinese AGI a valid concern for the USA?

sammyboiz22 Jul 2024 20:21 UTC

0 points

2 comments9 min readLW link

Trying to understand Hanson’s Cultural Drift argument

Kemp22 Jul 2024 20:20 UTC

21 points

6 comments2 min readLW link

Efficient Dictionary Learning with Switch Sparse Autoencoders

Anish Mudide22 Jul 2024 18:45 UTC

118 points

20 comments12 min readLW link

Analyzing DeepMind’s Probabilistic Methods for Evaluating Agent Capabilities

Axel Højmark, fidgetsinner, Arjun Panickssery, Marius Hobbhahn and Jérémy Scheurer

22 Jul 2024 16:17 UTC

69 points

0 comments16 min readLW link

The Garden of Eden

Alexander Turok22 Jul 2024 16:07 UTC

23 points

2 comments9 min readLW link

Caring about excellence

owencb22 Jul 2024 14:24 UTC

47 points

5 comments6 min readLW link 1 review

Tim Dillon’s fake business altered my perspective more significantly than any other video I have watched in the last 24 months

Stuart Johnson22 Jul 2024 12:54 UTC

6 points

0 comments1 min readLW link

(youtu.be)

On the CrowdStrike Incident

Zvi22 Jul 2024 12:40 UTC

75 points

14 comments17 min readLW link

(thezvi.wordpress.com)

Auto-Enhance: Developing a meta-benchmark to measure LLM agents’ ability to improve other agents

Sam F. Brown, BasilLabib, Codruta (Coco) Lugoj and Sai Sasank Y

22 Jul 2024 12:33 UTC

20 points

0 comments14 min readLW link

What does “the universe is quantum” actually mean?

Tahp22 Jul 2024 11:52 UTC

2 points

0 comments14 min readLW link

Initial Experiments Using SAEs to Help Detect AI Generated Text

Aaron_Scher22 Jul 2024 5:16 UTC

18 points

1 comment14 min readLW link

Categories of leadership on technical teams

benkuhn22 Jul 2024 4:50 UTC

43 points

0 comments8 min readLW link

(www.benkuhn.net)

An experiment on hidden cognition

Olli Järviniemi22 Jul 2024 3:26 UTC

25 points

2 comments7 min readLW link

OpenAI Boycott Revisit

Jake Dennie-Lu22 Jul 2024 1:44 UTC

17 points

2 comments2 min readLW link

Coalitional agency

Richard_Ngo22 Jul 2024 0:09 UTC

61 points

6 comments6 min readLW link

The AI Driver’s Licence—A Policy Proposal

Joshua W and Tessa Malan

21 Jul 2024 20:38 UTC

0 points

1 comment19 min readLW link

Demography and Destiny

Zero Contradictions21 Jul 2024 20:34 UTC

6 points

11 comments1 min readLW link

(thewaywardaxolotl.blogspot.com)