Briefly Extending Differential Optimization to Distributions

J Bostock 10 Mar 2024 20:41 UTC

4 points

0 comments 2 min read LW link

Evolution did a surprising good job at aligning humans...to social status

Eli Tyre 10 Mar 2024 19:34 UTC

62 points

46 comments 1 min read LW link 1 review

Pausing AI is Positive Expected Value

Liron 10 Mar 2024 17:10 UTC

9 points

2 comments 3 min read LW link

(twitter.com)

W2SG: Introduction

Maria Kapros 10 Mar 2024 16:25 UTC

2 points

2 comments 10 min read LW link

An Optimistic Solution to the Fermi Paradox

Glenn Clayton 10 Mar 2024 14:39 UTC

4 points

6 comments 13 min read LW link

Counterfactual Civilization Simulation Version −1.0 aka my application to Johannes Mayer’s SPAR project

Morphism 10 Mar 2024 10:10 UTC

1 point

0 comments 14 min read LW link

Notes from a Prompt Factory

Richard_Ngo 10 Mar 2024 5:13 UTC

114 points

19 comments 9 min read LW link

(www.narrativeark.xyz)

Investigating Basin Volume with XOR Networks

CatGoddess 10 Mar 2024 1:35 UTC

10 points

0 comments 5 min read LW link

[Linkpost] MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

Bogdan Ionut Cirstea 10 Mar 2024 1:30 UTC

10 points

0 comments 1 min read LW link

(openreview.net)

0th Person and 1st Person Logic

Adele Lopez 10 Mar 2024 0:56 UTC

63 points

29 comments 6 min read LW link

Completion Estimates

Commander Zander 9 Mar 2024 22:56 UTC

7 points

2 comments 3 min read LW link

Semi-Simplicial Types, Part I: Motivation and History

astradiol 9 Mar 2024 22:07 UTC

20 points

3 comments 10 min read LW link

Distinctions when Discussing Utility Functions

ozziegooen 9 Mar 2024 20:14 UTC

24 points

7 comments 8 min read LW link

What is progress?

jasoncrawford 9 Mar 2024 16:28 UTC

10 points

4 comments 6 min read LW link

(rootsofprogress.org)

Fifteen Lawsuits against OpenAI

Remmelt 9 Mar 2024 12:22 UTC

27 points

4 comments 1 min read LW link

Cambridge ACX/SSC monthly meetup (location changed to Fort St George!)

hamishtodd1 9 Mar 2024 11:10 UTC

2 points

0 comments 1 min read LW link

MA E-ZPass Without a Car?

jefftk 9 Mar 2024 2:40 UTC

15 points

2 comments 1 min read LW link

(www.jefftk.com)

Closeness To the Issue (Part 5 of “The Sense Of Physical Necessity”)

LoganStrohl 9 Mar 2024 0:36 UTC

36 points

1 comment 15 min read LW link 1 review

Exploring the Evolution and Migration of Different Layer Embedding in LLMs

Ruixuan Huang 8 Mar 2024 15:01 UTC

6 points

0 comments 8 min read LW link

[Question] When and why did ‘training’ become ‘pretraining’?

beren 8 Mar 2024 14:29 UTC

16 points

6 comments 1 min read LW link

A T-o-M test: ‘popcorn’ or ‘chocolate’

MiguelDev 8 Mar 2024 4:24 UTC

20 points

13 comments 1 min read LW link

Scenario Forecasting Workshop: Materials and Learnings

elifland and Charlie Griffin

8 Mar 2024 2:30 UTC

50 points

3 comments 2 min read LW link

Forecasting future gains due to post-training enhancements

elifland, Joel Becker and simeon_c

8 Mar 2024 2:11 UTC

31 points

2 comments 1 min read LW link

(docs.google.com)

Do LLMs sometime simulate something akin to a dream?

Nezek 8 Mar 2024 1:25 UTC

8 points

4 comments 1 min read LW link

Community norms poll (2 mins)

Nathan Young 7 Mar 2024 21:45 UTC

12 points

1 comment 1 min read LW link

Announcing Convergence Analysis: An Institute for AI Scenario & Governance Research

David_Kristoffersson and Deric Cheng

7 Mar 2024 21:37 UTC

23 points

1 comment 4 min read LW link

Woods’ new preprint on object permanence

Steven Byrnes 7 Mar 2024 21:29 UTC

58 points

1 comment 6 min read LW link

MATS AI Safety Strategy Curriculum

Ronny Fernandez and Ryan Kidd

7 Mar 2024 19:59 UTC

74 points

2 comments 16 min read LW link

Political Biases in LLMs: Literature Review & Current Uses of AI in Elections

Yashvardhan Sharma, Robayet Hossain and Ariana Gamarra

7 Mar 2024 19:17 UTC

6 points

0 comments 6 min read LW link

Evidential Correlations are Subjective, and it might be a problem

Martín Soto 7 Mar 2024 18:37 UTC

32 points

6 comments 14 min read LW link

AI Safety 101 : Capabilities—Human Level AI, What? How? and When?

markov and Charbel-Raphaël

7 Mar 2024 17:29 UTC

46 points

8 comments 54 min read LW link

A Review of Weak to Strong Generalization [AI Safety Camp]

sevdeawesome 7 Mar 2024 17:16 UTC

14 points

0 comments 9 min read LW link

AISN #32: Measuring and Reducing Hazardous Knowledge in LLMs Plus, Forecasting the Future with LLMs, and Regulatory Markets

Corin Katzke and Dan H

7 Mar 2024 16:39 UTC

8 points

0 comments 8 min read LW link

(newsletter.safe.ai)

AI #54: Clauding Along

Zvi 7 Mar 2024 16:00 UTC

45 points

11 comments 51 min read LW link

(thezvi.wordpress.com)

Being Interested in Other People

Jonathan Moregård 7 Mar 2024 10:13 UTC

14 points

1 comment 3 min read LW link

(youbutbetter.substack.com)

Talking to Congress: Can constituents contacting their legislator influence policy?

T_W 7 Mar 2024 9:24 UTC

14 points

0 comments 19 min read LW link

Explaining the AI Alignment Problem to Tibetan Buddhist Monks

Paul Colognese 7 Mar 2024 9:00 UTC

20 points

3 comments 6 min read LW link

What if Alignment is Not Enough?

WillPetillo 7 Mar 2024 8:10 UTC

17 points

46 comments 9 min read LW link

Sparks of AGI prompts on GPT2XL and its variant, RLLMv3

MiguelDev 7 Mar 2024 6:33 UTC

4 points

0 comments 4 min read LW link

An AI, a box, and a threat

jwfiredragon 7 Mar 2024 6:15 UTC

10 points

0 comments 6 min read LW link

Mud and Despair (Part 4 of “The Sense Of Physical Necessity”)

LoganStrohl 7 Mar 2024 0:14 UTC

38 points

0 comments 2 min read LW link

introduction to thermal conductivity and noise management

bhauth 6 Mar 2024 23:14 UTC

31 points

1 comment 4 min read LW link

(www.bhauth.com)

Essaying Other Plans

Screwtape 6 Mar 2024 22:59 UTC

29 points

4 comments 7 min read LW link

Invest in ACX Grants projects!

Saul Munn 6 Mar 2024 20:27 UTC

23 points

1 comment 3 min read LW link

Vote on Anthropic Topics to Discuss

Ben Pace 6 Mar 2024 19:43 UTC

75 points

55 comments 1 min read LW link

Simple Kelly betting in prediction markets

jessicata 6 Mar 2024 18:59 UTC

39 points

3 comments 3 min read LW link

(unstablerontology.substack.com)

On Claude 3.0

Zvi 6 Mar 2024 18:50 UTC

76 points

5 comments 31 min read LW link

(thezvi.wordpress.com)

[Question] Why correlation, though?

numpyNaN 6 Mar 2024 16:53 UTC

23 points

7 comments 1 min read LW link

Using axis lines for good or evil

dynomight 6 Mar 2024 14:47 UTC

153 points

39 comments 4 min read LW link

(dynomight.net)

Let’s build definitely-not-conscious AI

lemonhope 6 Mar 2024 7:50 UTC

4 points

18 comments 1 min read LW link