All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All Jan Feb Mar Apr May Jun Jul Aug SepOctNov Dec

All 1 2 3 4 5 6 7 8 9 10 111213 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Clarifying Alignment Fundamentals Through the Lens of Ontology

Ben IhrigOct 7, 2024, 8:57 PM

12 points

4 comments24 min readLW link

Ethics on Cosmic Scale, Outer Space Treaty, Directed Panspermia, Forwards-Contamination, Technology Assessment, Planetary Protection, and Fermi’s Paradox

MrFantasticOct 7, 2024, 8:56 PM

−12 points

0 comments1 min readLW link

Domain-specific SAEs

jacob_droriOct 7, 2024, 8:15 PM

28 points

2 comments5 min readLW link

Metaculus Is Open Source

ChristianWilliamsOct 7, 2024, 7:55 PM

13 points

0 comments LW link

(www.metaculus.com)

Research update: Towards a Law of Iterated Expectations for Heuristic Estimators

Eric NeymanOct 7, 2024, 7:29 PM

87 points

2 comments22 min readLW link

AI Model Registries: A Foundational Tool for AI Governance

Elliot Mckernon, Deric Cheng and Gwyn Glasser

Oct 7, 2024, 7:27 PM

20 points

1 comment4 min readLW link

(www.convergenceanalysis.org)

Evaluating the truth of statements in a world of ambiguous language.

HastingsOct 7, 2024, 6:08 PM

48 points

19 comments2 min readLW link

Advice for journalists

Nathan YoungOct 7, 2024, 4:46 PM

101 points

53 comments9 min readLW link

(nathanpmyoung.substack.com)

Time Efficient Resistance Training

romeostevensitOct 7, 2024, 3:15 PM

42 points

12 comments3 min readLW link

A Narrow Path: a plan to deal with AI extinction risk

Andrea_Miotti, davekasten and Tolga

Oct 7, 2024, 1:02 PM

73 points

12 comments2 min readLW link

(www.narrowpath.co)

Toy Models of Feature Absorption in SAEs

chanind, hrdkbhatnagar, TomasD and Joseph Bloom

Oct 7, 2024, 9:56 AM

49 points

8 comments10 min readLW link

An argument that consequentialism is incomplete

cousin_itOct 7, 2024, 9:45 AM

35 points

27 comments1 min readLW link

An X-Ray is Worth 15 Features: Sparse Autoencoders for Interpretable Radiology Report Generation

hugofry, Ahmed Abdulaal, NMontanaBrown and a-ijishakin

Oct 7, 2024, 8:53 AM

40 points

1 comment5 min readLW link

(arxiv.org)

Compelling Villains and Coherent Values

Cole WyethOct 6, 2024, 7:53 PM

42 points

4 comments4 min readLW link

To Be Born in a Bag

Niko_McCartyOct 6, 2024, 5:21 PM

19 points

1 comment16 min readLW link

(www.asimov.press)

Whimsical Thoughts on an AI Notepad: Exploring Non-Invasive Neural Integration via Viral and Stem Cell Pathways

Pug stankyOct 6, 2024, 4:37 PM

1 point

2 comments4 min readLW link

Why I’m not a Bayesian

Richard_NgoOct 6, 2024, 3:22 PM

215 points

104 comments10 min readLW link

(www.mindthefuture.info)

European Progress Conference

Martin SustrikOct 6, 2024, 11:10 AM

27 points

11 comments3 min readLW link

(250bpm.substack.com)

Open Thread Fall 2024

habrykaOct 5, 2024, 10:28 PM

44 points

193 comments1 min readLW link

[Question] Seeking AI Alignment Tutor/Advisor: $100–150/hr

MrThinkOct 5, 2024, 9:28 PM

28 points

3 comments2 min readLW link

Interpretability of SAE Features Representing Check in ChessGPT

Jonathan KutasovOct 5, 2024, 8:43 PM

27 points

2 comments8 min readLW link

2024 Election Forecasting Contest

mike20731Oct 5, 2024, 8:43 PM

4 points

0 comments1 min readLW link

(www.mikesblog.net)

5 ways to improve CoT faithfulness

Caleb BiddulphOct 5, 2024, 8:17 PM

44 points

40 comments6 min readLW link

Consciousness As Recursive Reflections

Gunnar_ZarnckeOct 5, 2024, 8:00 PM

7 points

2 comments1 min readLW link

(www.astralcodexten.com)

What is it like to be psychologically healthy? Podcast ft. DaystarEld

Chipmonk and DaystarEld

Oct 5, 2024, 7:14 PM

31 points

8 comments2 min readLW link

(chrislakin.blog)

Musings on Text Data Wall (Oct 2024)

Vladimir_NesovOct 5, 2024, 7:00 PM

40 points

2 comments5 min readLW link

Apply to the Cooperative AI PhD Fellowship by October 14th!

Lewis HammondOct 5, 2024, 12:41 PM

23 points

0 comments LW link

AISafety.info: What is the “natural abstractions hypothesis”?

AlgonOct 5, 2024, 12:31 PM

38 points

2 comments3 min readLW link

(aisafety.info)

ARENA4.0 Capstone: Hyperparameter tuning for MELBO + replication on Llama-3.2-1b-Instruct

25Hour and submarat

Oct 5, 2024, 11:30 AM

34 points

2 comments8 min readLW link

Exploring SAE features in LLMs with definition trees and token lists

mwatkinsOct 4, 2024, 10:15 PM

38 points

5 comments6 min readLW link

AXRP Episode 37 - Jaime Sevilla on Forecasting AI

DanielFilanOct 4, 2024, 9:00 PM

21 points

3 comments56 min readLW link

[Question] Seeking Solutions for Aggregating Classifier Outputs

Saeid GhafouriOct 4, 2024, 5:39 PM

−1 points

0 comments1 min readLW link

Amoeba roles in tech

Sindhu ShivaprasadOct 4, 2024, 5:25 PM

12 points

0 comments4 min readLW link

LASR Labs Spring 2025 applications are open!

Erin Robertson, charlie_griffin, joehardie and Justin Olive

Oct 4, 2024, 1:44 PM

38 points

0 comments4 min readLW link

(Maybe) A Bag of Heuristics is All There Is & A Bag of Heuristics is All You Need

SodiumOct 3, 2024, 7:11 PM

35 points

17 comments17 min readLW link

Does natural selection favor AIs over humans?

cdkgOct 3, 2024, 6:47 PM

20 points

1 comment1 min readLW link

(link.springer.com)

What Hayek Taught Us About Nature

Ground Truth DataOct 3, 2024, 6:20 PM

−1 points

6 comments2 min readLW link

Biasing VLM Response with Visual Stimuli

Jaehyuk LimOct 3, 2024, 6:04 PM

5 points

0 comments8 min readLW link

AI #84: Better Than a Podcast

ZviOct 3, 2024, 3:00 PM

56 points

7 comments52 min readLW link

(thezvi.wordpress.com)

[Question] If I have some money, whom should I donate it to in order to reduce expected P(doom) the most?

KvmanThinkingOct 3, 2024, 11:31 AM

35 points

37 comments1 min readLW link

Shutting down all competing AI projects might not buy a lot of time due to Internal Time Pressure

ThomasCederborgOct 3, 2024, 12:01 AM

12 points

7 comments12 min readLW link

“25 Lessons from 25 Years of Marriage” by honorary rationalist Ferrett Steinmetz

CronoDASOct 2, 2024, 10:42 PM

24 points

2 comments1 min readLW link

(theferrett.substack.com)

MIT FutureTech are hiring for a Head of Operations role

peterslatteryOct 2, 2024, 5:11 PM

8 points

0 comments4 min readLW link

Can AI Quantity beat AI Quality?

Gianluca CalcagniOct 2, 2024, 3:21 PM

2 points

0 comments5 min readLW link

[Intuitive self-models] 3. The Homunculus

Steven ByrnesOct 2, 2024, 3:20 PM

78 points

38 comments25 min readLW link

AI Safety University Organizing: Early Takeaways from Thirteen Groups

agucovaOct 2, 2024, 3:14 PM

26 points

0 comments LW link

Three main arguments that AI will save humans and one meta-argument

avturchinOct 2, 2024, 11:39 AM

8 points

8 comments2 min readLW link

Should we abstain from voting? (In nondeterministic elections)

B JacobsOct 2, 2024, 10:07 AM

5 points

6 comments4 min readLW link

(bobjacobs.substack.com)

AI Safety at the Frontier: Paper Highlights, September ’24

gasteigerjo2 Oct 2024 9:49 UTC

13 points

0 comments7 min readLW link

(aisafetyfrontier.substack.com)

Self-Help Corner: Loop Detection

adamShimi2 Oct 2024 8:33 UTC

88 points

6 comments2 min readLW link

(formethods.substack.com)