MATS AI Safety Strategy Curriculum v2

7 Oct 2024 22:44 UTC
43 points
6 comments · 13 min read · LW link

2025 Color Trends

sarahconstantin · 7 Oct 2024 21:20 UTC
40 points
7 comments · 6 min read · LW link
(sarahconstantin.substack.com)

Clarifying Alignment Fundamentals Through the Lens of Ontology

Ben Ihrig · 7 Oct 2024 20:57 UTC
12 points
4 comments · 24 min read · LW link

Ethics on Cosmic Scale, Outer Space Treaty, Directed Panspermia, Forwards-Contamination, Technology Assessment, Planetary Protection, and Fermi’s Paradox

MrFantastic · 7 Oct 2024 20:56 UTC
−12 points
0 comments · 1 min read · LW link

Domain-specific SAEs

jacob_drori · 7 Oct 2024 20:15 UTC
28 points
2 comments · 5 min read · LW link

Metaculus Is Open Source

ChristianWilliams · 7 Oct 2024 19:55 UTC
13 points
0 comments · 1 min read · LW link
(www.metaculus.com)

Research update: Towards a Law of Iterated Expectations for Heuristic Estimators

Eric Neyman · 7 Oct 2024 19:29 UTC
87 points
2 comments · 22 min read · LW link

AI Model Registries: A Foundational Tool for AI Governance

7 Oct 2024 19:27 UTC
20 points
1 comment · 4 min read · LW link
(www.convergenceanalysis.org)

Evaluating the truth of statements in a world of ambiguous language.

Hastings · 7 Oct 2024 18:08 UTC
48 points
19 comments · 2 min read · LW link

Advice for journalists

Nathan Young · 7 Oct 2024 16:46 UTC
101 points
53 comments · 9 min read · LW link
(nathanpmyoung.substack.com)

Time Efficient Resistance Training

romeostevensit · 7 Oct 2024 15:15 UTC
42 points
12 comments · 3 min read · LW link

A Narrow Path: a plan to deal with AI extinction risk

7 Oct 2024 13:02 UTC
80 points
12 comments · 2 min read · LW link
(www.narrowpath.co)

Toy Models of Feature Absorption in SAEs

7 Oct 2024 9:56 UTC
49 points
8 comments · 10 min read · LW link

An argument that consequentialism is incomplete

cousin_it · 7 Oct 2024 9:45 UTC
35 points
27 comments · 1 min read · LW link

An X-Ray is Worth 15 Features: Sparse Autoencoders for Interpretable Radiology Report Generation

7 Oct 2024 8:53 UTC
40 points
1 comment · 5 min read · LW link
(arxiv.org)

Compelling Villains and Coherent Values

Cole Wyeth · 6 Oct 2024 19:53 UTC
42 points
4 comments · 4 min read · LW link

To Be Born in a Bag

Niko_McCarty · 6 Oct 2024 17:21 UTC
19 points
1 comment · 16 min read · LW link
(www.asimov.press)

Whimsical Thoughts on an AI Notepad: Exploring Non-Invasive Neural Integration via Viral and Stem Cell Pathways

Pug stanky · 6 Oct 2024 16:37 UTC
1 point
2 comments · 4 min read · LW link

Why I’m not a Bayesian

Richard_Ngo · 6 Oct 2024 15:22 UTC
221 points
104 comments · 10 min read · LW link
(www.mindthefuture.info)

European Progress Conference

Martin Sustrik · 6 Oct 2024 11:10 UTC
27 points
11 comments · 3 min read · LW link
(250bpm.substack.com)

Open Thread Fall 2024

habryka · 5 Oct 2024 22:28 UTC
44 points
194 comments · 1 min read · LW link

[Question] Seeking AI Alignment Tutor/Advisor: $100–150/hr

MrThink · 5 Oct 2024 21:28 UTC
28 points
3 comments · 2 min read · LW link

Interpretability of SAE Features Representing Check in ChessGPT

Jonathan Kutasov · 5 Oct 2024 20:43 UTC
27 points
2 comments · 8 min read · LW link

2024 Election Forecasting Contest

mike20731 · 5 Oct 2024 20:43 UTC
4 points
0 comments · 1 min read · LW link
(www.mikesblog.net)

5 ways to improve CoT faithfulness

Caleb Biddulph · 5 Oct 2024 20:17 UTC
46 points
40 comments · 6 min read · LW link

Consciousness As Recursive Reflections

Gunnar_Zarncke · 5 Oct 2024 20:00 UTC
7 points
2 comments · 1 min read · LW link
(www.astralcodexten.com)

Musings on Text Data Wall (Oct 2024)

Vladimir_Nesov · 5 Oct 2024 19:00 UTC
41 points
2 comments · 5 min read · LW link

Apply to the Cooperative AI PhD Fellowship by October 14th!

Lewis Hammond · 5 Oct 2024 12:41 UTC
23 points
0 comments · 1 min read · LW link

AISafety.info: What is the “natural abstractions hypothesis”?

Algon · 5 Oct 2024 12:31 UTC
38 points
2 comments · 3 min read · LW link
(aisafety.info)

ARENA4.0 Capstone: Hyperparameter tuning for MELBO + replication on Llama-3.2-1b-Instruct

5 Oct 2024 11:30 UTC
34 points
2 comments · 8 min read · LW link

Exploring SAE features in LLMs with definition trees and token lists

mwatkins · 4 Oct 2024 22:15 UTC
46 points
5 comments · 6 min read · LW link

AXRP Episode 37 - Jaime Sevilla on Forecasting AI

DanielFilan · 4 Oct 2024 21:00 UTC
21 points
3 comments · 56 min read · LW link

[Question] Seeking Solutions for Aggregating Classifier Outputs

Saeid Ghafouri · 4 Oct 2024 17:39 UTC
−1 points
0 comments · 1 min read · LW link

Amoeba roles in tech

Sindhu Shivaprasad · 4 Oct 2024 17:25 UTC
12 points
0 comments · 4 min read · LW link

LASR Labs Spring 2025 applications are open!

4 Oct 2024 13:44 UTC
38 points
0 comments · 4 min read · LW link

(Maybe) A Bag of Heuristics is All There Is & A Bag of Heuristics is All You Need

Sodium · 3 Oct 2024 19:11 UTC
35 points
17 comments · 17 min read · LW link

Does natural selection favor AIs over humans?

cdkg · 3 Oct 2024 18:47 UTC
20 points
1 comment · 1 min read · LW link
(link.springer.com)

What Hayek Taught Us About Nature

Ground Truth Data · 3 Oct 2024 18:20 UTC
−1 points
6 comments · 2 min read · LW link

Biasing VLM Response with Visual Stimuli

Jaehyuk Lim · 3 Oct 2024 18:04 UTC
5 points
0 comments · 8 min read · LW link

AI #84: Better Than a Podcast

Zvi · 3 Oct 2024 15:00 UTC
56 points
7 comments · 52 min read · LW link
(thezvi.wordpress.com)

[Question] If I have some money, whom should I donate it to in order to reduce expected P(doom) the most?

KvmanThinking · 3 Oct 2024 11:31 UTC
35 points
37 comments · 1 min read · LW link

Shutting down all competing AI projects might not buy a lot of time due to Internal Time Pressure

ThomasCederborg · 3 Oct 2024 0:01 UTC
12 points
7 comments · 12 min read · LW link

“25 Lessons from 25 Years of Marriage” by honorary rationalist Ferrett Steinmetz

CronoDAS · 2 Oct 2024 22:42 UTC
24 points
2 comments · 1 min read · LW link
(theferrett.substack.com)

MIT FutureTech are hiring for a Head of Operations role

peterslattery · 2 Oct 2024 17:11 UTC
8 points
0 comments · 4 min read · LW link

Can AI Quantity beat AI Quality?

Gianluca Calcagni · 2 Oct 2024 15:21 UTC
2 points
0 comments · 5 min read · LW link

[Intuitive self-models] 3. The Homunculus

Steven Byrnes · 2 Oct 2024 15:20 UTC
78 points
39 comments · 25 min read · LW link

AI Safety University Organizing: Early Takeaways from Thirteen Groups

agucova · 2 Oct 2024 15:14 UTC
26 points
0 comments · 9 min read · LW link

Three main arguments that AI will save humans and one meta-argument

avturchin · 2 Oct 2024 11:39 UTC
9 points
8 comments · 2 min read · LW link

Should we abstain from voting? (In nondeterministic elections)

B Jacobs · 2 Oct 2024 10:07 UTC
5 points
8 comments · 4 min read · LW link
(bobjacobs.substack.com)

AI Safety at the Frontier: Paper Highlights, September ’24

gasteigerjo · 2 Oct 2024 9:49 UTC
13 points
0 comments · 7 min read · LW link
(aisafetyfrontier.substack.com)