Com­pel­ling Villains and Co­her­ent Values

Cole WyethOct 6, 2024, 7:53 PM
42 points
4 comments4 min readLW link

To Be Born in a Bag

Niko_McCartyOct 6, 2024, 5:21 PM
19 points
1 comment16 min readLW link
(www.asimov.press)

Whim­si­cal Thoughts on an AI Notepad: Ex­plor­ing Non-In­va­sive Neu­ral In­te­gra­tion via Viral and Stem Cell Pathways

Pug stankyOct 6, 2024, 4:37 PM
1 point
2 comments4 min readLW link

Why I’m not a Bayesian

Richard_NgoOct 6, 2024, 3:22 PM
212 points
104 comments10 min readLW link
(www.mindthefuture.info)

Euro­pean Progress Conference

Martin SustrikOct 6, 2024, 11:10 AM
27 points
11 comments3 min readLW link
(250bpm.substack.com)

Open Thread Fall 2024

habrykaOct 5, 2024, 10:28 PM
44 points
193 comments1 min readLW link

[Question] Seek­ing AI Align­ment Tu­tor/​Ad­vi­sor: $100–150/​hr

MrThinkOct 5, 2024, 9:28 PM
26 points
3 comments2 min readLW link

In­ter­pretabil­ity of SAE Fea­tures Rep­re­sent­ing Check in ChessGPT

Jonathan KutasovOct 5, 2024, 8:43 PM
27 points
2 comments8 min readLW link

2024 Elec­tion Fore­cast­ing Contest

mike20731Oct 5, 2024, 8:43 PM
4 points
0 comments1 min readLW link
(www.mikesblog.net)

5 ways to im­prove CoT faithfulness

Caleb BiddulphOct 5, 2024, 8:17 PM
44 points
40 comments6 min readLW link

Con­scious­ness As Re­cur­sive Reflections

Gunnar_ZarnckeOct 5, 2024, 8:00 PM
7 points
2 comments1 min readLW link
(www.astralcodexten.com)

What is it like to be psy­cholog­i­cally healthy? Pod­cast ft. DaystarEld

Oct 5, 2024, 7:14 PM
31 points
8 comments2 min readLW link
(chrislakin.blog)

Mus­ings on Text Data Wall (Oct 2024)

Vladimir_NesovOct 5, 2024, 7:00 PM
40 points
2 comments5 min readLW link

Ap­ply to the Co­op­er­a­tive AI PhD Fel­low­ship by Oc­to­ber 14th!

Lewis HammondOct 5, 2024, 12:41 PM
23 points
0 commentsLW link

AISafety.info: What is the “nat­u­ral ab­strac­tions hy­poth­e­sis”?

AlgonOct 5, 2024, 12:31 PM
38 points
2 comments3 min readLW link
(aisafety.info)

ARENA4.0 Cap­stone: Hyper­pa­ram­e­ter tun­ing for MELBO + repli­ca­tion on Llama-3.2-1b-Instruct

Oct 5, 2024, 11:30 AM
34 points
2 comments8 min readLW link

Ex­plor­ing SAE fea­tures in LLMs with defi­ni­tion trees and to­ken lists

mwatkinsOct 4, 2024, 10:15 PM
38 points
5 comments6 min readLW link

AXRP Epi­sode 37 - Jaime Sevilla on Fore­cast­ing AI

DanielFilanOct 4, 2024, 9:00 PM
21 points
3 comments56 min readLW link

[Question] Seek­ing Solu­tions for Ag­gre­gat­ing Clas­sifier Outputs

Saeid GhafouriOct 4, 2024, 5:39 PM
−1 points
0 comments1 min readLW link

Amoeba roles in tech

Sindhu ShivaprasadOct 4, 2024, 5:25 PM
12 points
0 comments4 min readLW link

LASR Labs Spring 2025 ap­pli­ca­tions are open!

Oct 4, 2024, 1:44 PM
38 points
0 comments4 min readLW link

(Maybe) A Bag of Heuris­tics is All There Is & A Bag of Heuris­tics is All You Need

SodiumOct 3, 2024, 7:11 PM
35 points
17 comments17 min readLW link

Does nat­u­ral se­lec­tion fa­vor AIs over hu­mans?

cdkgOct 3, 2024, 6:47 PM
20 points
1 comment1 min readLW link
(link.springer.com)

What Hayek Taught Us About Nature

Ground Truth DataOct 3, 2024, 6:20 PM
−1 points
6 comments2 min readLW link

Bi­as­ing VLM Re­sponse with Vi­sual Stimuli

Jaehyuk LimOct 3, 2024, 6:04 PM
5 points
0 comments8 min readLW link

AI #84: Bet­ter Than a Podcast

ZviOct 3, 2024, 3:00 PM
56 points
7 comments52 min readLW link
(thezvi.wordpress.com)

[Question] If I have some money, whom should I donate it to in or­der to re­duce ex­pected P(doom) the most?

KvmanThinkingOct 3, 2024, 11:31 AM
35 points
37 comments1 min readLW link

Shut­ting down all com­pet­ing AI pro­jects might not buy a lot of time due to In­ter­nal Time Pressure

ThomasCederborgOct 3, 2024, 12:01 AM
12 points
7 comments12 min readLW link

“25 Les­sons from 25 Years of Mar­riage” by hon­orary ra­tio­nal­ist Fer­rett Stein­metz

CronoDASOct 2, 2024, 10:42 PM
24 points
2 comments1 min readLW link
(theferrett.substack.com)

MIT Fu­tureTech are hiring for a Head of Oper­a­tions role

peterslatteryOct 2, 2024, 5:11 PM
8 points
0 comments4 min readLW link

Can AI Quan­tity beat AI Qual­ity?

Gianluca CalcagniOct 2, 2024, 3:21 PM
2 points
0 comments5 min readLW link

[In­tu­itive self-mod­els] 3. The Homunculus

Steven ByrnesOct 2, 2024, 3:20 PM
78 points
38 comments25 min readLW link

AI Safety Univer­sity Or­ga­niz­ing: Early Take­aways from Thir­teen Groups

agucovaOct 2, 2024, 3:14 PM
26 points
0 commentsLW link

Three main ar­gu­ments that AI will save hu­mans and one meta-argument

avturchinOct 2, 2024, 11:39 AM
8 points
8 comments2 min readLW link

Should we ab­stain from vot­ing? (In non­de­ter­minis­tic elec­tions)

B JacobsOct 2, 2024, 10:07 AM
5 points
6 comments4 min readLW link
(bobjacobs.substack.com)

AI Safety at the Fron­tier: Paper High­lights, Septem­ber ’24

gasteigerjoOct 2, 2024, 9:49 AM
13 points
0 comments7 min readLW link
(aisafetyfrontier.substack.com)

Self-Help Corner: Loop Detection

adamShimiOct 2, 2024, 8:33 AM
88 points
6 comments2 min readLW link
(formethods.substack.com)

The mur­der­ous short­cut: a toy model of in­stru­men­tal convergence

Thomas KwaOct 2, 2024, 6:48 AM
37 points
0 comments2 min readLW link

Switch­ing to a Yamaha P-121 Keyboard

jefftkOct 2, 2024, 2:20 AM
11 points
0 comments2 min readLW link
(www.jefftk.com)

Fore­sight Vi­sion Week­end 2024

Allison DuettmannOct 1, 2024, 9:59 PM
8 points
0 comments1 min readLW link

Happy simulations

FateGrinderOct 1, 2024, 9:05 PM
−5 points
0 comments2 min readLW link

Three Sub­tle Ex­am­ples of Data Leakage

abstractapplicOct 1, 2024, 8:45 PM
172 points
16 comments4 min readLW link

AI Safety Newslet­ter #42: New­som Ve­toes SB 1047 Plus, OpenAI’s o1, and AI Gover­nance Summary

Oct 1, 2024, 8:35 PM
8 points
0 comments6 min readLW link
(newsletter.safe.ai)

Retrieval Aug­mented Genesis

João Ribeiro MedeirosOct 1, 2024, 8:18 PM
6 points
0 comments29 min readLW link

Like­li­hood calcu­la­tion with duobels

Martin GerdesOct 1, 2024, 4:21 PM
4 points
0 comments6 min readLW link

Is Text Water­mark­ing a lost cause?

egor.timatkovOct 1, 2024, 4:20 PM
17 points
13 comments10 min readLW link

In­for­ma­tion dark matter

Logan KiellerOct 1, 2024, 3:05 PM
33 points
4 comments28 min readLW link
(logankieller.substack.com)

Con­ven­tional foot­notes con­sid­ered harmful

dkl9Oct 1, 2024, 2:54 PM
25 points
16 comments1 min readLW link
(dkl9.net)

New­som Ve­toes SB 1047

ZviOct 1, 2024, 12:20 PM
84 points
6 comments32 min readLW link
(thezvi.wordpress.com)

Will AI and Hu­man­ity Go to War?

Simon GoldsteinOct 1, 2024, 6:35 AM
9 points
4 comments6 min readLW link