AGI & Consciousness—Joscha Bach

Rahul Chand
Oct 8, 2024, 10:51 PM
1 point
1 comment
10 min read
LW link

Video and transcript of presentation on Otherness and control in the age of AGI

Joe Carlsmith
Oct 8, 2024, 10:30 PM
35 points
1 comment
27 min read
LW link

From seeded complexity to consciousness—yes, it’s all the same.

eschatail
Oct 8, 2024, 9:31 PM
−23 points
0 comments
2 min read
LW link

Limits of safe and aligned AI

Shivam
Oct 8, 2024, 9:30 PM
2 points
0 comments
4 min read
LW link

[Question] What constitutes an infohazard?

K1r4d4rk.v1
Oct 8, 2024, 9:29 PM
−4 points
8 comments
1 min read
LW link

[Question] What makes one a “rationalist”?

mathyouf
Oct 8, 2024, 8:25 PM
7 points
5 comments
3 min read
LW link

[Intuitive self-models] 4. Trance

Steven Byrnes
Oct 8, 2024, 1:30 PM
82 points
7 comments
24 min read
LW link

Schelling game evaluations for AI control

Olli Järviniemi
Oct 8, 2024, 12:01 PM
71 points
5 comments
11 min read
LW link

Thinking About a Pedalboard

jefftk
Oct 8, 2024, 11:50 AM
9 points
2 comments
1 min read
LW link
(www.jefftk.com)

Overview of strong human intelligence amplification methods

TsviBT
Oct 8, 2024, 8:37 AM
280 points
144 comments
10 min read
LW link

Near-death experiences

Declan Molony
Oct 8, 2024, 6:34 AM
3 points
1 comment
2 min read
LW link

The unreasonable effectiveness of plasmid sequencing as a service

Abhishaike Mahajan
Oct 8, 2024, 2:02 AM
23 points
2 comments
13 min read
LW link
(www.owlposting.com)

There is a globe in your LLM

jacob_drori
Oct 8, 2024, 12:43 AM
89 points
4 comments
1 min read
LW link

MATS AI Safety Strategy Curriculum v2

Oct 7, 2024, 10:44 PM
43 points
6 comments
13 min read
LW link

2025 Color Trends

sarahconstantin
Oct 7, 2024, 9:20 PM
40 points
7 comments
6 min read
LW link
(sarahconstantin.substack.com)

Clarifying Alignment Fundamentals Through the Lens of Ontology

Ben Ihrig
Oct 7, 2024, 8:57 PM
12 points
4 comments
24 min read
LW link

Ethics on Cosmic Scale, Outer Space Treaty, Directed Panspermia, Forwards-Contamination, Technology Assessment, Planetary Protection, and Fermi’s Paradox

MrFantastic
Oct 7, 2024, 8:56 PM
−12 points
0 comments
1 min read
LW link

Domain-specific SAEs

jacob_drori
Oct 7, 2024, 8:15 PM
28 points
2 comments
5 min read
LW link

Metaculus Is Open Source

ChristianWilliams
Oct 7, 2024, 7:55 PM
13 points
0 comments
LW link
(www.metaculus.com)

Research update: Towards a Law of Iterated Expectations for Heuristic Estimators

Eric Neyman
Oct 7, 2024, 7:29 PM
87 points
2 comments
22 min read
LW link

AI Model Registries: A Foundational Tool for AI Governance

Oct 7, 2024, 7:27 PM
20 points
1 comment
4 min read
LW link
(www.convergenceanalysis.org)

Evaluating the truth of statements in a world of ambiguous language.

Hastings
Oct 7, 2024, 6:08 PM
48 points
19 comments
2 min read
LW link

Advice for journalists

Nathan Young
Oct 7, 2024, 4:46 PM
101 points
53 comments
9 min read
LW link
(nathanpmyoung.substack.com)

Time Efficient Resistance Training

romeostevensit
Oct 7, 2024, 3:15 PM
42 points
12 comments
3 min read
LW link

A Narrow Path: a plan to deal with AI extinction risk

Oct 7, 2024, 1:02 PM
73 points
12 comments
2 min read
LW link
(www.narrowpath.co)

Toy Models of Feature Absorption in SAEs

Oct 7, 2024, 9:56 AM
49 points
8 comments
10 min read
LW link

An argument that consequentialism is incomplete

cousin_it
Oct 7, 2024, 9:45 AM
35 points
27 comments
1 min read
LW link

An X-Ray is Worth 15 Features: Sparse Autoencoders for Interpretable Radiology Report Generation

Oct 7, 2024, 8:53 AM
40 points
1 comment
5 min read
LW link
(arxiv.org)

Compelling Villains and Coherent Values

Cole Wyeth
Oct 6, 2024, 7:53 PM
42 points
4 comments
4 min read
LW link

To Be Born in a Bag

Niko_McCarty
Oct 6, 2024, 5:21 PM
19 points
1 comment
16 min read
LW link
(www.asimov.press)

Whimsical Thoughts on an AI Notepad: Exploring Non-Invasive Neural Integration via Viral and Stem Cell Pathways

Pug stanky
Oct 6, 2024, 4:37 PM
1 point
2 comments
4 min read
LW link

Why I’m not a Bayesian

Richard_Ngo
Oct 6, 2024, 3:22 PM
212 points
104 comments
10 min read
LW link
(www.mindthefuture.info)

European Progress Conference

Martin Sustrik
Oct 6, 2024, 11:10 AM
27 points
11 comments
3 min read
LW link
(250bpm.substack.com)

Open Thread Fall 2024

habryka
Oct 5, 2024, 10:28 PM
44 points
193 comments
1 min read
LW link

[Question] Seeking AI Alignment Tutor/Advisor: $100–150/hr

MrThink
Oct 5, 2024, 9:28 PM
26 points
3 comments
2 min read
LW link

Interpretability of SAE Features Representing Check in ChessGPT

Jonathan Kutasov
Oct 5, 2024, 8:43 PM
27 points
2 comments
8 min read
LW link

2024 Election Forecasting Contest

mike20731
Oct 5, 2024, 8:43 PM
4 points
0 comments
1 min read
LW link
(www.mikesblog.net)

5 ways to improve CoT faithfulness

Caleb Biddulph
Oct 5, 2024, 8:17 PM
44 points
40 comments
6 min read
LW link

Consciousness As Recursive Reflections

Gunnar_Zarncke
Oct 5, 2024, 8:00 PM
7 points
2 comments
1 min read
LW link
(www.astralcodexten.com)

What is it like to be psychologically healthy? Podcast ft. DaystarEld

Oct 5, 2024, 7:14 PM
31 points
8 comments
2 min read
LW link
(chrislakin.blog)

Musings on Text Data Wall (Oct 2024)

Vladimir_Nesov
Oct 5, 2024, 7:00 PM
40 points
2 comments
5 min read
LW link

Apply to the Cooperative AI PhD Fellowship by October 14th!

Lewis Hammond
Oct 5, 2024, 12:41 PM
23 points
0 comments
LW link

AISafety.info: What is the “natural abstractions hypothesis”?

Algon
Oct 5, 2024, 12:31 PM
38 points
2 comments
3 min read
LW link
(aisafety.info)

ARENA4.0 Capstone: Hyperparameter tuning for MELBO + replication on Llama-3.2-1b-Instruct

Oct 5, 2024, 11:30 AM
34 points
2 comments
8 min read
LW link

Exploring SAE features in LLMs with definition trees and token lists

mwatkins
Oct 4, 2024, 10:15 PM
38 points
5 comments
6 min read
LW link

AXRP Episode 37 - Jaime Sevilla on Forecasting AI

DanielFilan
Oct 4, 2024, 9:00 PM
21 points
3 comments
56 min read
LW link

[Question] Seeking Solutions for Aggregating Classifier Outputs

Saeid Ghafouri
Oct 4, 2024, 5:39 PM
−1 points
0 comments
1 min read
LW link

Amoeba roles in tech

Sindhu Shivaprasad
Oct 4, 2024, 5:25 PM
12 points
0 comments
4 min read
LW link

LASR Labs Spring 2025 applications are open!

Oct 4, 2024, 1:44 PM
38 points
0 comments
4 min read
LW link

(Maybe) A Bag of Heuristics is All There Is & A Bag of Heuristics is All You Need

Sodium
Oct 3, 2024, 7:11 PM
35 points
17 comments
17 min read
LW link