A Bit For You

Ronak_MehtaMar 24, 2024, 10:18 PM
0 points
0 comments2 min readLW link
(ronakrm.github.io)

All About Con­cave and Con­vex Agents

mako yassMar 24, 2024, 9:37 PM
64 points
24 comments8 min readLW link

Do not delete your mis­al­igned AGI.

mako yassMar 24, 2024, 9:37 PM
62 points
13 comments3 min readLW link

[Question] Define “Agent” (Embed­ded)

ApolloniaMar 24, 2024, 8:14 PM
10 points
1 comment1 min readLW link

[Question] Could LLMs Help Gen­er­ate New Con­cepts in Hu­man Lan­guage?

Pekka LampeltoMar 24, 2024, 8:13 PM
10 points
4 comments2 min readLW link

Wittgen­stein and the Pri­vate Lan­guage Argument

TMFOWMar 24, 2024, 8:06 PM
4 points
0 comments14 min readLW link
(tmfow.substack.com)

Self-Play By Analogy

Amica TerraMar 24, 2024, 8:06 PM
−2 points
2 comments7 min readLW link

Can quan­tised au­toen­coders find and in­ter­pret cir­cuits in lan­guage mod­els?

charlieoneillMar 24, 2024, 8:05 PM
30 points
4 comments24 min readLW link

Man­dolin Harp Sen­sor Placement

jefftkMar 24, 2024, 6:40 PM
11 points
0 comments1 min readLW link
(www.jefftk.com)

AI Align­ment and the Clas­si­cal Hu­man­ist Tradition

PeteJMar 24, 2024, 1:37 PM
−1 points
4 comments2 min readLW link

UNGA Re­s­olu­tion on AI: 5 Key Take­aways Look­ing to Fu­ture Policy

HerambMar 24, 2024, 12:23 PM
3 points
0 comments3 min readLW link
(forum.effectivealtruism.org)

[Question] Are (Mo­tor)sports like F1 a good thing to cal­ibrate es­ti­mates against?

CstineSublimeMar 24, 2024, 9:07 AM
4 points
2 comments1 min readLW link

Nu­clear Quan­tum Im­mor­tal­ity Hack­ing

NezekMar 23, 2024, 10:08 PM
−3 points
2 comments2 min readLW link

As Many Ideas

ScrewtapeMar 23, 2024, 6:55 PM
7 points
0 comments1 min readLW link

My De­tailed Notes & Com­men­tary from Sec­u­lar Solstice

Jeffrey HeningerMar 23, 2024, 6:48 PM
35 points
16 comments13 min readLW link

Gen­eral Thoughts on Sec­u­lar Solstice

Jeffrey HeningerMar 23, 2024, 6:48 PM
101 points
60 comments8 min readLW link

How to make food/​wa­ter test­ing cheaper/​more scal­able? [eg for pu­rity/​toxin test­ing]

Alex K. Chen (parrot)Mar 23, 2024, 5:28 AM
9 points
2 comments1 min readLW link

Pro­to­typ­ing Pluck Sensors

jefftkMar 23, 2024, 1:30 AM
9 points
0 comments2 min readLW link
(www.jefftk.com)

Dangers of Closed-Loop AI

Gordon Seidoh WorleyMar 22, 2024, 11:52 PM
35 points
9 comments2 min readLW link

Why The In­sects Scream

Bentham's BulldogMar 22, 2024, 7:47 PM
4 points
11 comments9 min readLW link

What does “au­to­di­dact” mean?

bhauthMar 22, 2024, 6:37 PM
22 points
19 comments1 min readLW link

[Linkpost] Vague Ver­biage in Forecasting

trevorMar 22, 2024, 6:05 PM
11 points
9 comments3 min readLW link
(goodjudgment.com)

Wolf and Rabbit

Richard HenageMar 22, 2024, 5:20 PM
14 points
4 comments1 min readLW link

AI Model Registries: A Reg­u­la­tory Review

Mar 22, 2024, 4:04 PM
9 points
0 comments6 min readLW link

Video and tran­script of pre­sen­ta­tion on Schem­ing AIs

Joe CarlsmithMar 22, 2024, 3:52 PM
32 points
1 comment32 min readLW link

Bench­mark­ing LLM Agents on Kag­gle Competitions

aogMar 22, 2024, 1:09 PM
15 points
4 comments5 min readLW link

Amer­i­can Ac­cel­er­a­tion vs Development

Maxwell TabarrokMar 22, 2024, 1:01 PM
1 point
0 comments4 min readLW link
(www.maximum-progress.com)

Trans­for­ma­tive AI and Sce­nario Plan­ning for AI X-risk

Mar 22, 2024, 9:38 AM
15 points
0 comments8 min readLW link

The Pyromaniacs

Ted SandersMar 22, 2024, 6:55 AM
4 points
1 comment2 min readLW link

Ver­nor Vinge, who coined the term “Tech­nolog­i­cal Sin­gu­lar­ity”, dies at 79

Kaj_SotalaMar 21, 2024, 10:14 PM
150 points
25 comments1 min readLW link
(arstechnica.com)

ChatGPT can learn in­di­rect control

Raymond DouglasMar 21, 2024, 9:11 PM
213 points
27 comments1 min readLW link

“Deep Learn­ing” Is Func­tion Approximation

Zack_M_DavisMar 21, 2024, 5:50 PM
98 points
28 comments10 min readLW link
(zackmdavis.net)

A Teacher vs. Every­one Else

ronak69Mar 21, 2024, 5:45 PM
41 points
8 comments2 min readLW link

Static vs Dy­namic Alignment

Gracie GreenMar 21, 2024, 5:44 PM
5 points
0 comments12 min readLW link

On green

Joe CarlsmithMar 21, 2024, 5:38 PM
269 points
35 comments31 min readLW link

Com­par­ing Align­ment to other AGI in­ter­ven­tions: Ex­ten­sions and analysis

Martín SotoMar 21, 2024, 5:30 PM
7 points
0 comments4 min readLW link

The Com­cast Problem

RamblinDashMar 21, 2024, 4:46 PM
1 point
15 comments1 min readLW link

Vi­pas­sana Med­i­ta­tion and Ac­tive In­fer­ence: A Frame­work for Un­der­stand­ing Suffer­ing and its Cessation

sturbMar 21, 2024, 12:32 PM
50 points
8 comments19 min readLW link

AI #56: Black­well That Ends Well

ZviMar 21, 2024, 12:10 PM
34 points
16 comments68 min readLW link
(thezvi.wordpress.com)

An Afford­able CO2 Monitor

Pretentious PenguinMar 21, 2024, 3:06 AM
28 points
1 comment1 min readLW link

Deep­Mind: Eval­u­at­ing Fron­tier Models for Danger­ous Capabilities

Zach Stein-PerlmanMar 21, 2024, 3:00 AM
61 points
8 comments1 min readLW link
(arxiv.org)

Where are the Con­tra Dances?

jefftkMar 21, 2024, 2:00 AM
9 points
0 comments1 min readLW link
(www.jefftk.com)

Slim overview of work one could do to make AI go bet­ter (and a grab-bag of other ca­reer con­sid­er­a­tions)

Chi NguyenMar 20, 2024, 11:17 PM
9 points
1 commentLW link

How does AI solve prob­lems?

Dom PolsinelliMar 20, 2024, 10:29 PM
2 points
0 comments7 min readLW link

What I Learned (Con­clu­sion To “The Sense Of Phys­i­cal Ne­ces­sity”)

LoganStrohlMar 20, 2024, 9:24 PM
34 points
0 comments3 min readLW link

Stage­wise Devel­op­ment in Neu­ral Networks

Mar 20, 2024, 7:54 PM
96 points
1 comment11 min readLW link

On the Glad­stone Report

ZviMar 20, 2024, 7:50 PM
64 points
11 comments40 min readLW link
(thezvi.wordpress.com)

Nat­u­ral La­tents: The Concepts

Mar 20, 2024, 6:21 PM
90 points
19 comments19 min readLW link

Com­par­ing Align­ment to other AGI in­ter­ven­tions: Ba­sic model

Martín SotoMar 20, 2024, 6:17 PM
12 points
4 comments7 min readLW link

New re­port: Safety Cases for AI

joshcMar 20, 2024, 4:45 PM
89 points
14 comments1 min readLW link
(twitter.com)