The rise of AI in cybercrime

BobyResearcherJul 30, 2023, 8:19 PM
−15 points
1 comment2 min readLW link
(riseofAIincybercryme)

SSA vs. SIA: how fu­ture pop­u­la­tion may provide ev­i­dence for or against the foun­da­tions of poli­ti­cal liberalism

jJul 30, 2023, 8:18 PM
−6 points
10 comments55 min readLW link

Ra­tion­al­iza­tion Max­i­mizes Ex­pected Value

Kevin DorstJul 30, 2023, 8:11 PM
19 points
10 comments7 min readLW link
(kevindorst.substack.com)

Apollo Neuro Results

ElizabethJul 30, 2023, 6:40 PM
85 points
17 comments3 min readLW link
(acesounderglass.com)

Hilbert’s Triumph, Church and Tur­ing’s failure, and what it means (Post #2)

Noosphere89Jul 30, 2023, 2:33 PM
−5 points
16 comments15 min readLW link

[Question] Spe­cific Ar­gu­ments against open source LLMs?

IknownothingJul 30, 2023, 2:27 PM
4 points
2 comments1 min readLW link

So­cial­ism in large organizations

Adam ZernerJul 30, 2023, 7:25 AM
7 points
16 comments2 min readLW link

How to make real-money pre­dic­tion mar­kets on ar­bi­trary top­ics (Out­dated)

yutakaJul 30, 2023, 2:11 AM
57 points
13 comments3 min readLW link

[Question] Does de­cid­abil­ity of a the­ory im­ply com­plete­ness of the the­ory?

Noosphere89Jul 29, 2023, 11:53 PM
6 points
12 comments1 min readLW link

[Question] If I showed the EQ-SQ the­ory’s find­ings to be due to mea­sure­ment bias, would any­one change their minds about it?

tailcalledJul 29, 2023, 7:38 PM
23 points
13 comments1 min readLW link

Self-driv­ing car bets

paulfchristianoJul 29, 2023, 6:10 PM
236 points
44 comments5 min readLW link
(sideways-view.com)

The Parable of the Dag­ger—The Animation

WriterJul 29, 2023, 2:03 PM
20 points
6 comments1 min readLW link
(youtu.be)

Are Guitars Ob­so­lete?

jefftkJul 29, 2023, 1:20 PM
11 points
8 comments2 min readLW link
(www.jefftk.com)

NAMSI: A promis­ing ap­proach to alignment

Georgeo57Jul 29, 2023, 7:03 AM
−6 points
6 comments1 min readLW link

Un­der­stand­ing and Align­ing a Hu­man-like In­duc­tive Bias with Cog­ni­tive Science: a Re­view of Re­lated Liter­a­ture

Claire ShortJul 29, 2023, 6:10 AM
27 points
0 comments12 min readLW link

Why You Should Never Up­date Your Beliefs

Arjun PanicksseryJul 29, 2023, 12:27 AM
76 points
18 comments4 min readLW link1 review
(arjunpanickssery.substack.com)

Thoughts about the Mechanis­tic In­ter­pretabil­ity Challenge #2 (EIS VII #2)

RGRGRGJul 28, 2023, 8:44 PM
24 points
5 comments20 min readLW link

Be­cause of Lay­erNorm, Direc­tions in GPT-2 MLP Lay­ers are Monosemantic

ojorgensenJul 28, 2023, 7:43 PM
13 points
3 comments13 min readLW link

When can we trust model eval­u­a­tions?

evhubJul 28, 2023, 7:42 PM
166 points
10 comments10 min readLW link1 review

Yes, It’s Sub­jec­tive, But Why All The Crabs?

johnswentworthJul 28, 2023, 7:35 PM
250 points
15 comments6 min readLW link

Semaglu­tide and Muscle

5houtJul 28, 2023, 6:36 PM
15 points
14 comments5 min readLW link

Dou­ble Crux in a Box

ScrewtapeJul 28, 2023, 5:55 PM
8 points
3 comments1 min readLW link

Gra­di­ent de­scent might see the di­rec­tion of the op­ti­mum from far away

Mikhail SaminJul 28, 2023, 4:19 PM
70 points
13 comments4 min readLW link

Progress links di­gest, 2023-07-28: The deca­dent op­u­lence of mod­ern capitalism

jasoncrawfordJul 28, 2023, 2:36 PM
16 points
3 comments3 min readLW link
(rootsofprogress.org)

AI Aware­ness through In­ter­ac­tion with Blatantly Alien Models

VojtaKovarikJul 28, 2023, 8:41 AM
7 points
5 comments3 min readLW link

You don’t get to have cool flaws

Neil Jul 28, 2023, 5:37 AM
78 points
25 comments2 min readLW link3 reviews

Re­duc­ing syco­phancy and im­prov­ing hon­esty via ac­ti­va­tion steering

Nina PanicksseryJul 28, 2023, 2:46 AM
122 points
18 comments9 min readLW link1 review

Mech In­terp Puz­zle 2: Word2Vec Style Embeddings

Neel NandaJul 28, 2023, 12:50 AM
41 points
4 comments2 min readLW link

ETFE windows

bhauthJul 28, 2023, 12:46 AM
31 points
4 comments2 min readLW link
(www.bhauth.com)

A Short Memo on AI In­ter­pretabil­ity Rain­bows

scasperJul 27, 2023, 11:05 PM
18 points
0 comments2 min readLW link

Pul­ling the Rope Side­ways: Em­piri­cal Test Results

Daniel KokotajloJul 27, 2023, 10:18 PM
61 points
18 comments1 min readLW link

A $10k retroac­tive grant for VaccinateCA

Austin ChenJul 27, 2023, 6:14 PM
82 points
0 commentsLW link
(manifund.org)

Prefer­ence Ag­gre­ga­tion as Bayesian Inference

berenJul 27, 2023, 5:59 PM
14 points
1 comment1 min readLW link

AI #22: Into the Weeds

ZviJul 27, 2023, 5:40 PM
49 points
8 comments84 min readLW link
(thezvi.wordpress.com)

SSA re­jects an­thropic shadow, too

jessicataJul 27, 2023, 5:25 PM
74 points
38 comments11 min readLW link
(unstableontology.com)

[Question] What are ex­am­ples of some­one do­ing a lot of work to find the best of some­thing?

chanamessingerJul 27, 2023, 3:58 PM
29 points
16 comments1 min readLW link

AI-Plans.com 10-day Cri­tique-a-Thon

IknownothingJul 27, 2023, 11:44 AM
8 points
2 comments2 min readLW link
(manifund.org)

Pri­vacy in a Digi­tal World

FaustifyJul 27, 2023, 10:46 AM
2 points
0 comments5 min readLW link

Cul­ti­vat­ing a state of mind where new ideas are born

Henrik KarlssonJul 27, 2023, 9:16 AM
244 points
21 comments14 min readLW link2 reviews
(www.henrikkarlsson.xyz)

Par­tial Tran­script of Re­cent Se­nate Hear­ing Dis­cussing AI X-Risk

Daniel_EthJul 27, 2023, 9:16 AM
55 points
0 commentsLW link
(medium.com)

AXRP Epi­sode 24 - Su­per­al­ign­ment with Jan Leike

DanielFilanJul 27, 2023, 4:00 AM
55 points
3 comments69 min readLW link

[Question] Have you ever con­sid­ered tak­ing the ‘Tur­ing Test’ your­self?

Super AGIJul 27, 2023, 3:48 AM
2 points
6 comments1 min readLW link

AXRP Epi­sode 23 - Mechanis­tic Ano­maly De­tec­tion with Mark Xu

DanielFilan27 Jul 2023 1:50 UTC
22 points
0 comments72 min readLW link

GPT-4 can catch sub­tle cross-lan­guage trans­la­tion mistakes

Michael Tontchev27 Jul 2023 1:39 UTC
7 points
1 comment1 min readLW link

So­cial Balance through Em­brac­ing So­cial Credit

dhruvv26 Jul 2023 20:07 UTC
−39 points
9 comments3 min readLW link

Why no Ro­man In­dus­trial Revolu­tion?

jasoncrawford26 Jul 2023 19:34 UTC
62 points
30 comments3 min readLW link
(rootsofprogress.org)

Why you can’t treat de­cid­abil­ity and com­plex­ity as a con­stant (Post #1)

Noosphere8926 Jul 2023 17:54 UTC
6 points
13 comments5 min readLW link

A re­sponse to the Richards et al.’s “The Illu­sion of AI’s Ex­is­ten­tial Risk”

Harrison Fell26 Jul 2023 17:34 UTC
1 point
0 comments10 min readLW link

Meta-level ad­ver­sar­ial eval­u­a­tion of over­sight tech­niques might al­low ro­bust mea­sure­ment of their adequacy

26 Jul 2023 17:02 UTC
99 points
19 comments1 min readLW link1 review

Neuronpedia

Johnny Lin26 Jul 2023 16:29 UTC
135 points
51 comments2 min readLW link
(neuronpedia.org)