Im­pos­tor syn­drome: how to cure it with spread­sheets and med­i­ta­tion

KatWoodsFeb 9, 2023, 9:04 PM
31 points
2 comments19 min readLW link

Con­di­tion­ing Pre­dic­tive Models: De­ploy­ment strategy

Feb 9, 2023, 8:59 PM
28 points
0 comments10 min readLW link

Make Con­flict of In­ter­est Poli­cies Public

jefftkFeb 9, 2023, 7:30 PM
33 points
7 comments2 min readLW link
(www.jefftk.com)

Cu­rated blind auc­tion pre­dic­tion mar­kets and a rep­u­ta­tion sys­tem as an al­ter­na­tive to ed­i­to­rial re­view in news pub­li­ca­tion.

ciaran Feb 9, 2023, 6:48 PM
2 points
0 comments2 min readLW link

Tools for find­ing in­for­ma­tion on the internet

RomanHaukssonFeb 9, 2023, 5:05 PM
79 points
11 comments2 min readLW link
(roman.computer)

Covid 2/​9/​23: In­terferon λ

ZviFeb 9, 2023, 4:50 PM
48 points
8 comments12 min readLW link
(thezvi.wordpress.com)

EIS II: What is “In­ter­pretabil­ity”?

scasperFeb 9, 2023, 4:48 PM
28 points
6 comments4 min readLW link

The Eng­ineer’s In­ter­pretabil­ity Se­quence (EIS) I: Intro

scasperFeb 9, 2023, 4:28 PM
46 points
24 comments3 min readLW link

[Question] Do the Safety Prop­er­ties of Pow­er­ful AI Sys­tems Need to be Ad­ver­sar­i­ally Ro­bust? Why?

DragonGodFeb 9, 2023, 1:36 PM
22 points
42 comments2 min readLW link

Which ML skills are use­ful for find­ing a new AIS re­search agenda?

Yonatan CaleFeb 9, 2023, 1:09 PM
16 points
1 comment1 min readLW link

When To Stop

Alok SinghFeb 9, 2023, 9:10 AM
31 points
5 comments1 min readLW link
(alok.github.io)

The Per­va­sive Illu­sion of See­ing the Com­plete World

ShmiFeb 9, 2023, 6:47 AM
39 points
1 comment2 min readLW link

Reli­gion is Good, Actually

Gordon Seidoh WorleyFeb 9, 2023, 6:34 AM
−1 points
39 comments4 min readLW link

Us­ing PICT against Pas­taGPT Jailbreaking

Quentin FEUILLADE--MONTIXIFeb 9, 2023, 4:30 AM
26 points
0 comments9 min readLW link

Notes on the Math­e­mat­ics of LLM Architectures

carboniferous_umbraculum Feb 9, 2023, 1:45 AM
12 points
2 comments1 min readLW link
(drive.google.com)

On Devel­op­ing a Math­e­mat­i­cal The­ory of In­ter­pretabil­ity

carboniferous_umbraculum Feb 9, 2023, 1:45 AM
64 points
8 comments6 min readLW link

Ano­ma­lous to­kens re­veal the origi­nal iden­tities of In­struct models

Feb 9, 2023, 1:30 AM
140 points
16 comments9 min readLW link
(generative.ink)

[Question] How would you use video gamey tech to help with AI safety?

porbyFeb 9, 2023, 12:20 AM
9 points
5 comments1 min readLW link

A (EtA: quick) note on ter­minol­ogy: AI Align­ment != AI x-safety

David Scott Krueger (formerly: capybaralet)Feb 8, 2023, 10:33 PM
46 points
20 comments1 min readLW link

GPT-175bee

Feb 8, 2023, 6:58 PM
122 points
14 comments1 min readLW link

Ei­genKarma: trust at scale

Henrik KarlssonFeb 8, 2023, 6:52 PM
186 points
52 comments5 min readLW link

Con­di­tion­ing Pre­dic­tive Models: In­ter­ac­tions with other approaches

Feb 8, 2023, 6:19 PM
32 points
2 comments11 min readLW link

Wanted: Tech­ni­cal an­i­ma­tor and/​or front-end de­vel­oper for in­ter­ac­tive di­a­grams of invention

jasoncrawfordFeb 8, 2023, 5:14 PM
30 points
3 comments1 min readLW link
(rootsofprogress.org)

A multi-dis­ci­plinary view on AI safety research

Roman LeventovFeb 8, 2023, 4:50 PM
46 points
4 comments26 min readLW link

Com­mu­nity build­ing: Les­sons from ten years of fa­cil­i­ta­tion experience

Severin T. SeehrichFeb 8, 2023, 4:26 PM
17 points
0 commentsLW link

Progress links and tweets, 2023-02-08

jasoncrawfordFeb 8, 2023, 3:52 PM
10 points
0 comments1 min readLW link
(rootsofprogress.org)

A Par­tic­u­lar Equilibrium

AlgonFeb 8, 2023, 3:16 PM
13 points
0 comments2 min readLW link
(algon-33.github.io)

Self-Aware­ness (and pos­si­ble mode col­lapse around it) in ChatGPT

YitzFeb 8, 2023, 9:57 AM
18 points
2 comments2 min readLW link

Drugs are Some­times Good, Actually

Gordon Seidoh WorleyFeb 8, 2023, 2:24 AM
13 points
8 comments4 min readLW link

House Covid In­fec­tion Retrospective

jefftkFeb 8, 2023, 2:20 AM
25 points
1 comment2 min readLW link
(www.jefftk.com)

Not­ing an er­ror in Inad­e­quate Equilibria

Matthew BarnettFeb 8, 2023, 1:33 AM
366 points
60 comments2 min readLW link2 reviews

Liv­ing No­mad­i­cally: My 80/​20 Guide

KatWoodsFeb 8, 2023, 1:31 AM
37 points
18 comments1 min readLW link

OpenAI/​Microsoft an­nounce “next gen­er­a­tion lan­guage model” in­te­grated into Bing/​Edge

LawrenceCFeb 7, 2023, 8:38 PM
79 points
4 comments1 min readLW link
(blogs.microsoft.com)

How evals might (or might not) pre­vent catas­trophic risks from AI

Orpheus16Feb 7, 2023, 8:16 PM
45 points
0 comments9 min readLW link

Con­di­tion­ing Pre­dic­tive Models: Mak­ing in­ner al­ign­ment as easy as possible

Feb 7, 2023, 8:04 PM
27 points
2 comments19 min readLW link

On The Cur­rent Sta­tus Of AI Dating

Nikita BrancatisanoFeb 7, 2023, 8:00 PM
52 points
8 comments6 min readLW link

Fram­ing AI strategy

Zach Stein-PerlmanFeb 7, 2023, 7:20 PM
33 points
1 comment18 min readLW link
(aiimpacts.org)

Re­view of AI Align­ment Progress

PeterMcCluskeyFeb 7, 2023, 6:57 PM
72 points
32 comments7 min readLW link
(bayesianinvestor.com)

The Eco­nomics of Contracts

Edward P. KöningsFeb 7, 2023, 1:52 PM
21 points
3 comments8 min readLW link
(edwardknings.substack.com)

Two very differ­ent ex­pe­riences with ChatGPT

SherrinfordFeb 7, 2023, 1:09 PM
38 points
15 comments5 min readLW link

[About Me] Cin­era’s Home Page

DragonGodFeb 7, 2023, 12:56 PM
30 points
2 comments9 min readLW link

Stuff I Recom­mend You Use

Arjun PanicksseryFeb 7, 2023, 12:18 PM
17 points
2 comments2 min readLW link
(arjunpanickssery.substack.com)

AXRP: Store, Pa­treon, Video

DanielFilanFeb 7, 2023, 4:50 AM
12 points
0 comments1 min readLW link

Duck­bill Masks Are Great

jefftkFeb 7, 2023, 3:00 AM
22 points
14 comments1 min readLW link
(www.jefftk.com)

EA & LW Fo­rum Weekly Sum­mary (30th Jan − 5th Feb 2023)

Zoe WilliamsFeb 7, 2023, 2:13 AM
3 points
3 commentsLW link

[ASoT] Policy Tra­jec­tory Visualization

Ulisse MiniFeb 7, 2023, 12:13 AM
9 points
2 comments1 min readLW link

English is a Ter­rible Pro­gram­ming Lan­guage—And other rea­sons AI won’t dis­place programmers

dawsoneliasenFeb 6, 2023, 10:12 PM
26 points
8 comments8 min readLW link
(orbistertius.substack.com)

Afri­can Wild Dogs Vote By Sneez­ing—Can AI Help Us Do Bet­ter?

Augmented AssemblyFeb 6, 2023, 9:09 PM
10 points
6 comments4 min readLW link

In defense of the MBTI

ZZZZZZFeb 6, 2023, 9:08 PM
−14 points
22 comments4 min readLW link

Early situ­a­tional aware­ness and its im­pli­ca­tions, a story

Jacob PfauFeb 6, 2023, 8:45 PM
29 points
6 comments3 min readLW link