Dan Hendrycks

Karma: 568

[Linkpost] Existential Risk Analysis in Empirical Research Papers

Dan Hendrycks · 2 Jul 2022 0:09 UTC
30 points
0 comments · 1 min read · LW link
(arxiv.org)

Paper: Forecasting world events with neural nets

1 Jul 2022 19:40 UTC
22 points
3 comments · 4 min read · LW link

Open Problems in AI X-Risk [PAIS #5]

10 Jun 2022 2:08 UTC
42 points
3 comments · 35 min read · LW link

[MLSN #4]: Many New Interpretability Papers, Virtual Logit Matching, Rationalization Helps Robustness

Dan Hendrycks · 3 Jun 2022 1:20 UTC
14 points
0 comments · 4 min read · LW link

Perform Tractable Research While Avoiding Capabilities Externalities [Pragmatic AI Safety #4]

30 May 2022 20:25 UTC
33 points
3 comments · 25 min read · LW link

Complex Systems for AI Safety [Pragmatic AI Safety #3]

24 May 2022 0:00 UTC
31 points
0 comments · 21 min read · LW link

Actionable-guidance and roadmap recommendations for the NIST AI Risk Management Framework

17 May 2022 15:26 UTC
24 points
0 comments · 3 min read · LW link

A Bird’s Eye View of the ML Field [Pragmatic AI Safety #2]

9 May 2022 17:18 UTC
111 points
2 comments · 35 min read · LW link

Introduction to Pragmatic AI Safety [Pragmatic AI Safety #1]

9 May 2022 17:06 UTC
67 points
1 comment · 6 min read · LW link

Introducing the ML Safety Scholars Program

4 May 2022 16:01 UTC
64 points
2 comments · 3 min read · LW link

[$20K in Prizes] AI Safety Arguments Competition

26 Apr 2022 16:13 UTC
70 points
540 comments · 3 min read · LW link

[MLSN #3]: NeurIPS Safety Paper Roundup

Dan Hendrycks · 8 Mar 2022 15:17 UTC
42 points
0 comments · 4 min read · LW link