mic

Karma: 583

[Question] How to quantify uncertainty about a probability estimate?

mic · 6 May 2020 12:06 UTC
4 points
2 comments · 1 min read · LW link

Ask AI companies about what they are doing for AI safety?

mic · 9 Mar 2022 15:14 UTC
51 points
0 comments · 2 min read · LW link

AI governance student hackathon on Saturday, April 23: register now!

mic · 12 Apr 2022 4:48 UTC
14 points
0 comments · 1 min read · LW link

AI safety university groups: a promising opportunity to reduce existential risk

mic · 1 Jul 2022 3:59 UTC
14 points
0 comments · 11 min read · LW link

OpenAI introduces function calling for GPT-4

20 Jun 2023 1:58 UTC
24 points
3 comments · 4 min read · LW link
(openai.com)

EU’s AI ambitions at risk as US pushes to water down international treaty (linkpost)

mic · 31 Jul 2023 0:34 UTC
10 points
0 comments · 4 min read · LW link
(www.euractiv.com)

[Question] Does LessWrong allow exempting posts from being scraped by GPTBot?

mic · 9 Aug 2023 13:02 UTC
29 points
3 comments · 1 min read · LW link

Supervised Program for Alignment Research (SPAR) at UC Berkeley: Spring 2023 summary

19 Aug 2023 2:27 UTC
20 points
2 comments · 6 min read · LW link

Ideas for improving epistemics in AI safety outreach

mic · 21 Aug 2023 19:55 UTC
64 points
6 comments · 3 min read · LW link

SPAR seeks advisors and students for AI safety projects (Second Wave)

mic · 14 Sep 2023 23:09 UTC
21 points
0 comments · 1 min read · LW link

[Linkpost] Mark Zuckerberg confronted about Meta’s Llama 2 AI’s ability to give users detailed guidance on making anthrax - Business Insider

mic · 26 Sep 2023 12:05 UTC
18 points
11 comments · 2 min read · LW link
(www.businessinsider.com)

The Gradient – The Artificiality of Alignment

mic · 8 Oct 2023 4:06 UTC
12 points
1 comment · 5 min read · LW link
(thegradient.pub)

Solving alignment isn’t enough for a flourishing future

mic · 2 Feb 2024 18:23 UTC
27 points
0 comments · 1 min read · LW link
(papers.ssrn.com)

Enhancing biosecurity with language models: defining research directions

mic · 26 Mar 2024 12:30 UTC
12 points
0 comments · 1 min read · LW link
(papers.ssrn.com)