
GPT

Tag · Last edit: 9 Aug 2020 22:51 UTC by habryka

GPT (Generative Pretrained Transformer) is a family of large transformer-based language models created by OpenAI. Their ability to generate remarkably human-like text makes them relevant to discussions of AGI.

Collection of GPT-3 results

Kaj_Sotala · 18 Jul 2020 20:04 UTC
83 points
24 comments · 1 min read · LW link
(twitter.com)

[Question] To what extent is GPT-3 capable of reasoning?

TurnTrout · 20 Jul 2020 17:10 UTC
63 points
74 comments · 16 min read · LW link

GPT-3: a disappointing paper

nostalgebraist · 29 May 2020 19:06 UTC
55 points
37 comments · 8 min read · LW link

$1000 bounty for OpenAI to show whether GPT3 was “deliberately” pretending to be stupider than it is

jacobjacob · 21 Jul 2020 18:42 UTC
51 points
40 comments · 2 min read · LW link
(twitter.com)

Does GPT-2 Understand Anything?

Douglas Summers-Stay · 2 Jan 2020 17:09 UTC
38 points
23 comments · 5 min read · LW link

Two Small Experiments on GPT-2

jimrandomh · 21 Feb 2019 2:59 UTC
56 points
28 comments · 1 min read · LW link

‘This Waifu Does Not Exist’: 100,000 StyleGAN & GPT-2 samples

gwern · 1 Mar 2019 4:29 UTC
39 points
6 comments · 1 min read · LW link
(www.thiswaifudoesnotexist.net)

345M version GPT-2 released

lifelonglearner · 5 May 2019 2:49 UTC
38 points
0 comments · 1 min read · LW link
(openai.com)

GPT-3 Fiction Samples

gwern · 25 Jun 2020 16:12 UTC
61 points
18 comments · 1 min read · LW link
(www.gwern.net)

[Question] How “honest” is GPT-3?

abramdemski · 8 Jul 2020 19:38 UTC
72 points
18 comments · 5 min read · LW link

[Question] How well can the GPT architecture solve the parity task?

FactorialCode · 11 Jul 2020 19:02 UTC
18 points
3 comments · 1 min read · LW link

Alignment As A Bottleneck To Usefulness Of GPT-3

johnswentworth · 21 Jul 2020 20:02 UTC
93 points
57 comments · 3 min read · LW link

Replicating the replication crisis with GPT-3?

skybrian · 22 Jul 2020 21:20 UTC
30 points
9 comments · 1 min read · LW link

Can you get AGI from a Transformer?

steve2152 · 23 Jul 2020 15:27 UTC
67 points
16 comments · 11 min read · LW link

OpenGPT-2: We Replicated GPT-2 Because You Can Too

avturchin · 23 Aug 2019 11:32 UTC
20 points
0 comments · 1 min read · LW link
(medium.com)

Developmental Stages of GPTs

orthonormal · 26 Jul 2020 22:03 UTC
120 points
71 comments · 7 min read · LW link

Writing with GPT-3

Jacobian · 24 Jul 2020 15:22 UTC
42 points
0 comments · 4 min read · LW link

Are we in an AI overhang?

Andy Jones · 27 Jul 2020 12:48 UTC
224 points
87 comments · 4 min read · LW link

[Question] How will internet forums like LW be able to defend against GPT-style spam?

ChristianKl · 28 Jul 2020 20:12 UTC
14 points
18 comments · 1 min read · LW link

The Hacker Learns to Trust

Ben Pace · 22 Jun 2019 0:27 UTC
83 points
18 comments · 8 min read · LW link
(medium.com)

You Can Probably Amplify GPT3 Directly

Zachary Robertson · 26 Jul 2020 21:58 UTC
35 points
14 comments · 6 min read · LW link

[Question] Will OpenAI’s work unintentionally increase existential risks related to AI?

adamShimi · 11 Aug 2020 18:16 UTC
56 points
48 comments · 1 min read · LW link

GPT-3, belief, and consistency

skybrian · 16 Aug 2020 23:12 UTC
19 points
7 comments · 2 min read · LW link

Humans Who Are Not Concentrating Are Not General Intelligences

sarahconstantin · 25 Feb 2019 20:40 UTC
137 points
29 comments · 6 min read · LW link
(srconstantin.wordpress.com)

[AN #102]: Meta learning by GPT-3, and a list of full proposals for AI alignment

rohinmshah · 3 Jun 2020 17:20 UTC
38 points
6 comments · 10 min read · LW link
(mailchi.mp)

Analyzing the Problem GPT-3 is Trying to Solve

adamShimi · 6 Aug 2020 21:58 UTC
16 points
2 comments · 4 min read · LW link

Using GPT-N to Solve Interpretability of Neural Networks: A Research Agenda

3 Sep 2020 18:27 UTC
60 points
11 comments · 2 min read · LW link

Predictions for GPT-N

hippke · 29 Jul 2020 1:16 UTC
37 points
31 comments · 1 min read · LW link

is gpt-3 few-shot ready for real applications?

nostalgebraist · 3 Aug 2020 19:50 UTC
31 points
5 comments · 9 min read · LW link
(nostalgebraist.tumblr.com)

[April Fools] User GPT2 is Banned

jimrandomh · 2 Apr 2019 6:00 UTC
64 points
20 comments · 1 min read · LW link

GPT-2: 6-Month Follow-Up

lifelonglearner · 21 Aug 2019 5:06 UTC
31 points
1 comment · 1 min read · LW link

What’s Your Cognitive Algorithm?

Raemon · 18 Jun 2020 22:16 UTC
69 points
23 comments · 13 min read · LW link

Image GPT

Daniel Kokotajlo · 18 Jun 2020 11:41 UTC
30 points
27 comments · 1 min read · LW link
(openai.com)

The “AI Dungeons” Dragon Model is heavily path dependent (testing GPT-3 on ethics)

Rafael Harth · 21 Jul 2020 12:14 UTC
47 points
9 comments · 6 min read · LW link

GPT-3 Gems

TurnTrout · 23 Jul 2020 0:46 UTC
26 points
7 comments · 41 min read · LW link

[Question] Question on GPT-3 Excel Demo

Zhitao Hou · 22 Jun 2020 20:31 UTC
0 points
4 comments · 1 min read · LW link

[Question] Are we certain that gpt-2 and similar algorithms are not self-aware?

Ozyrus · 11 Jul 2019 8:37 UTC
1 point
9 comments · 1 min read · LW link

[Question] What should we expect from GPT-3?

avturchin · 21 Mar 2019 14:28 UTC
23 points
2 comments · 1 min read · LW link

[Question] List of public predictions of what GPT-X can or can’t do?

Daniel Kokotajlo · 14 Jun 2020 14:25 UTC
20 points
9 comments · 1 min read · LW link

GPT-3: A Summary

leogao · 2 Jun 2020 18:14 UTC
20 points
0 comments · 1 min read · LW link
(leogao.dev)

[Question] If AI is based on GPT, how to ensure its safety?

avturchin · 18 Jun 2020 20:33 UTC
20 points
11 comments · 1 min read · LW link

OpenAI announces GPT-3

gwern · 29 May 2020 1:49 UTC
65 points
23 comments · 1 min read · LW link
(arxiv.org)

[updated] how does gpt2’s training corpus capture internet discussion? not well

nostalgebraist · 27 Jul 2020 22:30 UTC
24 points
3 comments · 2 min read · LW link
(nostalgebraist.tumblr.com)

[Question] Probability that other architectures will scale as well as Transformers?

Daniel Kokotajlo · 28 Jul 2020 19:36 UTC
22 points
4 comments · 1 min read · LW link

[Question] To what extent are the scaling properties of Transformer networks exceptional?

abramdemski · 28 Jul 2020 20:06 UTC
29 points
1 comment · 1 min read · LW link

Engaging Seriously with Short Timelines

deluks917 · 29 Jul 2020 19:21 UTC
43 points
23 comments · 3 min read · LW link

Sufficiently Advanced Language Models Can Do Reinforcement Learning

Zachary Robertson · 2 Aug 2020 15:32 UTC
23 points
7 comments · 7 min read · LW link

[Question] How hard would it be to change GPT-3 in a way that allows audio?

ChristianKl · 28 Aug 2020 14:42 UTC
8 points
5 comments · 1 min read · LW link

interpreting GPT: the logit lens

nostalgebraist · 31 Aug 2020 2:47 UTC
104 points
27 comments · 10 min read · LW link

Why GPT wants to mesa-optimize & how we might change this

John_Maxwell · 19 Sep 2020 13:48 UTC
54 points
23 comments · 9 min read · LW link

[Question] Where is human level on text prediction? (GPTs task)

Daniel Kokotajlo · 20 Sep 2020 9:00 UTC
20 points
17 comments · 1 min read · LW link

Hiring engineers and researchers to help align GPT-3

paulfchristiano · 1 Oct 2020 18:54 UTC
221 points
13 comments · 3 min read · LW link

[Question] If GPT-6 is human-level AGI but costs $200 per page of output, what would happen?

Daniel Kokotajlo · 9 Oct 2020 12:00 UTC
28 points
30 comments · 1 min read · LW link

[Question] What specific dangers arise when asking GPT-N to write an Alignment Forum post?

Matthew Barnett · 28 Jul 2020 2:56 UTC
41 points
14 comments · 1 min read · LW link

Structured Tasks for Language Models

Zachary Robertson · 29 Jul 2020 14:17 UTC
5 points
0 comments · 1 min read · LW link

[Question] Is the work on AI alignment relevant to GPT?

Richard_Kennaway · 30 Jul 2020 12:23 UTC
12 points
5 comments · 1 min read · LW link

Agentic Language Model Memes

FactorialCode · 1 Aug 2020 18:03 UTC
11 points
1 comment · 2 min read · LW link

[Question] What are the most important papers/post/resources to read to understand more of GPT-3?

adamShimi · 2 Aug 2020 20:53 UTC
25 points
4 comments · 1 min read · LW link

human psycholinguists: a critical appraisal

nostalgebraist · 31 Dec 2019 0:20 UTC
170 points
53 comments · 16 min read · LW link
(nostalgebraist.tumblr.com)

[Question] 10/50/90% chance of GPT-N Transformative AI?

human_generated_text · 9 Aug 2020 0:10 UTC
25 points
8 comments · 1 min read · LW link

May Gwern.net newsletter (w/GPT-3 commentary)

gwern · 2 Jun 2020 15:40 UTC
32 points
7 comments · 1 min read · LW link
(www.gwern.net)

A trick for Safer GPT-N

Razied · 23 Aug 2020 0:39 UTC
7 points
1 comment · 2 min read · LW link

From GPT to AGI

ChristianKl · 31 Aug 2020 13:28 UTC
5 points
7 comments · 1 min read · LW link

on “learning to summarize”

nostalgebraist · 12 Sep 2020 3:20 UTC
22 points
13 comments · 8 min read · LW link
(nostalgebraist.tumblr.com)

The Colliding Exponentials of AI

VermillionStuka · 14 Oct 2020 23:31 UTC
16 points
10 comments · 5 min read · LW link

[Question] GPT-3 + GAN

stick109 · 17 Oct 2020 7:58 UTC
4 points
2 comments · 1 min read · LW link