The Shard Theory of Human Values
Quintin Pope
14 Jul 2022 1:36 UTC

Written by Quintin Pope, Alex Turner, Charles Foster, and Logan Smith. Card image generated by DALL-E 2.

Humans provide an untapped wealth of evidence about alignment
TurnTrout and Quintin Pope, 14 Jul 2022 2:31 UTC
156 points, 85 comments, 10 min read

Human values & biases are inaccessible to the genome
TurnTrout, 7 Jul 2022 17:29 UTC
82 points, 43 comments, 5 min read

Reward is not the optimization target
TurnTrout, 25 Jul 2022 0:03 UTC
156 points, 67 comments, 12 min read

General alignment properties
TurnTrout, 8 Aug 2022 23:40 UTC
36 points, 1 comment, 1 min read

Shard Theory: An Overview
David Udell, 11 Aug 2022 5:44 UTC
65 points, 5 comments, 10 min read