aog

Karma: 1,710

AI grantmaker at Longview Philanthropy and AI DPhil student at Oxford

Digital sentience funding opportunities: Support for applied work and research

aog and zdgroff

29 May 2025 15:22 UTC

21 points

0 comments4 min readLW link

Research Priorities for Hardware-Enabled Mechanisms (HEMs)

aog30 Apr 2025 17:43 UTC

18 points

3 comments15 min readLW link

(www.longview.org)

aog’s Shortform

aog19 Apr 2025 22:07 UTC

6 points

22 comments1 min readLW link

Benchmarking LLM Agents on Kaggle Competitions

aog22 Mar 2024 13:09 UTC

15 points

4 comments5 min readLW link

Adversarial Robustness Could Help Prevent Catastrophic Misuse

aog11 Dec 2023 19:12 UTC

30 points

18 comments9 min readLW link

Unsupervised Methods for Concept Discovery in AlphaZero

aog26 Oct 2023 19:05 UTC

9 points

0 comments1 min readLW link

(arxiv.org)

MLSN: #10 Adversarial Attacks Against Language and Vision Models, Improving LLM Honesty, and Tracing the Influence of LLM Training Data

13 Sep 2023 18:03 UTC

15 points

1 comment5 min readLW link

(newsletter.mlsafety.org)

Hoodwinked: Evaluating Deception Capabilities in Large Language Models

aog25 Aug 2023 19:39 UTC

25 points

3 comments3 min readLW link

Learning Transformer Programs [Linkpost]

aog8 Jun 2023 0:16 UTC

7 points

0 comments1 min readLW link

(arxiv.org)

Full Automation is Unlikely and Unnecessary for Explosive Growth

aog31 May 2023 21:55 UTC

28 points

3 comments5 min readLW link

Model-driven feedback could amplify alignment failures

aog30 Jan 2023 0:00 UTC

21 points

1 comment2 min readLW link

Analysis: US restricts GPU sales to China

aog7 Oct 2022 18:38 UTC

102 points

58 comments5 min readLW link

Git Re-Basin: Merging Models modulo Permutation Symmetries [Linkpost]

aog14 Sep 2022 8:55 UTC

21 points

0 comments2 min readLW link

(arxiv.org)

Argument against 20% GDP growth from AI within 10 years [Linkpost]

aog12 Sep 2022 4:08 UTC

59 points

20 comments5 min readLW link

(twitter.com)

ML Model Attribution Challenge [Linkpost]

aog30 Aug 2022 19:34 UTC

11 points

0 comments1 min readLW link

(mlmac.io)

Emergent Abilities of Large Language Models [Linkpost]

aog10 Aug 2022 18:02 UTC

25 points

2 comments1 min readLW link

(arxiv.org)

Key Papers in Language Model Safety

aog20 Jun 2022 15:00 UTC

40 points

1 comment22 min readLW link

Yudkowsky Contra Christiano on AI Takeoff Speeds [Linkpost]

aog5 Apr 2022 2:09 UTC

18 points

0 comments11 min readLW link

[Link] Did AlphaStar just click faster?

aog28 Jan 2019 20:23 UTC

4 points

14 comments1 min readLW link