Ben Cottier

Karma: 329

At Epoch, helping to clarify when and how transformative AI capabilities will be developed.

Previously a Research Fellow on the AI Governance & Strategy team at Rethink Priorities.

Ben Cottier 22 Aug 2025 15:29 UTC
1 point
0
on: How Fast is Algorithmic Progress in AI Inference?
This is neat! I like the idea of isolating technical progress. I’m curious whether you’ve tried this analysis on more benchmarks, considering that we found significant variation in slope across benchmarks in https://epoch.ai/data-insights/llm-inference-price-trends?insight-option=All+benchmarks

Data on AI

Robi Rahman, Jaime Sevilla Molina, Pablo Villalobos and Ben Cottier

20 Jun 2024 6:31 UTC

1 point

0 comments1 min readLW link

(epochai.org)

Announcing Epoch’s newly expanded Parameters, Compute and Data Trends in Machine Learning database

Robi Rahman, Jaime Sevilla Molina, Tamay, Ege Erdil, Pablo Villalobos, Ben Cottier and Matthew Barnett

25 Oct 2023 2:55 UTC

18 points

0 comments1 min readLW link

(epochai.org)

Ben Cottier 30 Mar 2023 16:29 UTC
1 point
0
in reply to: sairjy’s comment on: GPT-4
What is the source for the “JP Morgan note”?

Ben Cottier 25 Feb 2023 12:56 UTC
1 point
0
in reply to: ChristianKl’s comment on: GPT-3-like models are now much easier to access and deploy than to develop
To be clear (sorry if you already understood this from the post): Running BLOOM via an API that someone else created is easy. My claim is that someone needs significant expertise to be able to run their own instance of BLOOM. I think the hardest part is setting up multiple GPUs to run the 176B parameter model. But looking back, I might have underestimated how straightforward it is to get the open-source code to run BLOOM working. Maybe it’s basically plug-and-play as long as you get an appropriate A100 GPU instance on the cloud. I did not attempt to run BLOOM from scratch myself.
I recall that in an earlier draft, my estimate for how many people know how to independently run BLOOM was higher (indicating that it’s easier). I got push-back on that from someone who works at an AI lab (though this person wasn’t an ML practitioner themselves). I thought they made a valid point but I didn’t think carefully about whether they were actually right in this case. So I decreased my estimate in response to their feedback.

Trends in the dollar training cost of machine learning systems

Ben Cottier1 Feb 2023 14:48 UTC

23 points

0 comments2 min readLW link

(epochai.org)

Conclusion and Bibliography for “Understanding the diffusion of large language models”

Ben Cottier16 Jan 2023 1:46 UTC

4 points

0 comments11 min readLW link

Questions for further investigation of AI diffusion

Ben Cottier16 Jan 2023 1:46 UTC

4 points

0 comments11 min readLW link

Implications of large language model diffusion for AI governance

Ben Cottier16 Jan 2023 1:45 UTC

7 points

0 comments36 min readLW link

Publication decisions for large language models, and their impacts

Ben Cottier16 Jan 2023 1:44 UTC

4 points

0 comments14 min readLW link

Drivers of large language model diffusion: incremental research, publicity, and cascades

Ben Cottier16 Jan 2023 1:44 UTC

4 points

0 comments24 min readLW link

The replication and emulation of GPT-3

Ben Cottier16 Jan 2023 1:40 UTC

4 points

0 comments13 min readLW link

GPT-3-like models are now much easier to access and deploy than to develop

Ben Cottier16 Jan 2023 1:39 UTC

12 points

3 comments15 min readLW link

Background for “Understanding the diffusion of large language models”

Ben Cottier16 Jan 2023 1:38 UTC

4 points

0 comments23 min readLW link

Understanding the diffusion of large language models: summary

Ben Cottier16 Jan 2023 1:37 UTC

26 points

1 comment22 min readLW link

Modeling Failure Modes of High-Level Machine Intelligence

Ben Cottier, Daniel_Eth and Sammy Martin

6 Dec 2021 13:54 UTC

54 points

1 comment12 min readLW link

Ben Cottier 2 Dec 2021 3:50 UTC
2 points
0
on: What are the biggest current impacts of AI?
Personal AI assistants seem to have one of the largest impacts (or at least “presence”) mainly due to the number of users. The impact per person seems small—making life slightly more convenient and productive, maybe. Not sure if there is actually much impact on productivity. I wonder if there is any research on this. I haven’t looked into it at all.

Relatedly, chatbots are certainly used a lot, but I’m uncertain about its current impacts beyond personal entertainment and wellbeing (and uncertain about the direction of the impact on wellbeing).

What 2026 looks like has a few relevant facts on the current impacts, and interesting speculation about the future impacts of personal assistants and chatbots. E.g. facts:

“in China in 2021 the market for chatbots is $420M/year, and there are 10M active users. This article claims the global market is around $2B/year in 2021 and is projected to grow around 30%/year.”

I don’t feel surprised by those stats, but I also hadn’t really considered how big the market is.

Ben Cottier 7 Nov 2021 15:29 UTC
LW: 4 AF: 2
0
AF
in reply to: Steven Byrnes’s comment on: Modeling the impact of safety agendas
Nice! A couple things that this comment pointed out for me:
1. Real time is not always (and perhaps often not) the most useful way to talk about timelines. It can be more useful to talk about different paths, or economic growth, if that’s more relevant to how tractable the research is.
2. An agenda doesn’t necessarily have to argue that its assumptions are more likely, because we may have enough resources to get worthwhile expected returns on multiple approaches.
Something that’s unclear here: are you excited about this approach because you think brain-like AGI will be easier to align? Or is it more about the relative probabilities / neglectedness / your fit?

Modeling the impact of safety agendas

Ben Cottier5 Nov 2021 19:46 UTC

51 points

6 comments10 min readLW link

Ben Cottier 20 Oct 2021 9:07 UTC
1 point
0
AF
on: AI learns betrayal and how to avoid it
I’m excited about this project. I’ve been thinking along similar lines about inducing a model to learn deception, in the context of inner alignment. It seems really valuable to have concrete (but benign) examples of a problem to poke at and test potential solutions on. So far there seem to be less concrete examples of deception, betrayal and the like to work with in ML compared to say, distributional shift, or negative side effects.