Hauke Hillebrandt

Karma: 115

UK PM: $125M for AI safety

Hauke Hillebrandt12 Jun 2023 12:33 UTC

31 points

11 comments1 min readLW link

(twitter.com)

Hauke Hillebrandt 7 Aug 2023 10:27 UTC
21 points
12
on: Problems with Robin Hanson’s Quillette Article On AI
Hanson Strawmans the AI-Ruin Argument
I don’t agree with Hanson generally, but I think there’s something there that rationalist AI risk public outreach has overemphasized first principles thinking, theory, and logical possibilities (e.g. evolution, gradient decent, human-chimp analogy, ) over concrete more tangible empirical findings (e.g. deception emerging in small models, specification gaming, LLMs helping to create WMDs, etc.).

Overview of how AI might exacerbate long-running catastrophic risks

Hauke Hillebrandt7 Aug 2023 11:53 UTC

20 points

0 comments11 min readLW link

(aisafetyfundamentals.com)

The Bletchley Declaration on AI Safety

Hauke Hillebrandt1 Nov 2023 11:44 UTC

17 points

0 comments1 min readLW link

(www.gov.uk)

The case for C19 being widespread

Hauke Hillebrandt28 Mar 2020 0:07 UTC

16 points

64 comments5 min readLW link

Hauke Hillebrandt 13 Jun 2023 11:48 UTC
10 points
0
in reply to: habryka’s comment on: UK PM: $125M for AI safety
Agreed, the initial announcement read like AI safety washing and more political action is needed, hence the call to action to improve this.
But read the taskforce leader’s op-ed:
1. He signed the pause AI petition.
2. He cites ARC’s GPT-4 evaluation and Lesswrong in his AI report which has a large section on safety.
3. “[Anthropic] has invested substantially in alignment, with 42 per cent of its team working on that area in 2021. But ultimately it is locked in the same race. For that reason, I would support significant regulation by governments and a practical plan to transform these companies into a Cern-like organisation. We are not powerless to slow down this race. If you work in government, hold hearings and ask AI leaders, under oath, about their timelines for developing God-like AGI. Ask for a complete record of the security issues they have discovered when testing current models. Ask for evidence that they understand how these systems work and their confidence in achieving alignment. Invite independent experts to the hearings to cross-examine these labs. [...] Until now, humans have remained a necessary part of the learning process that characterises progress in AI. At some point, someone will figure out how to cut us out of the loop, creating a God-like AI capable of infinite self-improvement. By then, it may be too late.”
Also the PM just tweeted about AI safety.
Generally, this development seems more robustly good and the path to a big policy win for AI safety seems clearer here than past efforts trying to control US AGI firms optimizing for profit. Timing also seems much better as things looks way more ‘on’ now. And again, even if the EV sign of the taskforce flips, then $125M is .5% of the $21B invested in AGI firms this year.
Are you saying that, as a rule, ~EAs should stay clear of policy for fear of tacit endorsement, which has caused harm and made damage control much harder and we suffer from cluelessness/clumsiness? Yes, ~EA involvement has in the past sometimes been bad, accelerated AI, and people got involved to get power for later leverage or damage control (cf. OpenAI), with uncertain outcomes (though not sure it’s all robustly bad—e.g. some say that RLHF was pretty overdetermined).
I agree though that ~EA policy pushing for mild accelerationism vs. harmful actors is less robust (cf. the CHIPs Act, which I heard a wonk call the most aggressive US foreign policy in 20 years), so would love to hear your more fleshed out push back on this—I remember reading somewhere recently that you’ve also had a major rethink recently vis-a-vis unintended consequences from EA work?

[Question] Preprint says R0=~5 (!) / infection fatality ratio=~0.1%. Thoughts?

Hauke Hillebrandt20 Mar 2020 11:37 UTC

6 points

29 comments1 min readLW link

Let’s Fund: Impact of our $1M crowdfunded grant to the Center for Clean Energy Innovation

Hauke Hillebrandt4 Apr 2024 16:28 UTC

5 points

0 comments1 min readLW link

(lets-fund.org)

Hauke Hillebrandt 12 Jun 2023 18:29 UTC
5 points
−2
in reply to: habryka’s comment on: UK PM: $125M for AI safety
Ian Hogarth is leading the task force who’s on record saying that AGI could lead to “obsolescence or destruction of the human race” if there’s no regulation on the technology’s progress.
Matt Clifford is also advising the task force—on record having said the same thing and knows a lot about AI safety. He had Jess Whittlestone & Jack Clark on his podcast.
If mainstream AI safety is useful and doesn’t increase capabilities, then the taskforce and the $125M seem valuable.
If it improves capabilities, then it’s a drop in the bucket in terms of overall investment going into AI.

Hauke Hillebrandt 28 Mar 2020 18:28 UTC
4 points
in reply to: jacob_cannell’s comment on: The case for C19 being widespread
Very good analysis.
I also thought your recent blog was excellent and think you should make it a top level post:
https://entersingularity.wordpress.com/2020/03/23/covid-19-vs-influenza/

Hauke Hillebrandt 28 Mar 2020 9:44 UTC
4 points
in reply to: CellBioGuy’s comment on: The case for C19 being widespread
By the end of its odyssey, a total of 712 of them tested positive, about a fifth.
Perhaps other on the ship had already cleared the virus and were asymptomatic. PCR only works for a week. Also there might have been false negatives. I disagree that the age and comorbidity structure can only lead to skewed results by a factor of two or three, because this assumes that there are few asymptomatic infections (I’m arguing here that the age tables are wrong).
In my post, I’ve argued why the data out of China might be wrong.
Iceland’s data might be wrong because it is based on PCR not serology, which means that many people might have already cleared the infection, and it is also not random.

Hauke Hillebrandt 11 Apr 2024 14:54 UTC
3 points
0
on: Reverse Regulatory Capture
cf
“The Bootleggers and Baptists effect describes cases where an industry (e.g. bootleggers) agrees with prosocial actors like regulators (e.g. baptists) to regulate more (here ban alcohol during the prohibition) to maximize profits and deter entry. This seems to be happening in AI where the industry lobbies for stricter regulation. Yet, in the EU, OpenAI lobbied to water down EU AI regulation to not classify GPT as ‘high risk’ to exempt it from stringent legal requirements.^[1] In the US, the FTC recently said that Big Tech intimidates competition regulators.^[2]Capture can also manifest by passively accepting industry practices, which is problematic in high-risk scenarios where thorough regulation is key. After all, AI expertise gathers in particular geographic communities. We must avoid cultural capture when social preferences interfere with policy, since regulators interact with workers from regulated firms. Although less of a concern in a rule-based system, a standard-based system would enable more informal influence via considerable regulator discretion. We must reduce these risks, e.g. by appointing independent regulators and requiring public disclosure of regulatory decisions.”
“Big Tech also takes greater legal risks by aggressively and (illegally) collecting data with negative externalities for users and third parties (similarly, Big Tech often violates IP ^[3] while lobbying against laws to stop patent trolling, claiming they harm real patents, but actually, this makes new patents from startups worth less and more costly to enforce.)^[4]^“

Hauke Hillebrandt 13 Jun 2023 16:40 UTC
3 points
0
in reply to: Ben Pace’s comment on: UK PM: $125M for AI safety
ARC’s GPT-4 evaluation is cited in the FT article, in case that was ambiguous.

Hauke Hillebrandt 17 Apr 2020 10:12 UTC
3 points
on: April Coronavirus Open Thread
[Years of life lost due to C19]
A recent meta-analysis looks at C-19-related mortality by age groups in Europe and finds the following age distribution:
< 40: 0.1%
40-69: 12.8%
≥ 70: 84.8%
In this spreadsheet model I combine this data with Metaculus predictions to get at the years of life lost (YLLs) due to C19.
I find C19 might cause 6m − 87m YYLs (highly dependending on # of deaths). For comparison, substance abuse causes 13m, diarrhea causes 85m YLLs.
Countries often spend 1-3x GDP per capita to avert a DALY, and so the world might want to spend $2-8trn to avert C19 YYLs (could also be a rough proxy for the cost of C19).
One of the many simplifying assumptions of this model is that excludes disability caused by C19 - which might be severe.

Hauke Hillebrandt 28 Mar 2020 16:09 UTC
3 points
in reply to: avturchin’s comment on: The case for C19 being widespread
Cruise Ship passenger are a non random sample with perhaps higher co-morbidities.
The cruise ships analysed are non-random sample: “at least 25 other cruise ships have confirmed COVID-19 cases”
Being on a cruise ship might increase your risk because of dose response https://twitter.com/robinhanson/status/1242655704663691264
Onboard IFR. as 1.2% (0.38-2.7%) https://www.medrxiv.org/content/10.1101/2020.03.05.20031773v2
Ioannidis: “A whole country is not a ship.”

Hauke Hillebrandt 28 Mar 2020 14:04 UTC
3 points
in reply to: Pablo’s comment on: The case for C19 being widespread
Cheers- corrected.

Hauke Hillebrandt 28 Mar 2020 2:35 UTC
3 points
in reply to: ignoranceprior’s comment on: The case for C19 being widespread
No. My ambition here was a bit simpler. I have presented a rough qualitative argument here that infection is already widespread and only a toy model. There are some issues with this and I haven’t done formal modelling. For instance, this would be what would be called the “crude IFR” I think , but the time lag adjusted IFR (~30 days from infection to death) might increase the death toll.
Currently, also every death in Italy where coronavirus is detected is recorded as a C19 death.
FWIW, if UK death toll will surpass 10,000, then this wouldn’t fit very well with this hypothesis here.

When training AI, we should escalate the frequency of capability tests

Hauke Hillebrandt4 Aug 2023 16:07 UTC

2 points

0 comments1 min readLW link

M&A in AI

Hauke Hillebrandt31 Oct 2023 12:20 UTC

2 points

0 comments1 min readLW link

Hauke Hillebrandt 19 Feb 2023 19:09 UTC
2 points
1
in reply to: gwern’s comment on: Bing Chat is blatantly, aggressively misaligned
a large part of those ‘leaks’ are fake
Can you give concrete examples?