kwiat.dev

Karma: 252

kwiat.dev 19 Mar 2025 22:38 UTC
6 points
5
in reply to: Towards_Keeperhood’s comment on: I changed my mind about orca intelligence
So you do, gotcha

kwiat.dev 18 Mar 2025 22:50 UTC
18 points
6
on: I changed my mind about orca intelligence
Genuine question: do you still believe you’re one of the smartest young supergeniuses?

kwiat.dev 28 Jan 2025 22:19 UTC
10 points
2
in reply to: Thane Ruthenis’s comment on: DeepSeek Panic at the App Store
Trump “announces” a lot of things. It doesn’t matter until he actually does them.

kwiat.dev 21 Jan 2025 21:26 UTC
18 points
0
on: We don’t want to post again “This might be the last AI Safety Camp”
While I participated in a previous edition, and somewhat enjoyed it, I couldn’t bring myself to support it now considering Remmelt is the organizer, between his anti-AI-art crusades and an overall “stop AI” activism. It’s unfortunate, since technical AI safety research is very valuable, but promoting those anti-AI initiatives makes it a probable net negative in my eyes.
Maybe it’s better to let AISC die a hero.

kwiat.dev 20 Aug 2024 23:25 UTC
1 point
0
in reply to: arisAlexis’s comment on: Ten counter-arguments that AI is (not) an existential risk (for now)
Because the same argument could have been made earlier in the “exponential curve”. I don’t think we should have paused AI (or more broadly CS) in the ’50s, and I don’t think we should do it now.

kwiat.dev 16 Aug 2024 0:31 UTC
−1 points
−4
on: Ariel Kwiatkowski’s Shortform
Modern misaligned AI systems are good, actually. There’s some recent news about Sakana AI developing a system where the agents tried to extend their own runtime by editing their code/config.
This is amazing for safety! Current systems are laughably incapable of posing x-risks. Now, thanks to capabilities research, we have a clear example of behaviour that would be dangerous in a more “serious” system. So we can proceed with empirical research, create and evaluate methods to deal with this specific risk, so that future systems do not have this failure mode.
The future of AI and AI safety has never been brighter.

kwiat.dev 13 Aug 2024 23:29 UTC
6 points
8
in reply to: Nathan Young’s comment on: Ten arguments that AI is an existential risk
Expert opinion is an argument for people who are not themselves particularly informed about the topic. For everyone else, it basically turns into an authority fallacy.

Ten counter-arguments that AI is (not) an existential risk (for now)

kwiat.dev 13 Aug 2024 22:35 UTC

20 points

5 comments, 8 min read

kwiat.dev 9 Jun 2024 10:42 UTC
−11 points
−9
on: What if a tech company forced you to move to NYC?
This seems like a rather silly argument. You can apply it to pretty much any global change, any technological progress. The world changes, and will change. You can be salty about it, or you can adapt.

kwiat.dev 2 May 2024 23:02 UTC
1 point
2
in reply to: faul_sname’s comment on: Why I’m not doing PauseAI
And how would one go about procuring such a rock? Asking for a friend.

Why I’m not doing PauseAI

kwiat.dev 2 May 2024 22:00 UTC

−8 points

5 comments, 4 min read

kwiat.dev 8 Mar 2024 13:41 UTC
3 points
1
in reply to: Nathan Helm-Burger’s comment on: AI Safety 101 : Capabilities—Human Level AI, What? How? and When?
The ML researchers saying stuff like AGI is 15 years away have either not carefully thought it through, or are lying to themselves or the survey.
Ah yes, the good ol’ “If someone disagrees with me, they must be stupid or lying”

kwiat.dev 4 Feb 2024 11:57 UTC
7 points
−1
in reply to: gilch’s comment on: My thoughts on the Beff Jezos—Connor Leahy debate
For what it’s worth, I think you’re approaching this in good faith, which I appreciate. But I also think you’re approaching the whole thing from a very, uh, lesswrong.com-y perspective, quietly making assumptions and using concepts that are common here, but not anywhere else.
I won’t reply to every individual point, because there’s lots of them, so I’m choosing the (subjectively) most important ones.
This is the actual topic. It’s the Black Marble thought experiment by Bostrom,
No it’s not, and obviously so. The actual topic is AI safety. It’s not false vacuum, it’s not a black marble, or a marble of any color for that matter.
Connor wasn’t talking about the topic, he was building up to the topic using an analogy, a more abstract model of the situation. Which might be fair enough, except you can’t just assert this model. I’m sure saying that AI is a black marble will be accepted as true around here, but it would obviously get pushback in that debate, so you shouldn’t sneak it past quietly.
Again, Connor is simply correct here. This is not a novel argument. It’s Goodhart’s Law.
As I’m pretty sure I said in the post, you can apply this reasoning to pretty much any expression of values or goals. Let’s say your goal is stopping AI progress. If you’re consistent, that means you’d want humanity to go extinct, because then AI would stop. This is the exact argument that Connor was using, it’s so transparent and I’m disappointed that you don’t see it.
Again, this is what Eliezer, Connor, and I think is the obvious thing that would happen once an unaligned superintelligence exists: it pushes its goals to the limit at the expense of all we value. This is not Connor being unfair; this is literally his position.
Great! So state and defend and argue for this position, in this specific case of an unaligned superintelligence! Because the way he did it in the debate was just by extrapolating whatever views Beff expressed, without care for what they actually are, and showing that when you push them to the extreme, they fall apart. Because obviously they do, because of Goodhart’s Law. But you can’t dismiss a specific philosophy via a rhetorical device that can dismiss any philosophy.
Finally? Connor has been talking about this the whole time. Black marble!
Again, I extremely strongly disagree, but I suspect that’s a mannerism common in rationalist circles, using additional layers of abstraction and pretending they don’t exist. Black marble isn’t the point of the debate. AI safety is. You could put forward the claim that “AI = black marble”. I would lean towards disagreeing, I suspect Beff would strongly disagree, and then there could be a debate about this proposition.
Instead, Connor implicitly assumed the conclusion, and then proceeded to argue the obvious next point that “If we assume that AI black marble will kill us all, then we should not build it”.
Duh. The point of contention isn’t that we should destroy the world. The point of contention is that AI won’t destroy the world.
Connor is correctly making a very legit point here.
He’s not making a point. He’s again assuming the conclusion. You happen to agree with the conclusion, so you don’t have a problem with it.
The conclusion he’s assuming is: “Due to the nature of AI, it will progress so quickly going forward that already at this point we need to slow down or stop, because we won’t have time to do that later.”
My contention with this would be “No, I think AI capabilities will keep growing progressively, and we’ll have plenty of time to stop when that becomes necessary.”
This is the part that would have to be discussed. Not assumed.
That is a very old, very bad argument.
Believe it or not, I actually agree. Sort of. I think it’s not good as an argument, because (for me) it’s not meant to be an argument. It’s meant to be an analogy. I think we shouldn’t worry about overpopulation on Mars because the world we live in will be so vastly different when that becomes an immediate concern. Similarly, I think we shouldn’t (overly) worry about superintelligent AGI killing us, because the state of AI technology will be so vastly different when that becomes an immediate concern.
And of course, whether or not the two situations are comparable would be up to debate. I just used this to state my own position, without going the full length to justify it.
Yes. That would have been good. I could tell Connor was really trying to get there. Beff wasn’t listening though.
I kinda agree here? But the problem is on both sides. Beff was awfully resistant to even innocuous rhetorical devices, which I’d understand if that started late in the debate, but… it took him like idk 10 minutes to even respond to the initial technology ban question.
At the same time Connor was awfully bad at leading the conversation in that direction. Let’s just say he took the scenic route with a debate partner who made it even more scenic.
Besides that (which you didn’t even mention), I cannot imagine what Connor possibly could have done differently to meet your unstated standards, given his position. [...] What do you even want from him?
Great question. Ideally, the debate would go something like this.
B: So my view is that we should accelerate blahblah free energy blah AI blah [note: I’m not actually that familiar with the philosophical context, thermodynamic gods and whatever else; it’s probably mostly bullshit and imo irrelevant]
C: Yea, so my position is if we build AI without blah and before blah, then we will all die.
B: But the risk of dying is low because of X and Y reasons.
C: It’s actually high because of Z, I don’t think X is valid because W.
And keep trying to understand at what point exactly they disagree. Clearly they both want humanity/life/something to proliferate in some capacity, so even establishing that common ground in the beginning would be valuable. They did sorta reach it towards the end, but at that point the whole debate was played out.
Overall, I’m highly disappointed that people seem to agree with you. My problem isn’t even whether Connor is right, it’s how he argued for his positions. Obviously people around here will mostly agree with him. This doesn’t mean that his atrocious performance in the debate will convince anyone else that AI safety is important. It’s just preaching to the choir.

My thoughts on the Beff Jezos—Connor Leahy debate

kwiat.dev 3 Feb 2024 19:47 UTC

−5 points

23 comments, 4 min read

kwiat.dev 31 Jan 2024 20:49 UTC
1 point
3
on: Literally Everything is Infinite
So I genuinely don’t want to be mean, but this reminds me why I dislike so much of philosophy, including many chunks of rationalist writing.
This whole proposition is based on vibes, and is obviously false—just for sake of philosophy, we decide to ignore the “obvious” part, and roll with it for fun.
The chair I’m sitting on is finite. I may not be able to draw a specific boundary, but I can have a bounding box the size of the planet, and that’s still finite.
My life as a conscious being, as far as I know, is finite. It started some years ago, it will end some more years in the future. Admittedly I don’t have any evidence regarding what happens to qualia after death, but a vibe of infiniteness isn’t enough to convince me that I will infinitely keep experiencing things.
My childhood hamster’s life was finite. Sure, the particles are still somewhere in my hometown, but that’s no longer my hamster, nor my hamster’s life.
A day in my local frame is finite. It lasts about 24 hours, depending on how we define it—to be safe, it’s surely contained within 48 hours.
This whole thing just feels like… saying things. You can’t just say things and assume they are true, or even make sense. But apparently you can do that if you just refer to (ideally eastern) philosophy.

kwiat.dev 24 Jan 2024 13:13 UTC
19 points
3
on: This might be the last AI Safety Camp
What are the actual costs of running AISC? I participated in it some time ago, kinda participating this year again (it’s complicated). As far as I can tell, the only things that are required are some amount of organization, and then maybe a paid Slack workspace. Is this just about salaries for the organizers?

kwiat.dev 20 Nov 2023 10:55 UTC
0 points
−6
in reply to: Roko’s comment on: “Why can’t you just turn it off?”
Huh, whaddayaknow, turns out Altman was in the end pushed back, the new interim CEO is someone who is pretty safety-focused, and you were entirely wrong.
Normalize waiting for more details before dropping confident hot takes.

kwiat.dev 19 Nov 2023 15:23 UTC
17 points
14
on: “Why can’t you just turn it off?”
The board has backed down after Altman rallied staff into a mass exodus
[citation needed]

I’ve seen rumors and speculations, but if you’re that confident, I hope you have some sources?
(for the record, I don’t really buy the rest of the argument either on several levels, but this part stood out to me the most)

kwiat.dev 23 Oct 2023 23:20 UTC
1 point
0
on: What’s in a Name? Are you really an “AI Pessimist”?
I’m never a big fan of this sort of… cognitive rewiring? Juggling definitions? This post reinforces my bias, since it’s written from a point of very strong bias itself.
AI optimists think AI will go well and be helpful.
AI pessimists think AI will go poorly and be harmful.
It’s not that deep.
The post itself is bordering on insulting anyone who has a different opinion than the author (who, no doubt, would prefer the label “AI strategist” to “AI extremist”). I was thinking about going into the details of why, but honestly… this is unlikely to be productive discourse coming from a place where the “other side” is immediately compared to nationalists (?!) or extremists (?!!!).
I’m an AI optimist. I think AI will go well and will help humanity flourish, through both capabilities and alignment research. I think things will work out. That’s all.

kwiat.dev 20 Oct 2023 20:21 UTC
3 points
3
in reply to: the gears to ascension’s comment on: TOMORROW: the largest AI Safety protest ever!
In what sense do you think it will (might) not go well? My guess is that it will not go at all—some people will show up in the various locations, maybe some local news outlets will pick it up, and within a week it will be forgotten
