I think there are two problems with providing the ‘worse answer’. My first issue is that some conversations with LLMs can be about topics that don’t have a clearly worse answer. How can you tell which one is more persuasive?
Secondly, even if I knew which answer was better, I worry about the Waluigi effect. If I optimize for the safest response, am I summoning an unsafe Waluigi? I think that is possible. I really don’t think RL on user feedback is a good idea when we don’t know what to optimize for. The alignment problem certainly isn’t solved. I think flipping a coin is safer.
What kind of answer, more specifically than ‘worse’, do you think I should pick, if I shouldn’t flip a coin?
I’ve discovered that generating a video of yourself with Sora 2, saying something like ‘this video was generated by AI’, is freaky enough to make people who know you well, especially those skeptical about AI capabilities, start to freak out a bit.
Thought this might be a useful idea for others trying to persuade people to tune in, rather than auto-reject the idea that very capable systems might be right around the corner.