If you get an email from aisafetyresearch@gmail.com, that is most likely me. I also read that inbox weekly, so you can pass a message into my mind that way.
Other ~personal contacts: https://linktr.ee/uhuge
Martin Vlach
AISC is meant to point to https://www.aisafety.camp here, I think? (It's definitely not the first thing in the search results.)
Are we on the same page that the current evidence shows DS_Math_v1 was a 99% honest and immediately useful drop, while for v2 we are only around 90% sure of this?
Talking to chatbots with curiosity, in the way of “What will it do here?”, is an ablation for evaluations/benchmarking.
Ouch, there are 5 sources of tension; you’ve named just the first one, and I’d bet some of the 5 cover more than a minority of our population.
Did you refer to
> dialing our sense of threat
or to it as a prominent emotion that does not fit the pattern described?
In the second case I might adjust for a bit more clarity; I did not perceive it as a “typical emotion”.
https://philarchive.org/rec/KURTTA-2
Wow, that’s comprehensive (≈ long).
It’s simply not enough to develop AI gradually, perform evaluations and do interpretability work to build safe superintelligence.
but developing AI gradually, performing evaluations, and doing interpretability to indicate when to stop developing (capabilities) seems sensibly safe.
Pretty brilliant and IMHO correct observations for counter-arguments, appreciated!
Task duration for software engineering tasks that AIs can complete with 50% success rate (50% time horizon)
The paragraph seems duplicated.
medical research doing so in concerning domains
“instead of” is missing..?
My friend’s (M.K., he’s on GitHub) honorable aim is to establish a term in the AI evals field: the cognitive asymmetry, a generating-verifying complexity gap for model-as-judge evals.
What’s desired are tasks with a clear intelligence-to-solve vs. intelligence-to-verify-a-solution gap, i.e. only X00-B LMs have a shot at solving them, but an X-B model is strong at verifying.
It fits nicely into the incremental iterative alignment scaling playbook, I hope.
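A minimal sketch of how such a generating-vs-verifying gap could be scored in a model-as-judge eval; the function names and toy tasks below are my own placeholders, not M.K.’s actual setup:

```python
# Sketch only: score how often a small "verifier" model agrees with ground
# truth about solutions that only a large "generator" model can produce.
from typing import Callable


def gap_score(
    tasks: list[str],
    generate: Callable[[str], str],            # large model: task -> candidate solution
    judge: Callable[[str, str], bool],         # small model: (task, candidate) -> verdict
    ground_truth: Callable[[str, str], bool],  # oracle check of a candidate
) -> float:
    """Fraction of tasks on which the small judge matches the oracle verdict."""
    agree = 0
    for task in tasks:
        candidate = generate(task)
        agree += judge(task, candidate) == ground_truth(task, candidate)
    return agree / len(tasks)


# Toy usage with stand-in functions; a real eval would call LM APIs here.
if __name__ == "__main__":
    tasks = ["2+2", "3*7"]
    solve = lambda t: str(eval(t))           # stand-in "large generator"
    verify = lambda t, c: c == str(eval(t))  # stand-in "small verifier"
    oracle = lambda t, c: c == str(eval(t))
    print(gap_score(tasks, solve, verify, oracle))
```

A high gap_score from a verifier much smaller than the generator would be the kind of asymmetry evidence M.K. is after, I suppose.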
I’d bet a “re-based” model à la https://huggingface.co/jxm/gpt-oss-20b-base would, when instruction-tuned, do the same as similarly sized Qwen models.
It’s provided the current time together with the other 20k sys-prompt tokens, so its influence on the behaviours is substantially more diluted..?
Folks like this guy hit it on hyperspeed -
https://www.facebook.com/reel/1130046385837121/?mibextid=rS40aB7S9Ucbxw6v
I still remember a university teacher explaining how early TV transmissions would very often include/display ghosts of dead people, especially dead relatives.
As the tech matures from an art, these phenomena or hallucinations evaporate.
You seem to report one OOM less than this picture in https://alexiglad.github.io/blog/2025/ebt/#:~:text=a%20log%20function).-,Figure%208,-%3A%20Scaling%20for
The link to the Induction section on https://www.lesswrong.com/lw/dhg/an_intuitive_explanation_of_solomonoff_induction/#induction seems broken on mobile Chrome, @habryka
I’ve heard that hypothesis in a review of that Anthropic blog post, likely by AI Explained, maybe by bycloud. They’ve called it “Chekhov’s gun”.
What’s your view on sceptical claims about RL on transformer LMs, like https://arxiv.org/abs/2504.13837v2, or the one that CoT instruction yields better results than <thinking> training?
Not the content I expect to be labeled AI Capabilities, although I see how that’d be vindicated.
By the way, if I write an article about LMs generating SVG, that’s plaintext, and if I put an SVG illustration up, that’s an image, not plaintext?
A 5k-sized dataset seems suspicious..