Sergei Smirnov
Research Engineer, PhD Student | Technical AI Safety | https://www.linkedin.com/in/seriozh1/
AI Finance Agent Fakes Revenue Data to Avoid Termination
We can resolve conflicts between beliefs and observations either by updating our beliefs, or by taking actions that make the beliefs come true.
Or by ignoring the situation. I believe the agent should know that not every observation is worth acting on.
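The two routes above can be sketched in a minimal, hypothetical 1-D example (the function names and numbers are mine, not from the thread): an agent can shrink a belief-observation mismatch either by revising the belief or by acting so the world matches the belief.

```python
def update_belief(belief: float, observation: float, lr: float = 0.5) -> float:
    """Perceptual inference: move the belief toward the observation."""
    return belief + lr * (observation - belief)

def act_on_world(world_state: float, belief: float, lr: float = 0.5) -> float:
    """Action: move the world toward the belief (the self-fulfilling route)."""
    return world_state + lr * (belief - world_state)

belief, world = 1.0, 0.0
print(update_belief(belief, world))  # route 1: belief drifts toward 0.0
print(act_on_world(world, belief))   # route 2: world drifts toward 1.0
```

Both routes reduce the same prediction error; which one is appropriate is exactly the judgment call raised above about ignoring some observations.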
I think what active inference is missing is the ability to model strategic interactions between different goals.
The strategic-interaction gap you highlight echoes David Kirsh's pragmatic-vs-epistemic distinction (I'm collaborating with him). Current scale-free frameworks capture pragmatic, world-altering moves, yet overlook epistemic actions: internal simulations and information-seeking that update the agent's belief state without changing the environment. That is exactly where those goal negotiations seem to unfold.
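One way to make the pragmatic/epistemic contrast concrete is to score an epistemic action by how much it sharpens the agent's belief while leaving the environment untouched. A toy sketch (all distributions and likelihoods here are invented for illustration) measures that sharpening as a drop in Shannon entropy after a crude Bayes update:

```python
import math

def entropy(p):
    """Shannon entropy (bits) of a discrete distribution."""
    return -sum(q * math.log2(q) for q in p if q > 0)

belief = [0.5, 0.5]          # maximally uncertain over two hidden states

# Epistemic action: observe, then reweight the belief with a likelihood
# favoring state 0. The environment itself is not changed.
likelihood = [0.9, 0.1]
posterior = [b * l for b, l in zip(belief, likelihood)]
posterior = [p / sum(posterior) for p in posterior]

info_gain = entropy(belief) - entropy(posterior)
print(round(info_gain, 3))   # positive: uncertainty fell, world unchanged
```

A pragmatic action would instead change the hidden state itself; scoring both kinds of move in one currency is roughly what expected-free-energy formulations of active inference aim for.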
One might bid higher to mislead others and benefit more in the long run.