Software engineer from Ukraine, currently living and working in Estonia.
I mainly specialize in computer vision & robotics. https://grgv.xyz/.
Sergii
The problem is that most hackerspaces are not this:
“warehouse filled with whatever equipment one could possibly need to make things and run experiments in a dozen different domains”. Most hackerspaces that I’ve seen are fairly cramped spaces full of junk hardware, which is understandable: rent is very high and new equipment is very expensive.
But it would be cool to have access to something like:
https://www.facebook.com/watch/?v=530540257789037
Thanks, it’s an inspirational pitch, I can relate.
And my observation from these kinds of communities (hackerspaces, engineering/hacking conferences) is that a large fraction (I think a majority?) of participants are much more interested in the tech itself than in its applications. There is also not that much drive for novelty and innovation.
I think that there should be space for exploration and learning, but to me, wizardry is about getting things done, solving actual practical problems.
For example, at hackaday.com, there are cool projects, but a large fraction of the (extremely talented) hackers are building yet another 8-bit computer.
You should at least give credit to the inventors: https://reason.com/2019/01/16/how-scientology-recruits-inside-florida/
I have found that when using Anki for words/language learning, I frequently can’t remember the correct translation exactly, but can guess it as one of my top-3 options. In fact, this works well for me: even knowing vaguely what the word means is very useful.
Does anyone else use Anki with non-exact answers?
I have similar issues, severity varies over time.
If I am in a bad place, the things that help best are:
- taking care of mental health. I do CBT when I’m in worse shape, and take SSRIs. YMMV. Both getting diagnosed and getting treated are important. This also includes regular exercise and good sleep. What you have described might be (although does not have to be) related to depression, anxiety, or attention disorders.
- setting a timer for a short time, can be as short as 1 min, and doing one of the avoided tasks for just that 1 minute. It kind of “breaks the spell” for me.
- journaling, helps to “debug” the problems, and in most cases leads to writing down plans / interventions / resolutions.
Ha, thinking back to childhood I get it now, it’s the influence of the layout of the school daily journal in the USSR/Ukraine, like https://cn1.nevsedoma.com.ua/images/2011/33/7/10000000.jpg
I have a similar thing for weekdays, but somehow with a weird shape?
in general, it’s a similar cycle, but flipped horizontally, going left to right:
on top it’s: sun, sat
on the bottom: mon, tue, wed, thu, fri
the shape connecting days goes downwards from sun to mon, tue, wed, then upwards to thu, then down to fri, then up to sat and sun, closing the loop. Not sure if this makes any sense )
Unfortunately, AI research is commercialized and is heavily skewed by capitalist market needs,
so it’s still going to be all-in on trying to make an “AI office worker”, safety be damned, until this effort hits some wall, which I think is still plausible.
One possibility is that at some point AI products’ capabilities would be constrained by compute cost.
At that point, alignment “features” could become a competitive advantage, so companies would invest much more in alignment.
The latest short story by Greg Egan is kind of a hit piece on LW/EA/longtermism. I really enjoyed it: “Death and the Gorgon” https://asimovs.com/wp-content/uploads/2025/03/DeathGorgon_Egan.pdf
“in the long-term this could move the country toward the draconian censorship regimes, restrictions on political opposition, and unresponsiveness to public opinion that we see today in England, France, and Germany”
I don’t know much about freedom of speech in the US, but all the free speech indexes that I’ve found with a quick search show that European countries are ahead of the US. Am I missing something?
https://rsf.org/en/index
https://ourworldindata.org/grapher/freedom-of-expression-index
https://futurefreespeech.org/who-supports-free-speech-findings-from-a-global-survey/
Apparently it’s more efficient to do it the other way around: to compile programs into transformers, which are then useful as a reference and ground truth when analyzing “real” transformers.
See usage of TRACR in “Towards Automated Circuit Discovery for Mechanistic Interpretability” https://arxiv.org/pdf/2304.14997, for example.
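For a sense of what this looks like in practice: with TRACR you write a program in RASP (a DSL for computations a transformer can express) and compile it into concrete transformer weights. A minimal sketch, adapted from the reverse example in the TRACR README (untested here, so treat the exact API as approximate):

```python
from tracr.rasp import rasp
from tracr.compiler import compiling

# RASP program that reverses its input sequence:
# sequence length, via a select-all selector and its width
all_true = rasp.Select(rasp.tokens, rasp.tokens, rasp.Comparison.TRUE)
length = rasp.SelectorWidth(all_true)
# attend from position i to position (length - i - 1) and copy the token there
opp_index = length - rasp.indices - 1
flip = rasp.Select(rasp.indices, opp_index, rasp.Comparison.EQ)
reverse = rasp.Aggregate(flip, rasp.tokens)

# compile the RASP program into an actual transformer
model = compiling.compile_rasp_to_model(
    reverse, vocab={1, 2, 3}, max_seq_len=5, compiler_bos="BOS"
)
print(model.apply(["BOS", 1, 2, 3]).decoded)  # -> ["BOS", 3, 2, 1]
```

The resulting weights implement the program exactly, which is what makes them usable as ground truth when evaluating circuit-discovery methods.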
I have a similar experience, but I don’t think it’s a problem: my approach to learning a language is to first accumulate enough recognized words (thousands), and then to read a lot. In my experience lots of reading improves both recognition and recall.
There are several ways to explain and diagram transformers, some links that were very helpful for my understanding:
https://blog.nelhage.com/post/transformers-for-software-engineers/
https://dugas.ch/artificial_curiosity/GPT_architecture.html
https://peterbloem.nl/blog/transformers
http://nlp.seas.harvard.edu/annotated-transformer/
https://sebastianraschka.com/blog/2023/self-attention-from-scratch.html
https://github.com/markriedl/transformer-walkthrough
https://francescopochetti.com/a-visual-deep-dive-into-the-transformers-architecture-turning-karpathys-masterclass-into-pictures/
https://jalammar.github.io/illustrated-transformer/
https://e2eml.school/transformers.html
https://jaykmody.com/blog/attention-intuition/
https://eugeneyan.com/writing/attention/
https://www.jeremyjordan.me/attention/
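If it helps, the computational core that all of these posts build up to fits in a few lines of numpy. A minimal single-head sketch (the dimensions, naming, and toy data are my own choices):

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Single-head scaled dot-product self-attention over a sequence X."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv          # project inputs to queries/keys/values
    scores = Q @ K.T / np.sqrt(K.shape[-1])   # similarity of each query to each key
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over keys
    return weights @ V                        # each output is a weighted mix of values

# toy example: 4 tokens, embedding dim 8
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)  # (4, 8)
```

Real transformers add multiple heads, masking, residual connections, and MLP blocks on top of this, but those are all elaborations of this one operation.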
In an abstract sense, yes. But for me in practice finding the truth means doing a check on Wikipedia. It’s super easy to mislead humans, so it should be as easy with AI.
I agree with the possibility of pre-training plateauing at some point, possibly even in the next few years.
It would change timelines significantly. But there are other factors apart from scaling pre-training. For example, reasoning models like o3 crushing ARC-AGI (https://arcprize.org/blog/oai-o3-pub-breakthrough). Reasoning in latent space is still too fresh, but it might be the next breakthrough of a similar magnitude.
Why not take GPT-4.5 for what it is? OpenAI has literally stated that it’s not a frontier model. OK, so GPT-5 will not be a 100x-ed GPT-4, but maybe GPT-6 will be, and that might be enough for AGI.
You should not look for progress in autonomy/agency in commercial offerings like GPT-4.5. At this point OpenAI is focusing on what sells well (better personality and EQ). I think they care less about a path to AGI. Rapid advances towards agency/autonomy are better gauged from the academic literature.
I agree that we should not fall for “vibe checks”.
But don’t bail on benchmarks: many people are working on benchmarks and evals, there is constant progress there, and benchmarks are getting more objective and harder to game. Rather than looking at benchmarks that are pushed by OpenAI, it’s better to look for cutting-edge ones in the academic literature. Evaluating a SOTA model with a benchmark that is a few years old does not make sense at this point.
LLMs live in an abstract textual world, and do not understand the real world well (see “[Physical Concept Understanding](https://physico-benchmark.github.io/index.html#)”). We already manipulate LLMs with prompts, cut-off dates, etc. But what about going deeper, by “poisoning” the training data with safety-enhancing beliefs?
For example, if the training data has lots of content about how hopeless, futile, and dangerous it is for an AI to scheme and hack, might that be a useful safety guardrail?
I made something like this; it works differently though, blocking is based on a fixed prompt: https://grgv.xyz/blog/awf/
What about estimating LLM capabilities from the length of a sequence of numbers that it can reverse?
I used prompts like:
“please reverse 4 5 8 1 1 8 1 4 4 9 3 9 3 3 3 5 5 2 7 8”
“please reverse 1 9 4 8 6 1 3 2 2 5”
etc...
Some results:
- Llama2 starts making mistakes after 5 numbers
- Llama3 can do 10, but fails at 20
- GPT-4 can do 20 but fails at 40
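This is easy to automate. A minimal sketch of how the check could be run (assuming the OpenAI Python client; the model name, trial count, and pass criterion are my arbitrary choices):

```python
import random
from openai import OpenAI

client = OpenAI()  # needs OPENAI_API_KEY in the environment

def can_reverse(model: str, n: int, trials: int = 5) -> bool:
    """Ask `model` to reverse random n-digit sequences; pass if at most one trial fails."""
    passed = 0
    for _ in range(trials):
        seq = [str(random.randint(0, 9)) for _ in range(n)]
        prompt = "please reverse " + " ".join(seq)
        reply = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": prompt}],
        ).choices[0].message.content
        passed += " ".join(reversed(seq)) in reply  # reversed sequence appears in the reply
    return passed >= trials - 1

# probe roughly the same lengths as the results above
for n in [5, 10, 20, 40]:
    print(n, can_reverse("gpt-4", n))
```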
The follow-up questions are:
- what should be the name of this metric?
- are the other top-scoring models like Claude similar? (I don’t have access)
- any bets on how many numbers GPT-5 will be able to reverse?
- how many numbers should AGI be able to reverse? ASI? can this be a Turing test of sorts?
His later book, “I Am a Strange Loop”, is I think more interesting for insights on consciousness.