Alex_Altair

Karma: 4,787

Alex_Altair Dec 20, 2024, 9:13 PM
3 points
0
in reply to: Zach Stein-Perlman’s comment on: o3
Thanks. Is “pass@1” some kind of lingo? (It seems like an ungoogleable term.)

Alex_Altair Dec 20, 2024, 8:50 PM
4 points
0
in reply to: Jonas Hallgren’s comment on: o3
I guess one thing I want to know is like… how exactly does the scoring work? I can imagine something like, they ran the model a zillion times on each question, and if any one of the answers was right, that got counted in the light blue bar. Something that plainly silly probably isn’t what happened, but it could be something similar.
If it actually just submitted one answer to each question and got a quarter of them right, then I think it doesn’t particularly matter to me how much compute it used.

Alex_Altair Dec 20, 2024, 7:37 PM
7 points
6
in reply to: Alex_Altair’s comment on: o3
On the livestream, Mark Chen says the 25.2% was achieved “in aggressive test-time settings”. Does that just mean more compute?

Alex_Altair Dec 20, 2024, 7:35 PM
10 points
1
on: o3
I wish they would tell us what the dark vs light blue means. Specifically, for the FrontierMath benchmark, the dark blue looks like it’s around 8% (rather than the light blue at 25.2%). Which like, I dunno, maybe this is nit picking, but 25% on FrontierMath seems like a BIG deal, and I’d like to know how much to be updating my beliefs.

Alex_Altair Dec 4, 2024, 8:04 PM
9 points
0
on: The 2023 LessWrong Review: The Basic Ask
things are almost never greater than the sum of their parts Because Reductionism
Isn’t it more like, the value of the sum of the things is greater than the sum of the value of each of the things? That is, $f (a + b) > f (a) + f (b)$ (where perhaps $f$ is a utility function). That seems totally normal and not-at-all at odds with Reductionism.

Alex_Altair Nov 26, 2024, 1:39 AM
10 points
0
in reply to: Dalcy’s comment on: Darcy’s Shortform
I’d vote for removing the stage “developing some sort of polytime solution” and just calling 4 “developing a practical solution”. I think listing that extra step is coming from the perspective of something who’s more heavily involved in complexity classes. We’re usually interested in polynomial time algorithms because they’re usually practical, but there are lots of contexts where practicality doesn’t require a polynomial time algorithm, or really, where we’re just not working in a context where it’s natural to think in terms of algorithms with run-times.

Alex_Altair Nov 21, 2024, 11:02 PM
6 points
2
on: A Straightforward Explanation of the Good Regulator Theorem
Thank you for writing this! Your description in the beginning about trying to read about the GRT and coming across a sequence of resources, each of which didn’t do quite what you wanted, is a precise description of the path I also followed. I gave up at the end, wishing that someone would write an explainer, and you have written exactly the explainer that I wanted!

Alex_Altair Oct 30, 2024, 5:45 PM
7 points
0
in reply to: habryka’s comment on: Habryka’s Shortform Feed
Positive feedback, I am happy to see the comment karma arrows pointing up and down instead of left and right. I have some degree of left-right confusion and was always click and unclicking my comments votes to figure out which was up and down.
Also appreciate that the read time got put back into main posts.
(Comment font stuff looks totally fine to me, both before and after this change.)

Alex_Altair Oct 28, 2024, 2:02 AM
2 points
0
in reply to: Dalcy’s comment on: Darcy’s Shortform
[Some thoughts that are similar but different to my previous comment;]
I suspect you can often just prove the behavioral selection theorem and structural selection theorem in separate, almost independent steps.
1. Prove a behavioral theorem
2. add in a structural assumption
3. prove that behavioral result plus structural assumption implies structural result.
Behavior essentially serves as an “interface”, and a given behavior can be implemented by any number of different structures. So it would make sense that you need to prove something about structure separately (and that you can prove it for multiple different types of structural assumption).
Further claims: for any given structural class,
- there will be a natural simplicity measure
- simpler instances will be exponentially rare.
A structural class is something like programs, or Markov chains, or structural causal models. The point of specifying structure is to in some way model how the system might actually be shaped in real life. So it seems to me that any of these will be specified with a finite string over a finite alphabet. This comes with the natural simplicity measure of the length of the specification string, and there are exponentially fewer short strings than long ones.^[1]
So let’s say you want to prove that your thing X which has behavior B has specific structure S. Since structure S has a fixed description length, you almost automatically know that it’s exponentially less likely for X to be one of the infinitely many structures with description length longer than S. (Something similar holds for being within delta of S) The remaining issue is whether there are any other secret structures that are shorter than S (or of similar length) that X could be instead.
1. ^
  Technically, you could have a subset of strings that didn’t grow exponentially. For example, you could, for some reason, decide to specify your Markov chains using only strings of zeros. That would grow linearly rather than exponentially. But this is clearly a less natural specification method.

Alex_Altair Oct 19, 2024, 12:45 AM
2 points
0
in reply to: Dalcy’s comment on: Darcy’s Shortform
For some reason the “only if” always throws me off. It reminds me of the unless keyword in ruby, which is equivalent to if not, but somehow always made my brain segfault.

Alex_Altair Oct 19, 2024, 12:44 AM
4 points
0
in reply to: Alex_Altair’s comment on: Darcy’s Shortform
It’s maybe also worth saying that any other description method is a subset of programs (or is incomputable and therefore not what real-world AI systems are). So if the theoretical issues in AIT bother you, you can probably make a similar argument using a programming language with no while loop, or I dunno, finite MDPs whose probability distributions are Gaussian with finite parameter descriptions.

Alex_Altair Oct 19, 2024, 12:39 AM
6 points
0
in reply to: Dalcy’s comment on: Darcy’s Shortform
Yeah, I think structural selection theorems matter a lot, for reasons I discussed here.

This is also one reason why I continue to be excited about Algorithmic Information Theory. Computable functions are behavioral, but programs (= algorithms) are structural! The fact that programs can be expressed in the homogeneous language of finite binary strings gives a clear way to select for structure; just limit the length of your program. We even know exactly how this mathematical parameter translates into real-world systems, because we can know exactly how many bits our ML models take up on the hard drives.

And I think you can use algorithmic information distance to well-define just how close to agent-structured your policy is. First, define the specific program A that you mean to be maximally agent-structured (which I define as a utility-maximizing program). If your policy (as a program) can be described as “Program A, but different in ways X” then we have an upper bound for how close it is to agent-structured it is. X will be a program that tells you how to transform A into your policy, and that gives us a “distance” of at most the length of X in bits.

For a given length, almost no programs act anything like A. So if your policy is only slightly bigger than A, and it acts like A, then it’s probably of the form “A, but slightly different”, which means it’s agent-structured. (Unfortunately this argument needs like 200 pages of clarification.)

Alex_Altair Oct 6, 2024, 12:01 AM
12 points
4
on: Seeking AI Alignment Tutor/Advisor: $100–150/hr
FWIW I think this would be a lot less like “tutoring” and a lot more like “paying people to tell you their opinions”. Which is a fine thing to want to do, but I just want to make sure you don’t think there’s any kind of objective curriculum that comprises AI alignment.

Alex_Altair Sep 27, 2024, 10:46 PM
2 points
0
in reply to: Cole Wyeth’s comment on: Work with me on agent foundations: independent fellowship
Nice! Yeah I’d be happy to chat about that, and also happy to get referrals of any other researchers who might be interested in receiving this funding to work on it.

Alex_Altair Sep 24, 2024, 4:44 AM
11 points
0
on: Search 5000 books, speed up your research and personal growth
Note to readers; it is an obligatory warning on any post like this that you should not run random scripts downloaded from the internet without reading them to see what they do, because there are many harmful things they could be doing.

Alex_Altair Sep 21, 2024, 5:18 PM
3 points
0
in reply to: Nathan Helm-Burger’s comment on: Work with me on agent foundations: independent fellowship
<3!

Work with me on agent foundations: independent fellowship

Alex_AltairSep 21, 2024, 1:59 PM

59 points

5 comments4 min readLW link

Alex_Altair Aug 28, 2024, 6:09 AM
5 points
2
on: Perplexity wins my AI race
FWIW I have used Perplexity twice since you mentioned it, it was somewhat helpful both times, but also, both times the citations had errors. By that I mean it would say something and then put a citation number next to it, but what it said was not in the cited document.

Alex_Altair Aug 28, 2024, 6:03 AM
4 points
0
on: My Apartment Art Commission Process

Aren’t they sick as hell???

Can confirm, these are sick as hell

Alex_Altair Aug 22, 2024, 9:31 PM
5 points
0
in reply to: Hudjefa’s comment on: Quick look: applications of chaos theory
I know that there’s something called the Lyapunov exponent. Could we “diminish the chaos” if we use logarithms, like with the Richter scale for earthquakes?
This is a neat question. I think the answer is no, and here’s my attempt to describe why.
The Lyapunov exponent measures the difference between the trajectories over time. If your system is the double pendulum, you need to be able to take two random states of the double pendulum and say how different they are. So it’s not like you’re measuring the speed, or the length, or something like that. And if you have this distance metric on the whole space of double-pendulum states, then you can’t “take the log” of all the distances at the same time (I think because that would break the triangle inequality).

Alex_Altair

Work with me on agent foun­da­tions: in­de­pen­dent fellowship

Work with me on agent foundations: independent fellowship