> We have no counterexamples or causal experiments.
That is the nature of politics / economics / culture. There are no ways to conduct meaningful experiments on policies, but we still need to make policy decisions, so we need to have other ways to build causal models of the world. There’s a bunch of literature around how to do that. Daron Acemoglu won the 2024 nobel prize in economics for innovating some of these methods (specifically in the context of “institutional selection”)
> the forager → farmer → industrial → modern “progression” has included some return to forager values, and some brand-new directions that probably just wouldn’t work without the level of wealth and automation we’ve managed.
This is a really interesting point, can you expand on it more?
frmsaul
Both the Meiji restoration and the Alexander II reforms greatly increased the liberties of the average person. The fact that they made the state more powerful means that “liberalization” is adaptive and only states that do it “win”.
I agree that the “rise of China” is an interesting counter-point. Xiaoping’s China adopted some liberal ideas (free markets, some local democracy, some free-speech, some rule of law) but didn’t adopt “the entire liberal package”.
I imagine that de-centralisation and capabilities slowdown can go together. The government can heavily regulate the labs and limit capability growth, and the labs can compete on other things e.g costs, speed, “values”. What do you think?
Is Progress Inevitable?
frmsaul’s Shortform
I Haven’t Thought About the Blood Pipeline in Years!
As a student at the Technion, engineering exams evaluated both my capabilities and my ethics. What does it mean? It’s all downstream of this story:In 1961, the late Professor Haim Hanani was appointed as the vice president of the Technion. Once appointed, Hanani proposed to start teaching also humanities. The other professors did not agree. They claimed that there’s barely enough time to teach “hard” sciences.
Hanani wanted to prove his colleagues wrong. He gathered a hundred students, and gave them an exam with one question: what technical information do you need to plan a pipeline to transport blood from Ashdod to Eilat? The two cities are located 250km apart.
The students started working on a solution right away. Using drawing boards and slide rules, they suggested to measure the topographical situation along the route, check the pipe layout, and test the corrosion resistance.
When they finished and submitted the test, Professor Hanani announced that they all failed: “I did not ask to test your ability to plan a blood pipeline, but to examine your moral sensitivity. None of you asked whose blood will flow through the pipes, or who is asking to build it in the first place”.
Nowadays, Technion professors will sometimes hide ethically loaded questions in exams, my responsibility as a student was to find them and point them out instead of “just following orders” and answering them verbatim. So every time I faced a question on a problem-set or an exam, I spent a few seconds trying to decide if it’s a “blood pipeline” question or not. “Does this linked list represent a blood pipeline? Could this linear program be used to optimize the intake in a concentration camp?”. I don’t think I was ever actually presented with such a problem[1], but the blood pipeline was always in the back of my head.
In the language of ai alignment, you can say I was evaluation aware, maybe even evaluation paranoid. After leaving school, this habit didn’t really stick. I haven’t thought about the blood pipeline in years.
- ^
If I was, I didn’t notice
- ^
I like your theory. It would be interesting to see some mechanistic interpretability studies of this phenomena.
We Need to Get Serious about Uplift Studies
Yeah i totally agree, this is probably the right approach. I recently wrote about this type of benching.
From what i can tell, mirrorcode is much better. Im excited for it to be fully released.
Is ProgramBench Impossible?
This sounds interesting. I’d love to read your post.
chemical weapons were used in battle in ww1 but not in ww2. Why do you think that is?
This is really cool. How big do you think mythos is?
Theodor Herzl
Herzl dedicated his life to establishing a “national home” for the jews. He pretty much single handedly founded modern-zionism and turned it from a fringe (almost-sci-fi) idea to a mass movement. I think there is a strong argument that Israel wouldn’t exist without Herzl.
Cyborg evals
You’re gonna need a bigger boat (benchmark), METR
I loved the cartoon.
I really enjoyed the post. In particular, I loved this part below:
```
In some sense, reward hacking coming from RL shouldn’t be too surprising, as RL trains models to take actions that get high reward, and reward hacking is sometimes a good strategy for getting high reward. More concretely, RL may teach models certain behavioral traits that make the models more prone to reward hacks. RL training can teach models to be more persistent, to find creative solutions, to try solutions that are unlikely to work if there is no better alternative, and potentially to even think about how they are evaluated, as these traits are useful towards accomplishing tasks in a diversity of environments.
```
It made something[1] really click for me.
Not sure what to call it exactly, maybe: “generalization of reward hacking” / “Reard Seeker Selection”