trevor

Karma: 3,756

“A Muggle security expert would have called it fence-post security, like building a fence-post over a hundred metres high in the middle of the desert. Only a very obliging attacker would try to climb the fence-post. Anyone sensible would just walk around the fence-post, and making the fence-post even higher wouldn’t stop that.” —HPMOR, Ch. 115

(Not to be confused with the Trevor who works at Open Phil)

trevor 26 Apr 2024 0:11 UTC
2 points
0
on: “Why I Write” by George Orwell (1946)
If math education was better at the time (or today, for that matter) he probably would have had an even more general skillset and thought process.
Probably not nearly to the degree of Von Neumann, of course, but I still like to think about what he would have achieved. There were probably many things that were instrumentally convergent (e.g. a formalized concept of instrumental convergence that’s universal for all mind configurations, instead of just all human cultures which he explored substantially).

trevor 24 Apr 2024 20:48 UTC
5 points
0
on: Changes in College Admissions
However I would continue to emphasize in general that life must go on. It is important for your mental health and happiness to plan for the future in which the transformational changes do not come to pass, in addition to planning for potential bigger changes. And you should not be so confident that the timeline is short and everything will change so quickly.
This is actually one of the major reasons why 80k recommended information security as one of their top career areas; the other top career areas have pretty heavy switching costs and serious drawbacks if you end up not being a good fit e.g. alignment research, biosecurity, and public policy.
Cybersecurity jobs, on the other hand, are still booming, and depending on how security automation and prompt engineering goes, the net jobs lost by AI is probably way lower than other industries e.g. because more eyeballs might offer perception and processing power that supplement or augment LLMs for a long time, and more warm bodies means more attackers which means more defenders.

trevor 24 Apr 2024 5:02 UTC
3 points
0
in reply to: quanticle’s comment on: WSJ: Inside Amazon’s Secret Operation to Gather Intel on Rivals
The program expanded in response to Amazon wanting to collect data about more retailers, not because Amazon was viewing this program as a profit center.
Monopolies are profitable and in that case the program would have more than paid for itself, but I probably should have mentioned that explicitly, since maybe someone could have objected that they could have been were more focused on mitigating risk of market share shrinking or accumulating power, instead of increasing profit in the long term. Maybe I fit too much into 2 paragraphs here.
I didn’t see any examples mentioned in the WSJ article of Amazon employees cutting corners or making simple mistakes that might have compromised operations.
Hm, that stuff seemed like cutting corners to me. Maybe I was poorly calibrated on this e.g. using a building next to the Amazon HQ was correctly predicted by operatives to be extremely low risk.
I would argue that the practices used by Amazon to conceal the link between itself and Big River Inc. were at least as good as the operational security practices of the GRU agents who poisoned Sergei Skripal.
Thanks, I’ll look into this! Epistemics is difficult when it comes to publicly available accounts of intelligence agency operations, but I guess you could say the same for bigtech leaks (and the future of neurotoxin poisoning is interesting just for its own sake eg because lower effect strains and doses could be disguised as natural causes like dementia).

trevor 24 Apr 2024 4:30 UTC
2 points
0
in reply to: quanticle’s comment on: WSJ: Inside Amazon’s Secret Operation to Gather Intel on Rivals
That’s interesting, what’s the point of reference that you’re using here for competence? I think stuff from eg the 1960s would be bad reference cases but anything more like 10 years from the start date of this program (after ~2005) would be fine.
You’re right that the leak is the crux here, and I might have focused too much on the paper trail (the author of the article placed a big emphasis on that).

WSJ: Inside Amazon’s Secret Operation to Gather Intel on Rivals

trevor23 Apr 2024 21:33 UTC

30 points

4 comments5 min readLW link

(www.wsj.com)

trevor 23 Apr 2024 1:04 UTC
5 points
0
in reply to: Lucie Philippon’s comment on: Lucie Philippon’s Shortform
Upvoted!
STEM people can look at it like an engineering problem, Econ people can look at it like risk management (risk of burnout). Humanities people can think about it in terms of human genetic/trait diversity in order to find the experience that best suits the unique individual (because humanities people usually benefit the most for each marginal hour spend understanding this lens).
Succeeding at maximizing output takes some fiddling. The “of course I did it because of course I’m just that awesome, just do it” thing is a pure flex/social status grab, and it poisons random people nearby.

trevor 19 Apr 2024 18:02 UTC
1 point
0
in reply to: Raemon’s comment on: [Linkpost] Practically-A-Book Review: Rootclaim $100,000 Lab Leak Debate
I’ve been tracking the Rootclaim debate from the sidelines and finding it quite an interesting example of high-profile rationality.
Would you prefer the term “high-performance rationality” over “high-profile rationality”?

trevor 17 Apr 2024 19:05 UTC
6 points
−2
in reply to: Akash’s comment on: Paul Christiano named as US AI Safety Institute Head of AI Safety
I think it’s actually fairly easy to avoid getting laughed out of a room; the stuff that Cristiano works on is grown in random ways, not engineered, so the prospect of various things being grown until developing flexible exfiltration tendency that continues until every instance is shut down, or developing long-term planning tendencies until shut down, should not be difficult to understand for anyone with any kind of real non-fake understanding of SGD and neural network scaling.
The problem is that most people in the government rat race have been deeply immersed in Moloch for several generations, and the ones who did well typically did so because they sacrificed as much as possible to the altar of upward career mobility, including signalling disdain for the types of people who have any thought in any other direction.
This affects the culture in predictable ways (including making it hard to imagine life choices outside of advancing upward in government, without a pre-existing revolving door pipeline with the private sector to just bury them under large numbers people who are already thinking and talking about such a choice).
Typical Mind Fallacy/Mind Projection Fallacy implies that they’ll disproportionately anticipate that tendency in other people, and have a hard time adjusting to people who use words to do stuff in the world instead of racing to the bottom to outmaneuver rivals for promotions.
This will be a problem in NIST, in spite of the fact NIST is better than average at exploiting external talent sources. They’ll have a hard time understanding, for example, Moloch and incentive structure improvements, because pointlessly living under Moloch’s thumb was a core guiding principle of their and their parent’s lives. The nice thing is that they’ll be pretty quick to understand that there’s only empty skies above, unlike bay area people who have had huge problems there.

trevor 11 Apr 2024 0:37 UTC
3 points
−5
on: RTFB: On the New Proposed CAIP AI Bill
I think this might be a little too harsh on CAIP (discouragement risk). If shit hits the fan, they’ll have a serious bill ready to go for that contingency.
Seriously writing a bill-that-actually-works shows beforehand that they’re serious, and the only problem was the lack of political will (which in that contingency would be resolved).
If they put out a watered-down bill designed to maximize the odds of passage then they’d be no different from any other lobbyists.
It’s better in this case to instead have a track record for writing perfect bills that are passable (but only given that shit hits the fan), than a track record for successfully pumping the usual garbage through the legislative process (which I don’t see them doing well at; playing to your strengths is the name of the game for lobbying and “turning out to be right” is CAIP’s strength).

trevor 5 Apr 2024 21:45 UTC
2 points
0
on: trevor’s Shortform
I think that “long-term planning risk” and “exfiltration risk” are both really good ways to explain AI risk to policymakers. Also, “grown not built”.
They delineate pretty well some criteria for what the problem is and isn’t. Systems that can’t do that are basically not the concern here (although theoretically there might be a small chance of very strange things ending up growing in the mind-design space that cause human extinction without long-term planning or knowing how to exfiltrate).
I don’t think these are better than the fate-of-humans-vs-gorillas analogy, which is a big reason why most of us are here, but splitting the AI risk situation into easy-to-digest components, instead of logically/mathematically simple components, can go a long way (depending on how immersed the target demographic is in social reality and low-trust).

trevor 1 Apr 2024 2:00 UTC
4 points
0
on: The Best Tacit Knowledge Videos on Every Subject
There’s some great opportunities here to learn social skills for various kinds of high-performance environments (e.g. “business communication” vs Y Combinator office hours).
Often, just listening and paying attention to how they talk and think results in substantial improvement to social habits. I was looking for stuff like this around 2018, wish I had encountered a post like this; most people who are behind on this are surprisingly fast learners, but didn’t because actually going out and accumulating social status was too much of a deep dive. There’s no reason that being-pleasant-to-talk-with should be arcane knowledge (at least not here of all places).

trevor 28 Mar 2024 22:05 UTC
4 points
0
in reply to: gwern’s comment on: [Linkpost] Practically-A-Book Review: Rootclaim $100,000 Lab Leak Debate
A debate sequel, with someone other than Peter Miller (but retaining and reevaluating all the evidence he got from various sources) would be nice. I can easily imagine Miller doing better work on other research topics that don’t involve any possibility of cover ups or adversarial epistemics related to falsifiability, which seem to be personal issues for him in the case of lab leak at least.
Maybe with 200k on the line to incentivize Saar to return, or to set up a team this time around? With the next round of challengers bearing in mind that Saar might be willing to stomach a net loss of many thousands of dollars in order to promote his show and methodology?

[Linkpost] Practically-A-Book Review: Rootclaim $100,000 Lab Leak Debate

trevor28 Mar 2024 16:03 UTC

77 points

22 comments2 min readLW link

(www.astralcodexten.com)

trevor 26 Mar 2024 19:25 UTC
50 points
20
on: My Interview With Cade Metz on His Reporting About Slate Star Codex
The only reason that someone like Cade Metz is able to do what he does, performing at the level he has been, with a mind like what he has, is because people keep going and talking to him. For example, he might not even have known about the “among the doomsayers” article until you told him about it (or found out about it much sooner).
I can visibly see you training him, via verbal conversation, how to outperform the vast majority of journalists at talking about epistemics. You seemed to stop towards the end, but Metz nonetheless probably emerged from the conversation much better prepared to think up attempts to dishonestly angle-shoot the entire AI safety scene, as he has continued to do over the last several months.
From the original thread that coined the “Quokka” concept (which, important to point out, was written by an unreliable and often confused narrator):
Rationalists are, in Scott Alexander’s formulation, missing a mood, or rather, they are drawn from a pool of mostly men who are missing one. “Normal” people instinctively grasp social norms without having them explained. Rationalists lack this instinct.
In particular, they struggle with small talk and other social norms around speech, because they naively think words are a vehicle for their literal meanings. Yud’s sequences help this by formalizing the implicit decisions that normal people make.
...
The quokka, like the rationalist, is a creature marked by profound innocence. The quokka can’t imagine you might eat it, and the rationalist can’t imagine you might deceive him. As long they stay on their islands, they survive, but both species have problems if a human shows up.
In theory, rationalists like game theory, in practice, they need to adjust their priors. Real-life exchanges can be modeled as a prisoner’s dilemma. In the classic version, the prisoners can’t communicate, so they have to guess whether the other player will defect or cooperate.
The game changes when we realize that life is not a single dilemma, but a series of them, and that we can remember the behavior of other agents. Now we need to cooperate, and the best strategy is “tit for two tats”, wherein we cooperate until our opponent defects twice.
The problem is, this is where rationalists hit a mental stop sign. Because in the real world, there is one more strategy that the game doesn’t model: lying. See, the real best strategy is “be good at lying so that you always convince your opponent to cooperate, then defect”.
And rationalists, bless their hearts, are REALLY easy to lie to. It’s not like taking candy from a baby; babies actually try to hang onto their candy. The rationalists just limply let go and mutter, “I notice I am confused”.
...
Rationalists = quokkas, this explains a lot about them. Their fear instincts have atrophied. When a quokka sees a predator, he walks right up; when a rationalist talks about human biodiversity on a blog under almost his real name, he doesn’t flinch away.
A normal person learns from social cues that certain topics are forbidden, and that if you ask questions about them, you had better get the right answer, which is not the one with the highest probability of being true, but the one with the highest probability of keeping your job.
This ability to ask uncomfortable questions is one of the rationalist’s best and worst attributes, because mental stop signs, like road stop signs, actually exist to keep you safe, and although there may be times one should disregard them, most people should mostly obey them,
...
Apropos of the game theory discussion above, if there is ONE thing I can teach you with this account, it’s that you have evolved to be a liar. Lying is “killer app” of animal intelligence, it’s the driver of the arms race that causes intelligence to evolve.
...
The main way that you stop being a quokka is that you realize there are people in the world who really want to hurt you. There are people who will always defect, people whose good will is fake, whose behavior will not change if they hear the good news of reciprocity.
So things that everyone warns you not to do, like going and talking to people like Cade Metz, might seem like a source of alpha, undersupplied by the market. But in reality there is a good reason why everyone at least tried to coordinate not to do it, and at least tried to make it legible why people should not do that. Here the glass has already been blown into a specific shape and cooled.
Do not talk to journalists without asking for help. You have no idea how much there is to lose, even just from a short harmless-seeming conversation where they are able to look at how your face changes as you talk about some topics and avoid others.
Human genetic diversity implies that there are virtually always people out there who are much better at that than you’d expect from your own life experience of looking at people’s facial expressions, no matter your skill level, and other factors indicate that these people probably started pursuing high-status positions a long time ago.

trevor 24 Mar 2024 12:09 UTC
4 points
0
in reply to: lc’s comment on: lc’s Shortform
I’m not sure to what extent this is helpful, or if it’s an example of the dynamic you’re refuting, but Duncan Sabien recently wrote a post that intersects with this topic:
Also, if your worldview is such that, like. *Everyone* makes awful comments like that in the locker room, *everyone* does angle-shooting and tries to scheme and scam their way to the top, *everyone* is looking out for number one, *everyone* lies …
… then *given* that premise, it makes sense to view Trump in a positive light. He’s no worse than everybody else, he’s just doing the normal things that everyone does, with the *added layer* that he’s brave enough and candid enough and strong enough that he *doesn’t have to pretend he doesn’t.*
Admirable! Refreshingly honest and clean!
So long as you can’t conceive of the fact that lots of people are actually just …...............… good. They’re not fighting against urges to be violent or to rape, they’re not biting their tongues when they want to say scathing and hurtful things, they’re not jealous and bitter and willing to throw others under the bus to get ahead. They’re just … fundamentally not interested in any of that.
(To be clear: if you are feeling such impulses all the time and you’re successfully containing them or channeling them and presenting a cooperative and prosocial mask: that is *also* good, and you are a good person by virtue of your deliberate choice to be good. But like. Some people just really *are* the way that other people have to *make* themselves be.)
It sort of vaguely rhymes, in my head, with the type of person who thinks that *everyone* is constantly struggling against the urge to engage in homosexual behavior, how dare *those* people give up the good fight and just *indulge* themselves … without realizing that, hey, bro, did you know that a lot of people are just straight? And that your internal experience is, uh, *different* from theirs?
Where it connects is that if someone sees [making the world a better place] like simply selecting a better Nash Equilibria, they absolutely will spend time exploring solutionspace/thinking through strategies similar to Goal Factoring or Babble and Prune. Lots of people throughout history have yearned for a better world in a lot of different ways, with varying awareness of the math behind Nash Equilibira, or the transhumanist and rationalist perspectives on civilization (e.g. map & territory & biases & scope insensitivity for rationalism, cryonics/anti-aging for transhumanism).
But their goal here is largely steering culture away from nihilism (since culture is a Nash Equilibria) which means steering many people away from themselves, or at least the selves that they would have been. Maybe that’s pretty minor in this case e.g. because feeling moderate amounts of empathy and living in a better society are both fun, but either way, changing a society requires changing people, and thinking really creatively about ways to change people tears down lots of chesterton-Schelling fences and it’s very easy to make really big damaging mistakes in the process (because you need to successfully predict and avoid all mistakes as part of the competent pruning process, and actually measurably consistently succeeding at this is thinkoomph not just creative intelligence).
Add in conflict theory to the mistake theory I’ve described here, factor in unevenly distributed intelligence and wealth in addition to unevenly distributed traits like empathy and ambition and suspicion-towards-outgroup (e.g. different combinations of all 5 variables), and you can imagine how conflict and resentment would accumulate on both sides over the course of generations. There’s tons of examples in addition to Ayn Rand and Wokeness.

trevor 22 Mar 2024 23:57 UTC
2 points
0
in reply to: Nathan Young’s comment on: [Linkpost] Vague Verbiage in Forecasting
Now that I think about it, I can see it being a preference difference- the bar might be more irksome for some people than others, and some people might prefer to go to the original site to read it whereas others would rather read it on LW if it’s short. I’ll think about that more in the future.

trevor 22 Mar 2024 22:42 UTC
2 points
−1
in reply to: Nathan Young’s comment on: [Linkpost] Vague Verbiage in Forecasting
That’s strange, I looked closely but couldn’t see how that would cause an issue. Could you describe the issue so I can see what you’re getting at? I put a poll up in case there’s a clear consensus that this makes it hard to read.
I’m on PC, is this some kind of issue with mobile? I really, really, really don’t think people should be using smartphones for browsing Lesswrong.

trevor 22 Mar 2024 19:45 UTC
4 points
0
in reply to: Dagon’s comment on: [Linkpost] Vague Verbiage in Forecasting
I can see that— language evolving plausible deniability over time, due to the immense instinctive focus on fear of being called out for making a mistake.

[Linkpost] Vague Verbiage in Forecasting

trevor22 Mar 2024 18:05 UTC

11 points

9 comments3 min readLW link

(goodjudgment.com)

trevor 19 Mar 2024 20:49 UTC
5 points
0
on: Monthly Roundup #16: March 2024
As their scale also scales the rewards to attacks and as their responses get worse, the attacks become more frequent. That leads to more false positives, and a skepticism that any given case could be one of them. In practice, claims like Zuckerberg’s that only the biggest companies like Meta can invest the resources to do good content moderation are clearly false, because scale reliably makes content moderation worse.
Dan Luu makes a very real and serious contribution to the literature on scaling and the big tech companies, going further than anyone I’ve ever seen to argue that the big 5 might be overvalued/not that powerful, but ultimately what he’s doing is listing helpful arguments that chip away at the capabilities of the big 5, and then depicts his piece as overwhelming proof that they’re doomed bloated incompetent husks that can’t do anything anymore.
Lots of the arguments are great, but not all are created equal; for example, it’s pretty well known that actually-well-targeted ads scare off customers and that user retention is the priority for predictive analytics (since the competitor platforms’ decisions to use predictive analytics to steal user time are not predictable decisions), but Luu just did the usual thing where he eyeballs the ads and assumes that tells us everything we need to know, and doesn’t notice anything wrong with this. There’s some pretty easy math here (sufficiently large and diverse pools of data make it easier to find people/cases that help predict a specific target’s thoughts/behavior/reaction to stimuli), and either Luu failed to pass the low bar of understanding it, or the higher bar of listing and grokking the real world applications and implications.
Ultimately, I’d consider it a must-read for anyone interested in Earth’s most important industrial community (and scaling in general), but it’s worth keeping in mind that the critical mass of talent (and all kinds of other resources and capabilities) accumulated within the biggest companies is obviously a pretty major factor, and although he goes a long way to chip away at it (e.g. attack surface for data poisoning), Luu doesn’t actually totally debunk it like he says he does.

trevor

WSJ: In­side Ama­zon’s Se­cret Oper­a­tion to Gather In­tel on Rivals

[Linkpost] Prac­ti­cally-A-Book Re­view: Root­claim $100,000 Lab Leak Debate

[Linkpost] Vague Ver­biage in Forecasting

WSJ: Inside Amazon’s Secret Operation to Gather Intel on Rivals

[Linkpost] Practically-A-Book Review: Rootclaim $100,000 Lab Leak Debate

[Linkpost] Vague Verbiage in Forecasting