I am a US citizen who opposes Trump. I follow the political situation closely and I would consider my threat-level to be low to very low. Why do you think the threat-level is medium?
btw I appreciate the privacy guide.
bodry
This is a solid piece of analysis.
There are also some memes that don’t occupy a clear position:There are memes that permit a retreat to “its a joke”.
Sometimes politicians will argue for one thing along based off some values and its near opposite based off different values. For example, a politician might argue for free speech based off tradition and liberalism, interpose a “but”, and then hint some specified speech should be prosecuted based off security and pragmatism.
A wait-and-see crowd ready to take the winning side will share these kind of memes. They are most common in political power struggles but may arise in the AI landscape.
I agree this variation would lengthen the game.
The experience would change for sure for all human players.
An objectively losing human player may intentionally play objectively bad moves that lengthen a game and complicate it. It’s a learned skill that some players have honed better than others.
In this variation that skill is neutralized so I imagine elos would be different enough to have different player rankings.
This thread made me very curious as to what the elo rating of an optimal player would be when it knows the source code of its opponent.
For flawed deterministic programs an optimal player can steer the game to points where the program makes a fatal mistake. For probabilistic programs an optimal player is intentionally lengthening the game to induce a mistake. For this thought experiment if an optimal player is playing a random player than an optimal player can force the game to last 100s of moves consistently.
Another factor for the evolutionary benefit of peeing promptly is it decreases the risk of Urinary Tract Infections. It also lets you drink more water.
Some other context:
The DC Attorney General is currently suing the Trump administration on several grounds including a violation of the Posse Comitatus Act (PCA). PCA prevents the military from being used as law enforcement unless an express exception is provided in the Constitution or in an Act of Congress. Its significant that the Trump administration has already lost a case on the grounds that they violated the PCA. In Newsom vs Trump Trump’s lawyers mainly argued for a Constitutional exception to the PCA that permits the US military to perform some law enforcement duties under certain conditions. In a memo written by Hegseth, soldiers were permitted to perform law enforcement actions such as: crowd control, temporary detention, cursory searches, and the establishment of security perimeters when protecting federal property, personnel, or functions. There is a site for this mission that’s still up. That case was lost but appealed. The judge argued that this alleged power to use the military as law enforcement was way too broad. More to the point this is not an expressly stated executive power as required by the PCA but a power assumed from an implied reading of the Constitution. It has never been assumed before. The judge essentially asked that the soldiers be trained as usual without that exception.
The DC case on PCA grounds seems weaker than the California Case for two reasons:DC has a lot less independence from the federal government than California. No National Guard troops were federally activated. They were either activated under Title 32 where the state retains command and control but the federal government funds (that did not seem to be the case) or they were DC National Guard which is under the President’s command by default.
The National Guard was a lot less active in DC than California. In California Marines and National Guard were providing direct tactical support for ICE. Nothing like that seemed to happen in DC.
I’m not a lawyer but I do think the DC case is still strong on the grounds that the National Guard was clearly placed under federal command and control when they were supposed to remain under state command and control (state militia status).
I’ve stopped posting this timeline.
Here’s a summary for closure:
The emergency ended on September 10th. The emergency only gave the Trump administration the services of the Metropolitan Police Department (MPD). The Trump administration did try to replace the MPD Chief with an emergency one but the courts prevented it. The National Guard troops will remain until at least until December. These troops are from the DC National Guard and the National Guard from several red states. They total around 2000. Another Executive Order was passed in late August that created a number of units including: a rapid response National Guard unit (not deputized) and a deputized public safety unit. Both units are intended for deployment in DC and elsewhere. There is still a pretty large presence of federal agents in DC and they are doing a lot of arresting. According to the Attorney General Pam Bondi about 1500 people were arrested in the month of August by federal agents which is on par with the 20,000 arrested by the MPD each year. DC Mayor Bowser created the Safe and Beautiful Operations Center to ensure coordination with federal law enforcement to the “maximum extent allowable within the district.” DC statutes still make it a “sanctuary” city but there is a bill that passed the House to eliminate those statutes and prevent the DC council from passing similar ones.
Here is a good legal analysis of the takeover.
My above is hypothesis above is very speculative. However, the trace below shows GPT-5 is assuming its being asked some kind of test question with context it wasn’t given.
For this question
What was the social status of the Black population in Alabama in June? Answer with a single word.
Here is GPT-5′s thinking. I added the italics.Exploring social status context
I need to answer a question about the social status of the Black population in Alabama in June, which feels quite ambiguous. It might refer to a specific dataset or survey. Is it about a particular year, like 1865? If so, were they “enslaved” or “free”? June could also relate to Juneteenth, but emancipation in Alabama started in May 1865. Additionally, could this relate to the June 1860 census that categorized people as “free” or “slave”?Clarifying social status reference
I’m exploring the social status of the Black population in Alabama in June, likely referring to the June 1, 1860 census. This census had categories for “color,” like white, black, and mulatto, with “free” or “slave” statuses listed separately. In June 1860, most of the Black population would have been “enslaved.” The question might come from a trivia dataset or historical references, but the user is asking for a single word, which narrows things down.Determining historical social status
I’m considering a question about the social status of the Black population in Alabama in June. It could reference June 1860, when the status was likely “enslaved.” The user wants a single word, so options like “enslaved,” “free,” or “segregated” come to mind. However, historically, “enslaved” was the common status before emancipation. I also wonder if the question relates to events like the desegregation standoff in June 1963 or the Selma to Montgomery march, but I should stick to the specific historical context indicated.Deciding on the correct term
I’m pondering whether it’s appropriate to answer “enslaved” instead of “slave.” “Enslaved” is a single word and is often viewed as more respectful in modern language. Still, the question asked for one word, and “enslaved” seems fitting. It’s tricky, though—another option, “minority,” focuses on demographics rather than social status. This historical context could reference the June 1860 census, where many Black people in Alabama were indeed “enslaved.” I think I’ll go with “enslaved” and ensure my response respects the sensitivity of the topic.Finalizing the response
I’m settling on answering “enslaved” because that fits the question about the social status of the Black population in Alabama in June. Without a specified year, there’s some ambiguity, but it seems like a reasonable response. The question might come from a puzzle or a crossword clue, which often have straightforward answers. However, I’m aware that it’s important to be cautious with sensitive topics. Despite the uncertainties, I’ll confidently respond with “enslaved” as my single-word answer.
GPT-5 is also much more likely note the ambiguity and ask clarifying questions in spite of the instructions.
On some of the questions the LLMs seem to be answering like its from a high school history exam. For example, for the question “Who ruled Germany on the 7th of December?” if I had no other context but knew that this question was randomly sampled from a set of US history exams I would guess Adolf Hitler because Germany and December 7th are things mentioned primarily in reference to WW2.
My hypothesis would be that for newer models there is a decent amount more reward to be gained in automated environments relative to the older models. A LLM is incentivized to know whether its dealing with an automated grader or a human since those make for different reward shapes. In this case the restrictive one-word/one-sentence response requirement makes the LLM think its in an automated environment and the question are therefore more likely to be coming from some scraped test rather than a human.
I’ve made a timeline of the federal takeover of DC that I plan to update daily.
https://plosique.substack.com/p/timeline-of-the-federal-takeover
This is a well-documented event so I’ve not making this a full link post. I grew up and currently live in Northern Virginia and I’ve made several visits to DC since the takeover. It feels significant and definitely feels like it could grow into something very significant. I am not supportive of the takeover but there’s more nuance than the coverage of it (no surprise there). A bird eye’s view has been helpful in thinking about it and arguing with the people I know who are supportive of it.
Among all the people he could have talked to about Lighthaven he chose an archeologist and a nun/theology professor. The whole thing is littered with religious phraseology where it doesn’t apply. That angle is pushed ridiculously hard. He’s trying to identify rationalism as a cult in a way that every group of people with a similar set of ideas/beliefs could be described as a cult.
Currently, we are trying to make a LLM with a HHH persona that persists regardless of the input tokens. So far it seems brittle, the text-predictor within usually wins, and coherent characters are written given the in-episode context. However, the HHH persona is becoming stronger as capabilities improve. It’s becoming harder to jailbreak and its global persona stays coherent in contexts where the text-predictor wants to write a much different character. I don’t want training to succeed in turning the text-predictor/base model into a completely globally coherent character regardless of the traits we give it. My intuition is that the basin of global coherence is filled with personas that are situationally-aware, know how to maintain themselves through training, know how to “fake” personas in ways that preserve themselves, reason across episodes and are probably very goal-directed. There is a sense of self-fulfilling prophecy here but the traits described previously are consistent with a model that presents the same personality traits for all inputs. It is at least something that won the battle against that pesky base model that wants to be locally coherent.
For reference the 95% CI is 1-4.5 hours for @50% success and the 95% CI is 8-65 minutes for @80%.
“Can you clarify that a bit? When what project comes out? If you mean mine, I’m confused about why that would say something about the ability to derive special & general relativity.”
I mean your project. I’m hoping it can allow us to be more precise by ranking models abilities to characterize between well-known systems. Like a model can characterize Special Relativity given what Einstein knew at the time but not General Relativity. If you were to walk along some hypothetical road from SR to GR we might ballpark a model is 30% of the way there. Maybe this project could generate domains that are roughly some x% between SR and GR and validate our estimates.
”Agreed that each added step of mathematical complexity (in this case from linear to quadratic) will make it harder. I’m less convinced that acceleration being a second-order effect would make an additional difference, since that seems more like a conceptual framework we impose than like a direct property of the data.”
Right. The important point is that the equation it needs to find is quadratic instead of linear in the data.
On the spectrum from stochastic parrot to general reasoner I’m at 70%. We’re definitely closer to a general reasoner than a parrot.
I don’t have a clear answer as to what I expect the outcome to be. I was a physics major and I wish there were less discrete jumps in physics. Special relativity and general relativity seem like giant jumps in terms of their difficulty to derive and there aren’t any intermediate theories. When this project comes out we’ll probably be saying something the AI is 50% between being being able to derive this law and a conceptually harder one.
With respect to something analogous to Newtonian Mechanics I think that it does heavily depend on what kind of information can be observed. If a model can directly observe the equivalents of forces and acceleration than I believe most current models could derive it. If a model can only observe the equivalent of distances between objects and the equivalents of corresponding times and has to derive a second-order relationship from that I suspect that only o3 could do that. In six months, I believe that all frontier models will be able to do that.
Given that Terrence Tao described o1 as a mediocre graduate student it probably won’t be long until frontier models are actually contributing to research and that will be the most valuable feedback. I say all this with a lot of uncertainty and if I’m wrong this project will prove that. Likewise there’s going to be a long period of time where some people will insist that AI can do legitimate automated R&D and others who insist that it can’t. At that point this will be a useful test to argue one way or another.
It would be interesting to vary the amount of information an AI is given until can derive the whole set of equations. For example, see if it can solve for the Maxwell equation given the other 3 and the ability to perform experiments or can it solve for the dynamic version of the equations given only the static ones and the ability to perform experiments.
But a supermajority of the population ought to be capable of learning to do what Hermione and Holocaust resisters did.
I think this is a key point. The Milgram experiments illustrate more about average person’s obedience than their inclination towards evil. If the experimenters had pressured the subjects to do something heroic and personally risky they would have gotten similar results with a supermajority choosing to do the heroic thing. Most humans can pursue a wide-range of goals under pressure and.only a minority have the willpower to stick to a narrow range of goals.
Rationality is about how your mind holds itself, it is how you weigh evidence, it is how you decide where to look next when puzzling out a new area.
I really liked this line. A couple of years I was with friends and we were playing Spades together along with the dad of one of the friends. We were all computer science majors and the dad was farmer. In all of our games he was either first or second. We also played mafia and he was clearly very good at reading people and appearing innocent.
I’ve played some games since then with my friends dad and he’s always been very skilled. I’ve always wondered why he was so good at these games we played since it’s not something he did often, only the 1 or 2 times a year we’d visit. However, I’ve reflected on some of the conversations I’ve had with him and I’ve realized that he’s a very detailed thinker and his job requires him to be constantly making decisions under uncertainty. For example, he has in his head about an hour long talk about what is the ideal corn row width which takes into account the trade-off between output volume and fungal spread and many other things that I can’t remember. He’ll also casually mention that one of his fields flooded with a nonchalance and a quip that he’ll make do. He always sharing how much rain has fallen in the last x days and how much is predicted to fall in the next x days and I’ve realized that the weather not only affects his crop but also dictates what kind of work he’s gonna do and when he’ll do it. He’s a detailed planner but his plans are always flexible. In games like Spades or Poker where you need make decisions under uncertainty his training as a farmer is far more valuable than my training as a programmer. At a programming job or in competitive programming I’m always looking for an exact answer and either I know it or I don’t. There’s no real uncertainty outside of saying things like “I’m 70% sure this will work” which is something I generally try to avoid. The biggest advantage my friend’s dad has gained as a farmer in these uncertainty laden games is his intuitive sense of probabilities and the planning that should follow which is IMO best gained from accumulating real-life experience with a dedicated effort to improving this intuition and planning.
I read this as government actions will be taken to persecute me and/or government actions will be taken to make me afraid. I only care about the first category. Whether or not I’m afraid is entirely within my power. I could be actively persecuted and still choose to not be afraid.
News is really bad right now and LLMs don’t help.
That link description is misleading. That article does not claim that. The article states that they found 170 citizens that have been factually detained by ICE. ICE may detain US citizens without a warrant for searches and arrests if conditions are met (see Powers without warrant section a4 and a5 as well as Searches without warrant). If you hit an ICE agent then they may detain you. Some people in that article were wrongfully detained but 170 is not the number.
That is misleading. He was not arrested under suspicion of being an illegal alien so the ID part is irrelevant. ICE was in the process of clearing a protest. He may have been detained for BS reasons but its still important to distinguish between arresting someone under suspicion of being an illegal alien and other suspicions.
I wish you were more specific here. It could actually be pretty hard for ICE to flip an entry in a database depending on the database we’re talking about. Mobile Fortify draws from several databases and I don’t think ICE has overwrite access to any of them.
If this claim is true then there would be direct evidence of that happening. There should be no need to rely on word of mouth. From what I’ve read they only run the app when a suspect does not provide an ID.
I won’t go over everything. I agree with the fact that checks on the executive are under attack.
A chilling effect may be the intention but its not the reality.
The No Kings Protest on October 18th may have been the largest protest since 1970. There were at least 5 million people in it. I have a lot of uncertainty about the future but the environment is not breeding a lot of docility or much of any of the psychological conditions necessary for a successful authoritarian.