I am a PhD student in computer science at the University of Waterloo, supervised by Professor Ming Li and advised by Professor Marcus Hutter.
My current research is related to applications of algorithmic probability to sequential decision theory (universal artificial intelligence). Recently I have been trying to start a dialogue between the computational cognitive science and UAI communities. Sometimes I build robots, professionally or otherwise. Another hobby (and a personal favorite of my posts here) is the Sherlockian abduction master list, which is a crowdsourced project seeking to make “Sherlock Holmes” style inference feasible by compiling observational cues. Give it a read and see if you can contribute!
See my personal website colewyeth.com for an overview of my interests and work.
I do ~two types of writing, academic publications and (lesswrong) posts. With the former I try to be careful enough that I can stand by ~all (strong/central) claims in 10 years, usually by presenting a combination of theorems with rigorous proofs and only more conservative intuitive speculation. With the later, I try to learn enough by writing that I have changed my mind by the time I’m finished—and though I usually include an “epistemic status” to suggest my (final) degree of confidence before posting, the ensuing discussion often changes my mind again. As of mid-2025, I think that the chances of AGI in the next few years are high enough (though still <50%) that it’s best to focus on disseminating safety relevant research as rapidly as possible, so I’m focusing less on long-term goals like academic success and the associated incentives. That means most of my work will appear online in an unpolished form long before it is published.
It’s easy to criticize the type of “fake” planning that systematically avoids/defuses criticism rather than aiming for success i.e. appearing blameless for a loss versus actually trying to win. But I think there’s a lot of traction to be gained from refusing to lose in any silly way; it means you have to dovetail a lot of reasonable strategies, that you could be criticized for missing eg “why didn’t you just try…” Trying EVERY post facto obvious thing is harder than it sounds and quite powerful (time permitting). Universal search/learning algorithms including market mechanisms like logical induction tend to have this form! So I don’t think this heuristic comes (only) from social approval or moral mazes. In my own research process, I think of it as “being professional.” It’s not the same as being brilliant, but it’s much easier to pull off consistently (if slowly) and it gets results.