anonymousaisafety

Karma: 724

anonymousaisafety Apr 16, 2023, 3:05 AM
−17 points
1
in reply to: Duncan Sabien (Inactive)’s comment on: Moderation notes re: recent Said/Duncan threads
I see that reading comprehension was an issue for you, since it seems that you stopped reading my post halfway through. Funny how a similar thing occurred on my last post too. It’s almost like you think that the rules don’t apply to you, since everyone else is required to read every single word in your posts with meticulous accuracy, whereas you’re free to pick & choose at your whim.

anonymousaisafety Apr 16, 2023, 1:34 AM
8 points
4
in reply to: Duncan Sabien (Inactive)’s comment on: Moderation notes re: recent Said/Duncan threads
i.e. splitting hairs and swirling words around to create a perpetual motte-and-bailey fog that lets him endlessly nitpick and retreat and say contradictory things at different times using the same words, and pretending to a sort of principle/coherence/consistency that he does not actually evince.
Yeah, almost like splitting hairs around whether making the public statement “I now categorize Said as a liar” is meaningfully different than “Said is a liar”.
Or admonishing someone for taking a potshot at you when they said
However, I suspect that Duncan won’t like this idea, because he wants to maintain a motte-and-bailey where his posts are half-baked when someone criticizes them but fully-baked when it’s time to apportion status.
...while acting as though somehow that would have been less offensive if they had only added “I suspect” to the latter half of that sentence as well. Raise your hand if you think that “I suspect that you won’t like this idea, because I suspect that you have the emotional maturity of a child” is less offensive because it now represents an unambiguously true statement of an opinion rather than being misconstrued as a fact. A reasonable person would say “No, that’s obviously intended to be an insult”—almost as though there can be meaning beyond just the words as written.
The problem is that if we believe in your philosophy of constantly looking for the utmost literal interpretation of the written word, you’re tricking us into playing a meta-gamed, rules-lawyered, “Sovereign citizen”-esque debate instead of, what’s the word—oh, right, Steelmanning. Assuming charity from the other side. Seeking to find common ground.
For example, I can point out that Said clearly used the word “or” in their statement. Since reading comprehension seems to be an issue for a “median high-karma LWer” like yourself, I’ll bold it for you.
Said: Well, I think that “criticism”, in a context like this topic of discussion, certainly includes something like “pointing to a flaw or lacuna, or suggesting an important or even necessary avenue for improvement”.
Is it therefore consistent for “asking for examples” to be contained by that set, while likewise not being pointing to a flaw? Yes, because if we say that a thing is contained by a set of “A or B”, it could be “A”, or it could be “B”.
Now that we’ve done your useless exercise of playing with words, what have we achieved? Absolutely nothing, which is why games like these aren’t tolerated in real workplaces, since this is a waste of everyone’s time.
You are behaving in a seriously insufferable way right now.
Sorry, I meant—“I think that you are behaving in what feels like to me a seriously insufferable way right now, where by insufferable I mean having or showing unbearable arrogance or conceit”.

anonymousaisafety Apr 14, 2023, 10:54 PM
−2 points
12
in reply to: Duncan Sabien (Inactive)’s comment on: Moderation notes re: recent Said/Duncan threads
Yes, I have read your posts.
I note that in none of them did you take any part of the responsibility for escalating the disagreement to its current level of toxicity.
You have instead pointed out Said’s actions, and Said’s behavior, and the moderators lack of action, and how people “skim social points off the top”, etc.

anonymousaisafety Apr 14, 2023, 10:13 PM
47 points
37
in reply to: Duncan Sabien (Inactive)’s comment on: Moderation notes re: recent Said/Duncan threads
@Duncan_Sabien I didn’t actually upvote @clone of saturn’s post, but when I read it, I found myself agreeing with it.
I’ve read a lot of your posts over the past few days because of this disagreement. My most charitable description of what I’ve read would be “spirited” and “passionate”.
You strongly believe in a particular set of norms and want to teach everyone else. You welcome the feedback from your peers and excitedly embrace it, insofar as the dot product between a high-dimensional vector describing your norms and a similar vector describing the criticism is positive.
However, I’ve noticed that when someone actually disagrees with you—and I mean disagreement in the sense of “I believe that this claim rests on incorrect priors and is therefore false.”—I have been shocked by the level of animosity you’ve shown in your writing.
Full disclosure: I originally messaged the moderators in private about your behavior, but I’m now writing this in public because in part because of your continued statements on this thread that you’ve done nothing wrong.
I think that your responses over the past few days have been needlessly escalatory in a way that Said’s weren’t. If we go with the Socrates metaphor, Said is sitting there asking “why” over and over, but you’ve let emotions rule and leapt for violence (metaphorically, although you then did then publish a post about killing Socrates, so YMMV).
There will always be people who don’t communicate in a way that you’d prefer. It’s important (for a strong, functioning team) to handle that gracefully. It looks to me that you’ve become so self-convinced that your communication style is “correct” that you’ve taken a war path towards the people who won’t accept it—Zack and Said.
In a company, this is problematic because some of the things that you’re asking for are actually not possible for certain employees. Employees who have English as a second language, or who come from a different culture, or who may have autism, all might struggle with your requirements. As a concrete example, you wrote at length that saying “This is insane” is inflammatory in a way that “I think that this is insane” wouldn’t be—while I understand and appreciate the subtlety of that distinction, I also know that many people will view the difference between those statements as meaningless filler at best. I wrote some thoughts on that here: https://www.lesswrong.com/posts/9vjEavucqFnfSEvqk/on-aiming-for-convergence-on-truth?commentId=rGaKpCSkK6QnYBtD4
I believe that you are shutting down debates prematurely by casting your peers as antagonist towards you. In a corporate setting, as an engineer acquires more and more seniority, it becomes increasingly important for them to manage their emotions, because they’re a role model for junior engineers.
I do think that @Said Achmiz can improve their behavior too. In particular, I think Said could recognize that sometimes their posts are met with hostility, and rather than debating this particular point, they could gracefully disengage from a specific conversation when they determine that someone does not appreciate their contributions.
However, I worry that you, Duncan, are setting an increasingly poor example. I don’t know that I agree with the ability to ban users from posts. I think I lean more towards “ability to hide any posts from a user” as a feature, more than “prevent users from commenting”. That is to say, I think if you’re triggered by Said or Zack, then the site should offer you tools to hide those posts automatically. But I don’t think that you should be able to prevent Said or Zack from commenting on your posts, or prevent other commentators from seeing that criticism. In part, I agree strongly (and upvoted strongly) with @Wei_Dai’s point elsewhere in this thread that blocking posters means we can’t tell the difference between “no one criticized this” and “people who would criticize it couldn’t”, unless they write their own post, as @Zack_M_Davis did.

anonymousaisafety Apr 14, 2023, 12:37 AM
21 points
14
on: On “aiming for convergence on truth”
Sometimes when you work at a large tech-focused company, you’ll be pulled into a required-but-boring all-day HR meeting to discuss some asinine topic like “communication styles”.
If you’ve had the ~~misfortune~~ fun of attending one of those meetings, you might remember that the topic wasn’t about teaching a hypothetically “best” or “optimal” communication style. The goal was to teach employees how to recognize when you’re speaking to someone with a different communication style, and then how to tailor your understanding of what they’re saying with respect to them. For example, some people are more straightforward than others, so a piece of seemingly harsh criticism like “This won’t work for XYZ reason.” doesn’t mean that they disrespect you—they’re just not the type of person who would phrase that feedback as “I think that maybe we’ve neglected to consider the impact of XYZ on the design.”
I have read the many pages of debate on this current disagreement over the past few days. I have followed the many examples of linked posts that were intended to show bad behavior by one side or the other.
I think Zack and gjm have a good job at communicating with each other despite differences in their preferred communication styles, and in particular, I agree strongly with gjm’s analysis:
I think this is the purpose of Duncan’s proposed guideline 5. Don’t engage in that sort of adversarial behaviour where you want to win while the other party loses; aim at truth in a way that, if you are both aiming at truth, will get you both there. And don’t assume that the other party is being adversarial, unless you have to, because if you assume that then you’ll almost certainly start doing the same yourself; starting out with a presumption of good faith will make actual good faith more likely.
And then with Zack’s opinion:
That said, I don’t think there’s a unique solution for what the “right” norms are. Different rules might work better for different personality types, and run different risks of different failure modes (like nonsense aggressive status-fighting vs. nonsense passive-aggressive rules-lawyering). Compared to some people, I suppose I tend to be relatively comfortable with spaces where the rules err more on the side of “Punch, but be prepared to take a punch” rather than “Don’t punch anyone”—but I realize that that’s a fact about me, not a fact about the hidden Bayesian structure of reality. That’s why, in “‘Rationalist Discourse’ Is Like ‘Physicist Motors’”, I made an analogy between discourse norms and motors or martial arts—there are principles governing what can work, but there’s not going to be a unique motor, a one “correct” martial art.
I also agree with Zack when they said:
I’m unhappy with the absence of an audience-focused analogue of TEACH. In the following, I’ll use TEACH to refer to making someone believe X if X is right; whether the learner is the audience or the interlocutor B isn’t relevant to what I’m saying.
I seldom write comments with the intent of teaching a single person. My target audience is whoever is reading the posts, which is overwhelmingly going to be more than one person.
From Duncan, I agree with the following:
It is in fact usually the case that, when two people disagree, each one possesses some scrap of map that the other lacks; it’s relatively rare that one person is just right about everything and thoroughly understands and can conclusively dismiss all of the other person’s confusions or hesitations. If you are trying to see and understand what’s actually true, you should generally be hungry for those scraps of map that other people possess, and interested in seeing, understanding, and copying over those bits which you were missing.
Almost all of my comments tend to focus on a specific disagreement that I have with the broader community. That disagreement is due to some prior that I hold, that is not commonly held here.
And from Said, I agree with this:
Examples?
This community is especially prone to large, overly-wordy armchair philosophy about this-or-that with almost no substantial evidence that can tie the philosophy back down to Earth. Sometimes that philosophy gets camouflaged in a layer of pseudo-math; equations, lemmas, writing as if the post is demonstrating a concrete mathematical proof. To that end, focusing the community on providing examples is a valuable, useful piece of constructive feedback. I strongly disagree that this is an unfair burden on authors.
EDIT: I forgot to write an actual conclusion. Maybe “don’t expect everyone to communicate in the same way, even if we assume that all interested parties care about the truth”?
What links here?
- anonymousaisafety's comment on Moderation notes re: recent Said/Duncan threads by Raemon (Apr 14, 2023, 10:13 PM; 47 points)

anonymousaisafety Apr 11, 2023, 7:48 PM
5 points
1
in reply to: rotatingpaguro’s comment on: Is “Strong Coherence” Anti-Natural?
It seems to me that humans are more coherent and consequentialist than other animals. Humans are not perfectly coherent, but the direction is towards more coherence.
This isn’t a universally held view. Someone wrote a fairly compelling argument against it here: https://sohl-dickstein.github.io/2023/03/09/coherence.html

anonymousaisafety Apr 9, 2023, 7:17 PM
6 points
1
in reply to: quanticle’s comment on: The surprising parameter efficiency of vision models
We don’t do any of these things for diffusion models that output images, and yet these diffusion models manage to be much smaller than models that output words, while maintaining an even higher level of output quality. What is it about words that makes the task different?
I’m not sure that “even higher level of output quality” is actually true, but I recognize that it can be difficult to judge when an image generation model has succeeded. In particular, I think current image models are fairly bad at specifics in much the same way as early language models.
But I think the real problem is that we seem to still be stuck on “words”. When I ask GPT-4 a logic question, and it produces a grammatically correct sentence that answers the logic puzzle correctly, only part of that is related to “words”—the other part is a nebulous blob of reasoning.
I went all the way back to GPT-1 (117 million parameters) and tested next word prediction—specifically, I gave a bunch of prompts, and I looked for only if the very next word was what I would have expected. I think it’s incredibly good at that! Probably better than most humans.
Or are you suggesting that image generators could also be greatly improved by training minimal models, and then embedding those models within larger networks?
No, because this is already how image generators work. That’s what I said in my first post when I noted the architectural differences between image generators and language models. An image generator, as a system, consists of multiple models. There is a text → image space, and then an image space → image. The text → image space encoder is generally trained first, then it’s normally frozen during the training of the image decoder.^[1] Meanwhile, the image decoder is trained on a straightforward task: “given this image, predict the noise that was added”. In the actual system, that decoder is put into a loop to generate the final result. I’m requoting the relevant section of my first post below:
The reason why I’m discussing the network in the language of instructions, stack space, and loops is because I disagree with a blanket statement like “scale is all you need”. I think it’s obvious that scaling the neural network is a patch on the first two constraints, and scaling the training data is a patch on the third constraint.
This is also why I think that point #3 is relevant. If GPT-3 does so well because it’s using the sea of parameters for unrolled loops, then something like Stable Diffusion at 1/200th the size probably makes sense.
1. ^
  Refer to figure 2 in https://cdn.openai.com/papers/dall-e-2.pdf. Or read this:
  The trick here is that they decoupled the encoding from training the diffusion model. That way, the autoencoder can be trained to get the best image representation and then downstream several diffusion models can be trained on the so-called latent representation
  This is the idea that I’m saying could be applied to language models, or rather, to a thing that we want to demonstrate “general intelligence” in the form of reasoning / problem solving / Q&A / planning / etc. First train a LLM, then train a larger system with the LLM as a component within it.

anonymousaisafety Apr 9, 2023, 8:30 AM
2 points
0
in reply to: Nanda Ale’s comment on: The surprising parameter efficiency of vision models
Yes, it’s my understanding that OpenAI did this for GPT-4. It’s discussed in the system card PDF. They used early versions of GPT-4 to generate synthetic test data and also as an evaluator of GPT-4 responses.

anonymousaisafety Apr 9, 2023, 8:18 AM
10 points
1
in reply to: quanticle’s comment on: The surprising parameter efficiency of vision models
First, when we say “language model” and then we talk about the capabilities of that model for “standard question answering and factual recall tasks”, I worry that we’ve accidentally moved the goal posts on what a “language model” is.
Originally, a language model was a stochastic parrot. They were developed to answer questions like “given these words, what comes next?” or “given this sentence, with this unreadable word, what is the most likely candidate?” or “what are the most common words?”^[1] It was not a problem that required deep learning.
Then, we applied deep learning to it, because the path of history so far has been to take straightforward algorithms, replace them with a neural network, and see what happens. From that, we got … stochastic parrots! Randomizing the data makes perfect sense for that.
Then, we scaled it. And we scaled it more. And we scaled it more.
And now we’ve arrived at a thing we keep calling a “language model” due to history, but it isn’t a stochastic parrot anymore.
Second, I’m not saying “don’t randomize data”, I’m saying “use a tiered approach to training”. We would use all of the same techniques: randomization, masking, adversarial splits, etc. What we would not do is throw all of our data and all of our parameters into a single, monolithic model and expect that would be efficient.^[2] Instead, we’d first train a “minimal” LLM, then we’d use that LLM as a component within a larger NN, and we’d train that combined system (LLM + NN) on all of the test cases we care about for abstract reasoning / problem solving / planning / etc. It’s that combined system that I think would end up being vastly more efficient than current language models, because I suspect the majority of language model parameters are being used for embedding trivia that doesn’t contribute to the core capabilities we recognize as “general intelligence”.
1. ^
  This wasn’t for auto-complete, it was generally for things like automatic text transcription from images, audio, or videos. Spam detection was another use-case.
2. ^
  Recall that I’m trying to offer a hypothesis for why a system like GPT-3.5 takes so much training and has so many parameters and it still isn’t “competent” in all of the ways that a human is competent. I think “it is being trained in an inefficient way” is a reasonable answer to that question.

anonymousaisafety Apr 8, 2023, 11:25 PM
9 points
4
on: The surprising parameter efficiency of vision models
I suspect it is a combination of #3 and #5.
Regarding #5 first, I personally think that language models are being trained wrong. We’ll get OoM improvements when we stop randomizing the examples we show to models during training, and instead provide examples in a structured curriculum.
This isn’t a new thought, e.g. https://arxiv.org/abs/2101.10382
To be clear, I’m not saying that we must present easy examples first and then harder examples later. While that is what has been studied in the literature, I think we’d actually get better behavior by trying to order examples on a spectrum of “generalizes well” to “very specific, does not generalize” and then training in that order. Sometimes this might be equivalent to “easy examples first”, but that isn’t necessarily true.
I recognize that the definitions of “easy” and “generalizes” are nebulous, so I’m going to try and explain the reasoning that led me here.
Consider the architecture of transformers and feed-forward neural networks (specifically not recurrent neural networks). We’re given some input, and we produce some output. In a model like GPT, we’re auto-regressive, so as we produce our outputs, those outputs become part of the input during the next step. Each step is fundamentally a function $F (S_{1}) - > S_{2}$ .
Given some input, the total output can be thought as:
def reply_to(input):
output = “”
while True:
token = predict_next(input + output)
if token == STOP:
break
output += token
return output
We’d like to know exactly what `predict_next` is doing, but unfortunately, the programmer who wrote it seems to have done their implementation entirely in matrix math and they didn’t include any comments. In other words, it’s deeply cursed and not terribly different from the output of Simulink’s code generator.
def predict_next(input):
# … matrix math
return output
Let’s try to think about the capabilities and constraints on this function.
- There is no unbounded `loop` construct. The best we can do is approximate loops, e.g. by supporting an unrolled loop up to some bounded number of iterations. What determines the bounds? Probably the depth of the network?
- If the programmer were sufficiently deranged, they could implement `predict_next` in such a way that if they’ve hit the bottom of their unrolled loop, they could rely on the fact that `predict_next` will be called again, and continue their previous calculations during the next call. What would be the limitations on this? Probably the size of each hidden layer. If you wanted to figure out if this is happening, you’d want to look for prompts where the network can answer the prompt correctly if it is allowed to generate text before the answer (e.g. step-by-step explanations) but is unable to do so if asked to provide the answer without any associated explanations.
- How many total “instructions” can fit into this function? The size of the network seems like a decent guess. Unfortunately, the network conflates instructions and data, and the network must use all parameters available to it. This leads to trivial solutions where the network just over-fits to the data (analogous to baking in a lookup table on the stack). It’s not unsurprising that throwing OoM more data at a fixed size NN results in better generalization. Once you’re unable to cheat with over-fitting you must learn algorithms that work more efficiently.
The reason why I’m discussing the network in the language of instructions, stack space, and loops is because I disagree with a blanket statement like “scale is all you need”. I think it’s obvious that scaling the neural network is a patch on the first two constraints, and scaling the training data is a patch on the third constraint.
This is also why I think that point #3 is relevant. If GPT-3 does so well because it’s using the sea of parameters for unrolled loops, then something like Stable Diffusion at 1/200th the size probably makes sense.
To tie this back to point #5:
- We start with a giant corpus of data. On the order of “all written content available in digital form”. We might generate additional data in an automated fashion, or digitize books, or caption videos.
- We divide it into training data and test data.
- We train the network on random examples from the training data, and then verify on random examples from the test data. For simplicity, I’m glossing over various training techniques like masking data or connections between nodes.
- Then we fine-tune it, e.g with Q&A examples.
- And then generally we deploy it with some prompt engineering, e.g. prefixing queries with past transcript history, to fake a conversation.
At the end of this process, what do we have?
I want to emphasize that I do not think it is a “stochastic parrot”. I think it is very obvious that the final system has internalized actual algorithms (or at least, pseudo-algorithms due to the limitation on loops) for various tasks, given the fact that the size of the data set is significantly larger than the size of the model. I think people who are surprised by the capabilities of these systems continue to assume it is “just” modeling likelihoods, when there was no actual requirement on that.
I also suspect we’ve wasted an enormous quantity of our parameters on embedding knowledge that does not directly contribute to system’s capabilities.
My hypothesis for how to fix this is vaguely similar to the idea of “maximizing divergence” discussed here https://ljvmiranda921.github.io/notebook/2022/08/02/splits/.
I think we could train a LLM on a minimal corpus to “teach” a language^[1] and then place that LLM inside of a larger system that we train to minimize loss on examples teaching logic, mathematics, and other components of reasoning. That larger system would distinguish between the weights for the algorithms it learns and the weights representing embedded knowledge. It would also have the capability to loop during the generation of an output. For comparison, think of the experiments being done with hooking up GPT-4 to a vector database, but now do that inside of the architecture instead of as a hack on top of the text prompts.
I think an architecture that cleanly separates embedded knowledge (“facts”, “beliefs”, “shards”, etc) from the algorithms (“capabilities”, “zero-shot learning”) is core to designing a neural network that remains interpretable and alignable at scale.
If you read the previous paragraphs and think, “that sounds familiar”, it’s probably because I’m describing how we teach humans: first language, then reasoning, then specialization. A curriculum. We need language first because we want to be able to show examples, explain, and correct mistakes. Especially since we can automate content generation with existing LLMs to create the training corpus in these steps. Then we want to teach reasoning, starting with the most general forms of reasoning, and working into the most specific. Finally, we grade the system (not train!) on a corpus of specific knowledge-based activities. Think of this step as describing the rules of a made-up game, providing the current game state, and then asking for the optimal move. Except that for games, for poems, for math, for wood working, for engineering, etc. The whole point of general intelligence is that you can reason from first principles, so that’s what we need to be grading the network on: minimizing loss with respect to arbitrarily many knowledge-based tasks that must be solved using the facts provided only during the test itself.
1. ^
  Is English the right language to teach? I think it would be funny if a constructed language actually found a use here.

anonymousaisafety Mar 23, 2023, 9:59 PM
2 points
in reply to: jacob_cannell’s comment on: The “Outside the Box” Box
I’m reminded of this thread from 2022: https://www.lesswrong.com/posts/27EznPncmCtnpSojH/link-post-on-deference-and-yudkowsky-s-ai-risk-estimates?commentId=SLjkYtCfddvH9j38T#SLjkYtCfddvH9j38T

anonymousaisafety Dec 4, 2022, 7:27 PM
6 points
4
in reply to: Charlie Steiner’s comment on: Inner and outer alignment decompose one hard problem into two extremely hard problems
I realize that my position might seem increasingly flippant, but I really think it is necessary to acknowledge that you’ve stated a core assumption as a fact.
Alignment doesn’t run on some nega-math that can’t be cast as an optimization problem.
I am not saying that the concept of “alignment” is some bizarre meta-physical idea that cannot be approximated by a computer because something something human souls etc, or some other nonsense.
However the assumption that “alignment is representable in math” directly implies “alignment is representable as an optimization problem” seems potentially false to me, and I’m not sure why you’re certain it is true.
There exist systems that can be 1.) represented mathematically, 2.) perform computations, and 3.) do not correspond to some type of min/max optimization, e.g. various analog computers or cellular automaton.
I don’t think it is ridiculous to suggest that what the human brain does is 1.) representable in math, 2.) in some type of way that we could actually understand and re-implement it on hardware / software systems, and 3.) but not as an optimization problem where there exists some reward function to maximize or some loss function to minimize.

anonymousaisafety Dec 3, 2022, 7:33 PM
LW: 11 AF: 8
2
AF
in reply to: Charlie Steiner’s comment on: Inner and outer alignment decompose one hard problem into two extremely hard problems
I wasn’t intending for a metaphor of “biomimicry” vs “modernist”.
(Claim 1) Wings can’t work in space because there’s no air. The lack of air is a fundamental reason for why no wing design, no matter how clever it is, will ever solve space travel.
If TurnTrout is right, then the equivalent statement is something like (Claim 2) “reward functions can’t solve alignment because alignment isn’t maximizing a mathematical function.”
The difference between Claim 1 and Claim 2 is that we have a proof of Claim 1, and therefore don’t bother debating it anymore, while with Claim 2 we only have an arbitrarily long list of examples for why reward functions can be gamed, exploited, or otherwise fail in spectacular ways, but no general proof yet for why reward functions will never work, so we keep arguing about a Sufficiently Smart Reward Function That Definitely Won’t Blow up as if that is a thing that can be found if we try hard enough.
As of right now, I view “shard theory” sort of like a high-level discussion of chemical propulsion without the designs for a rocket or a gun. I see the novelty of it, but I don’t understand how you would build a device that can use it. Until someone can propose actual designs for hardware or software that would implement “shard theory” concepts without just becoming an obfuscated reward function prone to the same failure modes as everything else, it’s not incredibly useful to me. However, I think it’s worth engaging with the idea because if correct then other research directions might be a dead-end.
Does that help explain what I was trying to do with the metaphor?

anonymousaisafety Dec 3, 2022, 6:22 PM
LW: 2 AF: 1
3
AF
in reply to: Charlie Steiner’s comment on: Inner and outer alignment decompose one hard problem into two extremely hard problems
To some extent, I think it’s easy to pooh-pooh finding a flapping wing design (not maximally flappy, merely way better than the best birds) when you’re not proposing a specific design for building a flying machine that can go to space. Not in the tone of “how dare you not talk about specifics,” but more like “I bet this chemical propulsion direction would have to look more like birds when you get down to brass tacks.”

anonymousaisafety Jul 27, 2022, 4:20 AM
3 points
0
in reply to: Mateon1’s comment on: Eavesdropping on Aliens: A Data Decoding Challenge
(1) The first thing I did when approaching this was think about how the message is actually transmitted. Things like the preamble at the start of the transmission to synchronize clocks, the headers for source & destination, or the parity bits after each byte, or even things like using an inversed parity on the header so that it is possible to distinguish a true header from bytes within a message that look like a header, and even optional checksum calculations.
(2) I then thought about how I would actually represent the data so it wasn’t just traditional 8-bit bytes—I created encoders & decoders for 36/24/12/6 bit unsigned and signed ints, and 30 / 60 bit non-traditional floating point, etc.
Finally, I created a mock telemetry stream that consisted of a bunch of time-series data from many different sensors, with all of the sensor values packed into a single frame with all of the data types from (2), and repeatedly transmitted that frame over the varying time series, using (1), until I had >1 MB.
And then I didn’t submit that, and instead swapped to a single message using the transmission protocol that I designed first, and shoved an image into that message instead of the telemetry stream.
- To avoid the flaw where the message is “just” 1-byte RGB, I viewed each pixel in the filter as being measured by a 24-bit ADC. That way someone decoding it has to consider byte-order when forming the 24-bit values.
- Then, I added only a few LSB of noise because I was thinking about the type of noise you see on ADC channels prior to more extensive filtering. I consider it a bug that I only added noise in some interval [0, +N], when I should have allowed the noise to be positive or negative. I am less convinced that the uniform distribution is incorrect. In my experience, ADC noise is almost always uniform (and only present in a few LSB), unless there’s a problem with the HW design, in which case you’ll get dramatic non-uniform “spikes”. I was assuming that the alien HW is not so poorly designed that they are railing their ADC channels with noise of that magnitude.
- I wanted the color data to be more complicated than just RGB, so I used a Bayer filter, that way people decoding it would need to demosiac the color channels. This further increased the size of the image.
- The original, full resolution image produced a file much larger than 1 MB when it was put through the above process (3 8-bit RGB → 4 24-bit Bayer), so I cut the resolution on the source image until the output was more reasonably sized. I wasn’t thinking about how that would impact the image analysis, because I was still thinking about the data types (byte order, number of bits, bit ordering) more so than the actual image content.
- “Was the source image actually a JPEG?” I didn’t check for JPEG artifacts at all, or analyze the image beyond trying to find a nice picture of bismuth with the full color of the rainbow present so that all of the color channels would be used. I just now did a search for “bismuth png” on Google, got a few hits, opened one, and it was actually a JPG. I remember scrolling through a bunch of Google results before I found an image that I liked, and then I just remember pulling & saving it as a BMP. Even if I had downloaded a source PNG as I intended, I definitely didn’t check that the PNG itself wasn’t just a resaved JPEG.

anonymousaisafety Jul 26, 2022, 3:47 PM
3 points
0
in reply to: Mateon1’s comment on: Eavesdropping on Aliens: A Data Decoding Challenge
My understanding of faul_sname’s claim is that for the purpose of this challenge we should treat the alien sensor data output as an original piece of data.
In reality, yes, there is a source image that was used to create the raw data that was then encoded and transmitted. But in the context of the fiction, the raw data is supposed to represent the output of the alien sensor, and the claim is that the decompressor + payload is less than the size of just an ad-hoc gzipping of the output by itself. It’s that latter part of the claim that I’m skeptical towards. There is so much noise in real sensors—almost always the first part of any sensor processing pipeline is some type of smoothing, median filtering, or other type of noise reduction. If a solution for a decompressor involves saving space on encoding that noise by breaking a PRNG, it’s not clear to me how that would apply to a world in which this data has no noise-less representation available. However, a technique of measuring & subtracting noise so that you can compress a representation that is more uniform and then applying the noise as a post-processing op during decoding is definitely doable.
Assuming that you use the payload of size 741809 bytes, and are able to write a decompressor + “transmitter” for that in the remaining ~400 KB (which should be possible, given that 7z is ~450 KB, zip is 349 KB, other compressors are in similar size ranges, and you’d be saving space since you just need to the decoder portion of the code), how would we rate that against the claims?
1. It would be possible for me, given some time to examine the data, create a decompressor and a payload such that running the decompressor on the payload yields the original file, and the decompressor program + the payload have a total size of less than the original gzipped file
2. The decompressor would legibly contain a substantial amount of information about the structure of the data.
(1) seems obviously met, but (2) is less clear to me. Going back to the original claim, faul_sname said ‘we would see that the winning programs would look more like “generate a model and use that model and a similar rendering process to what was used to original file, plus an error correction table” and less like a general-purpose compressor’.
So far though, this solution does use a general purpose compressor. My understanding of (2) is that I was supposed to be looking for solutions like “create a 3D model of the surface of the object being detected and then run lighting calculations to reproduce the scene that the camera is measuring”, etc. Other posts from faul_sname in the thread, e.g. here seem to indicate that was their thinking as well, since they suggested using ray tracing as a method to describe the data in a more compressed manner.
What are your thoughts?
Regarding the sensor data itself
I alluded to this in my post here, but I was waffling and backpedaling a lot on what would be “fair” in this challenge. I gave a bunch of examples in the thread of what would make a binary file difficult to decode—e.g. non-uniform channel lengths, an irregular data structure, multiple types of sensor data interwoven into the same file, and then did basically none of that, because I kept feeling like the file was unapproachable. Anything that was a >1 MB of binary data but not a 2D image (or series of images) seemed impossible. For example, the first thing I suggested in the other thread was a stream of telemetry from some alien system.
I thought this file would strike a good balance, but I now see that I made a crucial mistake: I didn’t expect that you’d be able to view it with the wrong number of bits per byte (7 instead of 6) and then skip almost every byte and still find a discernible image in the grayscale data. Once you can “see” what the image is supposed to be, the hard part is done.
I was assuming that more work would be needed for understanding the transmission itself (e.g. deducing the parity bits by looking at the bit patterns), and then only after that would it be possible to look at the raw data by itself.
I had a similar issue when I was playing with LIDAR data as an alternative to a 2D image. I found that a LIDAR point cloud is eerily similar enough to image data that you can stumble upon a depth map representation of the data almost by accident.

anonymousaisafety Jul 24, 2022, 4:37 AM
1 point
0
in reply to: blf’s comment on: Contest: An Alien Message
I have posted my file here https://www.lesswrong.com/posts/BMDfYGWcsjAKzNXGz/eavesdropping-on-aliens-a-data-decoding-challenge.

anonymousaisafety Jul 24, 2022, 4:36 AM
1 point
in reply to: faul_sname’s comment on: I No Longer Believe Intelligence to be “Magical”
I’ve posted it here https://www.lesswrong.com/posts/BMDfYGWcsjAKzNXGz/eavesdropping-on-aliens-a-data-decoding-challenge.

Eavesdropping on Aliens: A Data Decoding Challenge

anonymousaisafetyJul 24, 2022, 4:35 AM

49 points

9 comments4 min readLW link

anonymousaisafety Jul 15, 2022, 5:57 PM
1 point
in reply to: faul_sname’s comment on: I No Longer Believe Intelligence to be “Magical”
Which question are we trying to answer?
1. Is it possible to decode a file that was deliberately constructed to be decoded, without a priori knowledge? This is vaguely what That Alien Message is about, at least in the first part of the post where aliens are sending a message to humanity.
2. Is it possible to decode a file that has an arbitrary binary schema, without a priori knowledge? This is the discussion point that I’ve been arguing over with regard to stuff like decoding CAMERA raw formats, or sensor data from a hardware/software system. This is also the area where I disagree with That Alien Message—I don’t think that one-shot examples allow robust generalization.
I don’t think (1) is a particularly interesting question, because last weekend I convinced myself that the answer is yes, you can transfer data in a way that it can be decoded, with very few assumptions on the part of the receiver. I do have a file I created for this purpose. If you want, I’ll send you it.
I started creating a file for (2), but I’m not really sure how to gauge what is “fair” vs “deliberately obfuscated” in terms of encoding. I am conflicted. Even if I stick to encoding techniques I’ve seen in the real world, I feel like I can make choices on this file encoding that make the likelihood of others decoding it very low. That’s exactly what we’re arguing about on (2). However, I don’t think it will be particularly interesting or fun for people trying to decode it. Maybe that’s ok?
What are your thoughts?
What links here?
- anonymousaisafety's comment on Eavesdropping on Aliens: A Data Decoding Challenge by anonymousaisafety (Jul 26, 2022, 3:47 PM; 3 points)

anonymousaisafety

Eaves­drop­ping on Aliens: A Data De­cod­ing Challenge

Eavesdropping on Aliens: A Data Decoding Challenge