Arjun Pitchanathan comments on $500 Bounty Problem: Are (Approximately) Deterministic Natural Latents All You Need?

Arjun Pitchanathan 25 Jul 2025 17:31 UTC
3 points
0
To check my understanding: for random variables $A, B$ , the stochastic error of a latent $Λ$ is the maximum among $I (A; B | Λ), I (A; Λ | B), I (B; Λ | A)$ . The deterministic error is the maximum among $I (A; B ∣ Λ), H (Λ | A), H (Λ | B)$ . If so, the claim in my original comment holds—I also wrote code (manually) to verify. Here’s the fixed claim:
Let $X \sim Ber (p)$ . With probability $r$ , set $Z := X$ , and otherwise draw $Z \sim Ber (p)$ . Let $Y \sim Ber (1 / 2)$ . Let $A = X \oplus Y$ and $B = Y \oplus Z$ . We will investigate latents for $(A, B)$ . Let $ϵ$ be the stochastic error of latent $Λ := Y$ . Now compute the deterministic errors of each of the latents $X$ , $Y$ , $Z$ , $A$ , $B$ , $A \oplus B$ , $X \oplus Y \oplus Z$ . Then for $p := 0.9, r := 0.44$ , all of these latents have deterministic error greater than $5 ϵ$ .
It should be easy to modify the code to consider other latents. I haven’t thought much about proving that there aren’t any other latents better than these, though.
- Arjun Pitchanathan 28 Jul 2025 15:09 UTC
  3 points
  0
  Parent
  On this particular example you can achieve deterministic error $\approx 2.5 ϵ$ with latent $A \land B$ , but it seems easy to find other examples with ratio > 5 (including over latents $A \land B, A \lor B$ ) in the space of distributions over $(X, Y, Z)$ with a random-restart hill-climb. Anyway, my takeaway is that if you think you can derandomize latents in general you should probably try to derandomize the latent $Λ := Y$ for variables $A := X \oplus Y$ , $B := Z \oplus Y$ for distributions over boolean variables $X, Y, Z$ .
  (edited to fix typo in definition of $B$ )
  - Arjun Pitchanathan 28 Jul 2025 22:22 UTC
    3 points
    0
    Parent
    My impression is that prior discussion focused on discretizing $Λ$ . $Λ$ is already boolean here, so if the hypothesis is true then it’s for a different reason.