(we use cat to denote catastrophe)
The general hope of distillation is as follows:
You have some circuit
this is half of the iterated amplification approach.
remark: obviously, if you have a really great idea for distilling NNs, DO NOT PUBLISH IT unless youâre pretty confident that it deferentially helps safety stuff and isnât just shaving time off of how long we have before weâre cooked.
Iâm not expecting anything here to be dangerous though. Iâm mostly just trying to understand under what conditions this is theoretically possible.
try 1
edit: i think instead of constant sized TMs I shouldâve just assumed you had random access to the circuit. whatever.
Worked with to Nathan Sheffield on this idea.
Suppose
is a circuit with gates, and Alice has access to a distilled circuit with gates. Suppose Alice wants to convince Bob about the value of on some input . Is it possible to do this if Alice runs in time and Bob runs in time ?
Note: This sounds really audacious. But, something pretty similar is actually possible â itâs called âGKRâ
(Goldwasser, Kalai, Rothblum https://dl.acm.org/doi/10.1145/2699436)
- Suppose you have a log-space uniform circuit with depth
and input size . - Then there is a (âdoubly efficientâ) interactive proof where the prover takes time
- And the Verifier takes time
, and the communication complexity is . Basically you should think of this as, if thereâs a computation that can be done in time then thereâs a proof of the computation that can be generated in time ~ and checked in time ~ .
more formal statement, and a proof that itâs impossible.
Weâd like for there to be a
- If
then there exists such that for all . - If
then for any , and any with , .
why this is impossible: We will assume
Claim
Suppose I have a really smart guy Bob such that whenever
Now suppose we have some arbitrary circuit
- He âwrites downâ (remember that weâre assuming the circuit is generated by a small amount of TM code) the circuit
- Then, if itâs the case that
, Bob can guess a short proof that . - If Bob fails to guess such a proof, then he knows that
. - But of course
.
So anyways, Bob just evaluated this circuit. But as stated above, this is believed by complexity ppl to be impossible.
We also considered a âdistributionalâ version of this â i.e., you just need to make sure pr cat is quite small.
it seemed pretty unworkable.