Теорема Адлемана о бесконечных полуколец?

13

В 1978 году Адлеман показал, что BPPP/poly : если булева функция f из n переменных может быть вычислена с помощью вероятностной булевой схемы размера M , тогда f может быть вычислена с помощью детерминированной булевой схемы размера многочлен от M и n ; на самом деле, размером O(nM) .

Общий Вопрос: За каким другими (чем булевыми) полукольца делает BPPP/poly трюма?

Чтобы быть более конкретным, вероятностная схема над полукольцом ( S , + , , 0 , 1 ) использует свои операции «сложения» ( + ) и «умножения» ( ) в качестве затворов. Входы - это входные переменные x 1 , ... , х п , и , возможно , некоторое количество дополнительных случайных величин, которые принимают значения 0 и 1 независимо друг от друга с вероятностью 1 / 2 ; здесьC(S,+,,0,1)(+)()x1,,xn011/2 и 1 - аддитивная и мультипликативная тождества полукольца соответственно, такая схема C вычисляетзаданную функцию f : S n01C , если для каждого х S п , Р г [ С ( х ) = е ( х ) ] 2 / 3 . f:SnSxSnPr[C(x)=f(x)]2/3

Функция голосования из m переменных является частичной функцией, значение которой равно y, если элемент y появляется более чем в m / 2 раза среди y 1 , , y m и не определен , если такого элемента у не существует. Простое применение черновских и союзных границ дает следующее.Maj(y1,,ym)myym/2y1,,ymy

Уловка большинства: если вероятностная схема вычисляет функцию f : S nS на конечном множестве X S n , то существует m = O ( log | X | ) реализаций C 1 , , C m из C, таких что f ( x ) = M a j ( C 1 ( x ) , Cf:SnSXSnm=O(log|X|)C1,,CmC выполняется для всех х Х . f(x)=Maj(C1(x),,Cm(x))xX

Для логического полукольца функция голосования является мажоритарной функцией и имеет небольшие (даже монотонные) цепи. Итак, из теоремы Адлемана следует, что X = { 0 , 1 } n . MajX={0,1}n

Но как насчет других (особенно бесконечных) полуколец? А как насчет арифметического полукольца (с обычным сложением и умножением)?(N,+,,0,1)

Вопрос 1: Имеют ли трюма над арифметическим полукольцом? BPPP/poly

Хотя я держу пари на «да», я не могу показать это.

Remark: I am aware of this paper where the authors claim BPPP/poly over the real field (R,+,,0,1). They deal with non-monotone arithmetic circuits, and also arrive (in Theorem 4) to circuits with the voting function Maj as an output gate. But how to simulate this Maj-gate by an arithmetic circuit (be it monotone or not)? I.e. how to get their Corollary 3?

Actually, the following simple argument told to me by Sergey Gashkov (from Moscow University) seems to show that this is impossible (at least for circuits able to compute only polynomials). Suppose we can express Maj(x,y,z) as a polynomial f(x,y,z)=ax+by+cz+h(x,y,z). Then f(x,x,z)=x implies c=0, f(x,y,x)=x implies b=0, and f(x,y,y)=y implies a=0. This holds because, over fields of zero characteristic, equality of polynomial-functions means equality of coefficients. Note that in Question 1, the range of probabilistic circuits, and hence, the domain of the Maj-gate is infinite. I therefore have an impression that the linked paper deals only with arithmetic circuits computing functions f:RnY with small finite ranges Y, like Y={0,1}. Then Maj:YmY is indeed easy to compute by an arithmetic circuit. But what if Y=R?


Correction [6.03.2017]: Pascal Koiran (one of the authors of this paper) pointed to me that their model is more powerful than just arithmetic circuits: they allow Sign-gates (outputing 0 or 1 depending on whether the input is negative of not). So, the voting function Maj can be simulated in this model, and I take back my "confusion".


In the context of dynamic programming, especially interesting is the same question for tropical min-plus and max-plus semirings (N{+},min,+,+,0) and (N{},max,+,,0).

Question 2: Does BPPP/poly hold over tropical semirings?

Held BPPP/poly in these two semirings, this would mean that randomness cannot speed-up so-called "pure" dynamic programming algorithms! These algorithms only use Min/Max and Sum operations in their recursions; Bellman-Ford, Floyd-Warshall, Held-Karp, and many other prominent DP algorithms are pure.

So far, I can only answer Question 2 (affirmatively) under the one-sided error scenario, when we additionally require Pr[C(x)<f(x)]=0 over the min-plus semiring (minimization), or Pr[C(x)>f(x)]=0 over the max-plus semiring (maximization). That is, we now require that the the randomized tropical circuit can never produce any better than optimum value; it can, however, err by giving some worse-than-optimal values. My questions are, however, under the two-sided error scenario.


P.S. [added 27.02.2017]: Here is my attempt to answer Question 1 (affirmatively). The idea is to combine a simplest version of the "combinatorial Nullstellensatz" with an estimate for the Zarankiewicz problem for n-partite hypergraps, due to Erdos and Spencer. Modulo this latter result, the entire argument is elementary.

Note that Question 2 still remains open: the "naive Nullstellensatz" (at least in the form I used) does not hold in tropical semirings.

Stasys
источник
nit: BPP is a uniform class defined using PTMs not circuits.
Kaveh
@Kaveh: yes, in this sense, Adleman's result is even a bit stronger, it holds even for BPP/poly.
Stasys
Don't see how the simple argument shows impossibility... it does seem to show that the coefficients of the x, y, and z monomials must be zero... what am I missing? Also, if a polynomial can't compute Maj, how else can you representing a computation over a semiring? (What else besides a polynomial over the semiring?) Intuitively, over an infinite domain, each constraint on some y (enforcing that on >m/2 y's you must output y) seems "independent" of the others (no subset of constraints implies another) so it seems no "finite" polynomial could satisfy the infinitely many independent constraints.
Ryan Williams
@Ryan: yes, this only shows f=Maj implies h=Maj. But h has degree > 1, so h(x,x,z)=x is impossible. And you are right: circuits over semirings cannot compute anything else as polynomials. So, they cannot compute Maj. But the authors of that paper deal with {+,x,-,/} circuits, with all field operations allowed. Perhaps then Maj can still be somehow computed? (I however, do not see how.) B.t.w. instead of trying to simulate Maj itself, one could answer Q1&Q2 by showing that one Maj-gate cannot substantially decrease circuit size (which is quite plausible).
Stasys
@Ryan: P.S. Igor Sergeev observed that Maj "could" be probable computable over (R,+,x,-,/). E.g. Maj(x,y,z) is computable by f(x,y,z)=(xy+xz-2yz)/(2x-y-z) for all inputs with |{x,y,z}|=2. Note that the simple argument above implies that, already on such inputs, this cannot be done over (R,+,x,-). So, division can help. But we face the division by 0 issue ...
Stasys

Ответы:

3

This is only a partial answer to your general question (I'm not sure what a fully general formulation would be), but it suggests that working over sufficiently nice infinite semirings while constraining the randomness to a finite domain might actually trivialize the question of whether Adleman's theorem holds.

Suppose you're working over the complex numbers C, so that circuits compute polynomials over that field, and suppose the function f is itself computed by some polynomial (however complicated) of the x variables. Then it turns out that already for some fixed r, C(x,r)=f(x). The reason is that for each r, the set of x with C(x,r)=f(x) determines a Zariski-closed subset of Cn, and so must be all of Cn, or else a subset of measure zero. If all of these sets were to have measure zero, then, because there are only finitely many r's in consideration, the set of x where r:C(x,r)=f(x) would also have measure zero. On the other hand, the assumption that C computes f implies this that set must be all of Cn, so it cannot have measure zero.

Andrew Morgan
источник
Interesting. More generally, a probabilistic circuit of size M is some random variable C taking its values in the set of all circuits (of that type) with at most M gates. [B.t.w. that paper of Cucker at al. allows C be arbitrarily distributed. The "majority trick" stil works.] Can I conclude from your argument that, if the range of C is finite, then Adleman's theorem is trivial when Zarinski-closed subsets are either trivial (sets themselves) or have zero measure? Have we this - "all or nothing" - effect in tropical semirings? (I am mainly interested in them.)
Stasys
I don't know how or whether the argument would generalize to other semirings, sorry. One main thing that's missing (for me) is the geometric intuition akin to how "polynomials that disagree" translates to "measure-zero subsets of Cn". For tropical semirings in particular, the operations seem so different from ordinary polynomials that it's hard to even guess at what the appropriate adaptation ought to be.
Andrew Morgan