DeepAI

The Gotsman-Linial Conjecture is False

In 1991, Craig Gotsman and Nathan Linial conjectured that for all n and d, the average sensitivity of a degree-d polynomial threshold function on n variables is maximized by the degree-d symmetric polynomial which computes the parity function on the d layers of the hypercube with Hamming weight closest to n/2. We refute the conjecture for almost all d and for almost all n, and we confirm the conjecture in many of the remaining cases.

05/21/2019

A polynomial approach to the Collatz conjecture

The Collatz conjecture is explored using polynomials based on a binary n...
05/20/2021

Tuza's Conjecture for Threshold Graphs

Tuza famously conjectured in 1981 that in a graph without k+1 edge-disjo...
02/18/2022

On The "Majority is Least Stable" Conjecture

We show that the "majority is least stable" conjecture is true for n=1 a...
06/01/2018

The real tau-conjecture is true on average

Koiran's real τ-conjecture claims that the number of real zeros of a str...
04/17/2020

Counterexamples to the Low-Degree Conjecture

A conjecture of Hopkins (2018) posits that for certain high-dimensional ...
11/17/2020

Complete quadrics: Schubert calculus for Gaussian models and semidefinite programming

We establish connections between: the maximum likelihood degree (ML-degr...
06/22/2020

Logarithmic Pruning is All You Need

The Lottery Ticket Hypothesis is a conjecture that every large neural ne...

1 Introduction

We say that a boolean function is a Polynomial Threshold Function of degree if it can be expressed as the sign of a polynomial of degree at most evaluated on the boolean hypercube. For brevity, we will use the term -PTF (or simply PTF, when and are either implicit or irrelevant) to refer to a polynomial threshold function of degree on variables. We say that the coefficients of are the realizing weights of . Note that these realizing weights are not unique, as any sufficiently small perturbation of will not affect its sign on the discrete set . This definition alone is not terribly exciting without restrictions on , as every boolean function on variables can be written as the sign of (and in fact can be written exactly as) a multilinear polynomial of degree . We are interested particularly in the case where is small.

In an influential paper, Craig Gotsman and Nathan Linial [GL94] applied Fourier analytic techniques to the study of PTFs. They were mainly interested in connecting different measures of the complexity of boolean functions, and of low-degree PTFs in particular. One such measure was the Average Sensitivity of a boolean function, defined in Fourier analytic terms. For simplicity, in this paper we use the following (equivalent) combinatorial definition:

Definition 1.1

For a function , we define its Dichromatic Count to be the number of (unordered) pairs of Hamming neighbors such that .

We say that such a pair of Hamming neighbors is a dichromatic edge of .

Definition 1.2

The Average Sensitivity of a boolean function is .

Among other things, Gotsman and Linial proved a tight upper bound on the average sensitivity of -PTFs, achieved by the MAJORITY function on variables. They conjectured that this bound generalizes to higher degree PTFs, in that the -PTF of maximal average sensitivity is the obvious symmetric candidate, which alternates signs on the values of closest to .

Conjecture 1.1 (Gotsman-Linial)

Let be the monic univariate polynomial of degree with (non-repeated) roots at the integers closest to of opposite parity from . Let . Then for every -PTF , .

This conjecture was listed as a prominent open problem in [OD14] and [FHHMOSWW14]. If true, it would have many applications in complexity and learning (see for example [HKM09, GS10, Kan12, KW16, CSS16]), although most of the applications would already be implied by an asymptotic version of the conjecture, stated below. Gotsman and Linial proved their conjecture for the case where , and it is also known to be true in the case where . However, it was left open whether the conjecture holds for any . Two weaker versions of this conjecture have since been formulated and studied.

Conjecture 1.2 (Gotsman-Linial - Asymptotic)

Let be an -PTF. Then the average sensitivity .

Conjecture 1.3 (Gotsman-Linial - Weak)

Let be an -PTF. Then the average sensitivity for some function depending only on .

Conjecture 1.3 was resolved by Daniel Kane [Kan13].

1.1 Result

In this paper, we resolve the Gotsman-Linial Conjecture (Conjecture 1.1) for all pairs except the case when is even and . The main result of this paper is the following.

Theorem 1.1

For all pairs of natural numbers satisfying one of the following criteria, there exists an -PTF witnessing a counterexample to the Gotsman-Linial Conjecture (Conjecture 1.1):

• is odd, and

.

• , and .

Moreover, .

In addition, the conjecture holds in many of the remaining cases.

Theorem 1.2

For all pairs of natural numbers satisfying one of the following criteria, has the greatest average sensitivity among -PTFs.

• .

• .

• .

Our results (and the remaining open cases) are summarized in Figure 1. Although we refute the Gotsman-Linial Conjecture for most cases that are of interest for applications, the asymptotic conjecture (Conjecture 1.2), which would suffice for most known applications, remains open.

The remainder of this paper is structured as follows. We first present some high level intuition relating to the Gotsman-Linial Conjecture. Section 2 contains background information. Section 3 contains constructions of the refutations indicated in Figure 1. Section 4 concludes the paper and presents a revised conjecture.

1.2 Intuition

We start with some very high level intuition as to why the Gotsman-Linial Conjecture might be (approximately) true. The conjecture holds in the case of symmetric PTFs (boolean functions which can be expressed as the sign of a univariate polynomial in the sum of the input bits). This follows from the Fundamental Theorem of Algebra and a simple counting argument. In the more general case, we might expect that a degree- PTF can be expressed (at least approximately) in terms of unate functions. This generalizes the observation that every linear threshold function is unate. For a sufficiently close approximation, this would prove the Asymptotic Gotsman-Linial Conjecture. Intuition may also be drawn from Kane’s proof of Conjecture 1.3

. If inputs are chosen from a Gaussian distribution instead of a Bernoulli distribution, a polynomial

is expected to be too large in magnitude for a small change in its input to change its sign. Under certain conditions, a similar result can be extended to polynomial threshold functions on the boolean hypercube.

As for why the Gotsman-Linial Conjecture is not (exactly) true, we observe that the PTF of conjectured maximal average sensitivity is the product of

linear threshold functions, with parallel separating hyperplanes between two of the middle

layers (sets of vertices of equal Hamming weight) in the hypercube. For some , one might expect to be able to find a PTF of greater average sensitivity approximated by turning one of these separating hyperplanes ‘sideways’, i.e. replacing a hyperplane that cuts the fewest edges with a hyperplane orthogonal to the rest. Intuitively, this would require that be sufficiently large that some of the hyperplanes cut many more edges than others, but also sufficiently small that not too many edges are cut by two hyperplanes. As it turns out, this intuition can be formalized for many and , refuting the Gotsman-Linial Conjecture.

2 Preliminaries

2.1 Background

Low-degree PTFs, in particular linear threshold functions (degree- PTFs) with integral and polynomially bounded realizing weights, are of interest in the study of complexity classes such as

(i.e. circuits composed of AND, OR, NOT, and MAJORITY gates of unbounded fan-in) and of neural networks. More generally, we say that a circuit (with unbounded fan-in) is a

degree- polynomial threshold circuit if each of its constituent gates computes a degree- PTF of its inputs. Note that since AND, OR, MAJORITY, and NOT are all linear threshold functions, and circuits are degree- polynomial threshold circuits. Despite much research, the power of polynomial threshold circuits is poorly understood. For instance, it is currently an open question, and a rather embarrassing one at that, whether (the class of functions computable in nondeterministic time) is contained in (the class of functions computable by families of depth-, polynomial size linear threshold circuits with polynomially bounded realizing weights). Recent work by Daniel Kane and Ryan Williams [KW16] gave a partial answer to this question. They studied the sensitivity of PTFs to random restrictions, proving (among other things) that (and in fact, -uniform ) does not have depth-3 circuits of gates or wires.

2.2 Progress

Conjecture 1.1 is trivially true in the cases and (the only -PTFs are the constant functions, and is the parity function, which has the maximum possible average sensitivity). Gotsman and Linial originally noted that Conjecture 1.1 had already been proven in the case where by Patrick O’Neil in 1971 [ON71].

Theorem 2.1 (O’Neil)

The maximal number of edges of which may be cut by a hyperplane is given by .

Very little additional progress was made towards resolving the above conjectures until recently. The first non-trivial bounds on the average sensitivity of PTFs of arbitrary degree were found independently by two groups [HKM09, DRST14] and published jointly [DHKMRST10]. Daniel Kane in 2012 obtained the first bound which was truly sublinear in [Kan12], and in 2013, he proved the weak version of the Gotsman-Linial Conjecture (Conjecture 1.3) [Kan13].

3 Resolution of Gotsman-Linial Conjecture

For simplicity, we start by introducing some notation.

Definition 3.1

Let . We say , iff there exist and such that the function is a constant.

Note that defines an equivalence relation on boolean functions. Two functions are equivalent iff one can be turned into the other through a combination of permuting the inputs and negating the inputs/output.

Definition 3.2

An -Hypersensitive Function, or -HSF is an -PTF such that .

More generally, we say that a PTF is an HSF if and are either implicit or irrelevant. We may now restate the original Gotsman-Linial Conjecture (Conjecture 1.1) as follows:

Conjecture 3.1

For all , -HSFs do not exist.

We first prove some simple cases of Conjecture 3.1. The following corollary of O’Neil’s theorem (Theorem 2.1) uses our notation.

Corollary 3.1

For every , -HSFs do not exist.

Proof. Every -PTF is defined by a separating hyperplane which cuts all of the dichromatic edges of . From O’Neil, , so is not an HSF.

The case is a simple consequence of a result first proven in 1968 by Marvin Minsky and Seymour Papert [MP68] and since re-proven several times. We present here a variation on the proof by Aspnes et al. [ABFR94].

Theorem 3.1 (Minsky-Papert)

Any PTF which computes parity on variables must have degree at least .

Proof. Let be a multilinear polynomial of degree which is never zero on . The set of monomials of degree at most

is an orthogonal basis for the vector space of degree-

multilinear polynomials on the boolean hypercube. Hence is orthogonal to the parity function , i.e. . By assumption, every term in the sum on the RHS is non-zero, so at least one of them is negative, i.e. .

Corollary 3.2

For every , -HSFs do not exist.

Proof. Let be an -PTF. Then , and . Let and . Take and . There are edge-disjoint paths between and in the boolean hypercube, and each must contain at least one edge crossing the cut between and (i.e. a monochromatic edge). Hence , so is not an HSF.

Lemma 3.1

Let , and let have maximal over all -PTFs. Then for every -PTF , .

Proof. Let , and let be an -PTF with maximal. Let be an -PTF. Any restriction of to a function on variables is also a degree- PTF, so . There are such restrictions , and each dichromatic edge of appears in exactly of them. Hence , from which the desired result follows immediately.

Lemma 3.2

Let with . If and have the same parity, and -HSFs do not exist, then -HSFs do not exist.

Proof. Assume that no -PTF is an HSF. If and have the same parity, then every restriction of is equivalent (with respect to ) to . There are such restrictions, and each dichromatic edge of appears in exactly of them, so . Hence by Lemma 3.1, -HSFs do not exist.

Corollary 3.3

For every , -HSFs do not exist.

Proof. This follows from Corollary 3.2 and Lemma 3.2.

Corollary 3.4

Let . If and , then -HSFs do not exist.

Proof. This follows from Corollaries 3.1, 3.2 and 3.3, and the fact that -HSFs trivially do not exist when .

3.1 A Simple Counterexample

In the statement of Corollary 3.4, the caveat cannot be removed.

Lemma 3.3

There exists a unique -HSF , modulo .

Proof. In the case where and , , and . Let be defined by , let such that , and let . Since is quadratic, is a -PTF. It is not difficult to verify that , so is a -HSF. For uniqueness, see Appendix A.1.

The existence of a -HSF precludes the use of Lemma 3.2 to prove that -HSFs do not exist. However, the uniqueness of , along with the fact that it only has one additional dichromatic edge, allows for a proof using Lemma 3.1.

Lemma 3.4

For every , -HSFs do not exist.

Proof. The cases have already been covered. For , see Appendix A.3. The case remains. Assume for the sake of contradiction that is a -HSF. The dichromatic count of every boolean function on an even number of variables is an even integer. Since for every -PTF , , Lemma 3.1 implies that , and hence that . There are restrictions of to a function on variables, all of which satisfy . Every dichromatic edge in appears in exactly five such , so the expectation over a uniformly random restriction of is . Since is always an integer,

with probability strictly greater than

. In particular, there exists such that (*). However, it is easily verified (see Appendix A.2) that no function satisfying both (*) and is a -PTF. This contradicts the initial choice of . Hence no -HSFs exist.

This also completes the proof of Theorem 1.2.

3.2 Extension to Odd n

We may extend to an -HSF for any odd .

Theorem 3.2

For every odd with , there exists an -HSF with
.

Intuitively, behaves exactly as , with the additional variables contributing to the second argument of .
Proof. Let be an odd integer. Let and . Let be the -dimensional boolean hypercube, and let be the graph with vertex set and an edge between and exactly when . Let be the graph homomorphism defined by . Let as above, let , and take . Note that because is a graph homomorphism, we may compute by counting the dichromatic edges induced by on , weighted by . To this end, we observe that an edge between and has a preimage under of cardinality

 |ϕ−1(e)|=(2i)(n−2j)(n−2−j).

Similarly, for an edge between and ,

 |ϕ−1(e)|=(2i)(n−2j)(2−i).

We observe that is positive on except at the four points . Hence gives nine dichromatic edges, as indicated by the black lines below.

Summing the above expressions over these nine edges, we have

 D[fn,2] =(n−2n−12)((20)n+(21)(n−1)+(22)(n−1)) =(n−2n−12)(4n−3) =(n−2n−12)(n−3+3n) =(n−2n+12)(n+1)+(n−2n−12)3n =((n−1n+12)+(n−1n−12))n+(n−2n+12) =(nn+12)n+(n−2n+12) ∈(1+Θ(n−1))D[f∗n,2].

Hence is an -HSF, as desired.

3.3 The General Case

Using a similar construction, we now prove the existence of HSFs of arbitrary degree.

Theorem 3.3

For every with and , there exists an -HSF with .

We first consider the case where and have the same parity. The case where and have opposite parity is similar but handled later.

Theorem 3.4

For every with and even, there exists an -HSF with .

Proof. Let be integers of the same parity with . Let and let . Let be the -dimensional boolean hypercube, and let be the graph with vertex set and an edge between and exactly when . Let be the graph homomorphism defined by . We now define four polynomials on as follows:

 p1(x,y) :=(y−1+d)(y+1−d) p2(x,y) :=1−2(x(d−1)+y)2 p3(x,y) :=(y−3+d)(y−5+d)⋯(y+5−d)(y+3−d) p4(x,y) :=x(x+2)(x−2)(y−4+d)(y−6+d)⋯(y+6−d)(y+4−d)

Since , there exists such that for every with , . Similarly, , so there exists such that for every with , . For instance, we may take . Take , take , and take . Since and have degree , has degree , and has degree , is a polynomial threshold function of degree . Towards computing , we first consider the relevant behaviors of separately. All four are integer-valued (evaluations of) polynomials on the domain . Both and are always odd, so in particular, are non-zero everywhere. Firstly, is positive when , is zero when , and has the same sign as when . Clearly, is positive when and zero when . By choice of , is never in the interval and always has the same sign as when is non-zero. Similarly, is always non-zero and always has the same sign as when is non-zero. Hence we may rewrite as the following piecewise function:

 g(x,y) =⎧⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪⎨⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪⎩(−1)dy<1−dsgn((−1)dp2(x,y))y=1−dsgn(p4(x,y))|y|d−1

Since is positive only at the two points and when , the above piecewise representation shows that when , computes the parity function except at the two points and (illustrated in Figures 2 and 3).

We now define by . Note that because is symmetric, this gives a well-defined function . It is easily verified that for all such that and at the two points and , , and that for all other , . Hence there are ten edges in for which . This allows us to compute as follows:

 D[fn,d]= D[f∗n,d] +(n+d−2)(n−3n−d−42)+3(n+d−2)(n−3n−d−42) −3(n+d−2)(n−3n−d−42)−6(n−3n−d−42)−(n−d−4)(n−3n−d−42) = D[f∗n,d]+(2d−4)(n−3n−d−42) ∈ ⎛⎜⎝1+Ω⎛⎜⎝(nn−d2)n(nn2)⎞⎟⎠⎞⎟⎠D[f∗n,d] ⊆ (1+Ω(n−1e−d2/n))D[f∗n,d]. (Stirling’s Inequality)

Hence is an -HSF, as desired.

Theorem 3.5

For every with , and odd, there exists an -HSF with .

Proof. The proof proceeds similarly to the previous case. We define , , , , , , , , , , and as above, and we define . We now define analogously to above. The computation of now proceeds as follows:

 D[fn,d]= D[f∗n,d] +n+d−32(n−3n−d−32)+3n+d−32(n−3n−d−32) −3n+d−32(n−3n−d−32)−3(n−3n−d−32)−n−d−32(n−3n−d−32) +n+d−12(n−3n−d−52)+3n+d−12(n−3n−d−52) −3n+d−12(n−3n−d−52)−3(n−3n−d−52)−n−d−52(n−3n−d−52) = D[f∗n,d]+(d−3)(n−3n−d−32)+(d−1)(n−3n−d−52) ∈ ⎛⎜⎝1+Ω⎛⎜⎝(nn−d2)n(nn2)⎞⎟⎠⎞⎟⎠D[f∗n,d] ⊆ (1+Ω(n−1e−d2/n))D[f∗n,d]. (Stirling’s Inequality)

Hence is an -HSF, as desired.

This also completes the proofs of Theorems 3.3 and 1.1.

4 Conclusion

For almost all and almost all , we refute the Gotsman-Linial Conjecture (Conjecture 1.1) with a multiplicative separation of . This separation is too weak to refute most known applications of the conjecture. We would need to improve to to refute the Asymptotic Gotsman-Linial Conjecture (Conjecture 1.2), on which the applications depend. Although for every -HSF given in this paper, , it should be noted that the RHS is still an upper bound in a limiting sense. This, along with the intuition presented in Section 1.2, invites the following revised conjecture.

Conjecture 4.1 (Gotsman-Linial - Limit)

Let be an -PTF. Then the average sensitivity .

Conjecture 4.1 would resolve the remaining cases of Conjectures 3.1 and 1.1, i.e.

Conjecture 4.2

For every even , -HSFs do not exist.

Furthermore, our revised conjecture would imply the Asymptotic Gotsman-Linial Conjecture (Conjecture 1.2) and its consequent applications.

Acknowledgments

The author would like to thank Ryan Williams especially for inspiration, advice, feedback, and an admirable tolerance of cheesemonkeys; the Williams family, the Chap-people, Henry Qin, and Carolyn Kim for moral support and a good work environment; and Not Luke the goldfish for surviving.

References

• [ABFR94] James Aspnes, Richard Biegel, Merrick Furst, and Steven Rudich. The Expressive Power of Voting Polynomials. Combinatorica, 14(2):1-14, 1994.
• [Bou79] Jean Bourgain. Walsh subspaces of product spaces. Séminaire D’Analyse Fonctionelle, École Polytechnique, Centre de Mathematiques, pp. IV.1-IV.9, 1979.
• [CSS16] Ruiwen Chen, Rahul Santhanam, and Srikanth Srinivasan. Average-Case Lower Bounds and Satisfiability Algorithms for Small Threshold Circuits. Proceedings for the st Conference on Computational Complexity, pp. 1:1-1:35, 2016.
• [DHKMRST10] Ilias Diakonikolas, Prahladh Harsha, Adam Klivans, Raghu Meka, Prasad Raghavendra, Rocco A. Servedio, and Li-Yang Tan. Bounding the average sensitivity and noise sensitivity of polynomial threshold functions. In

ACM Symposium on Theory of Computing (STOC)

, pp. 533–542, 2010.
• [DRST14] Ilias Diakonikolas, Prasad Raghavendra, Rocco A. Servedio, and Li-Yang Tan. Average sensitivity and noise sensitivity of polynomial threshold functions. SIAM Journal on Computing, pp. 231–253, 2014.
• [FHHMOSWW14] Yuval Filmus, Hamed Hatami, Steven Heilman, Elchanan Mossel, Ryan O’Donnell, Sushant Sachdeva, Andrew Wan, and Karl Wimmer. Real Analysis in Computer Science: A collection of Open Problems. Simons Institute, Berkeley, CA, compiled in 2014.
• [GS10] Parikshit Gopalan and Rocco Servedio. Learning and Lower Bounds for with Threshold Gates. Proceedings for the th International Workshop on Randomization and Computation, pp. 588-601, 2010.
• [GL94] Craig Gotsman and Nathan Linial. Spectral Properties of Threshold Functions. Combinatorica, 14(1):35-50, 1994.
• [HKM09] Prahladh Harsha, Adam Klivans and Raghu Meka. Bounding the sensitivity of polynomial threshold functions. arXiv:0909.5175, 2009.
• [Kan12] Daniel M. Kane. A structure theorem for poorly anticoncentrated Gaussian chaoses and applications to the study of polynomial threshold functions. In Foundations of Computer Science (FOCS), pp. 91–100, 2012.
• [Kan13] Daniel M. Kane. The Correct Exponent for the Gotsman-Linial Conjecture. arXiv:1210.1283, 2013.
• [KW16] Daniel Kane and Ryan Williams. Super-Linear Gate and Super-Quadratic Wire Lower Bounds for Depth-Two and Depth-Three Threshold Circuits. Proceedings for the th Annual ACM SIGACT Symposium on the Theory of Computing, pp. 633-643, 2016.
• [LMN93] Nathan Linial, Yishay Mansour, and Noam Nisan.

Constant depth circuits, Fourier transform, and learnability.

Journal of the ACM, 40(3):607-620, 1993.
• [MP68] Marvin Minsky and Seymour Papert. Perceptrons: an Introduction to Computational Geometry (Expanded Edition). MIT Press, Cambridge, MA, 1988.
• [OD12] Ryan O’Donnell. Open problems in analysis of Boolean functions. arXiv preprint arXiv:1204.6447, 2012.
• [OD14] Ryan O’Donnell. Analysis of Boolean Functions. Cambridge University Press, New York, New York, 2014.
• [ON71] Patrick E. O’Neil. Hyperplane Cuts of an -Cube. Discrete Maths. 1 (1971), 193-195.

Appendix A Appendix

Here we describe how a computer search resolved the cases of and of the Gotsman-Linial Conjecture. First, we note that the problem of determining whether a boolean function is an -PTF is equivalent to determining whether a particular linear program has any feasible solution. Unfortunately, leveraging this fact to compute the maximal average sensitivity of an -PTF with a naïve exhaustive search takes doubly exponential time so is intractable for large (i.e. ). However, by using Lemma 3.1, we can conduct a more efficient search. We maintain a partial function and conduct a DFS in which we define successively on inputs in increasing order of Hamming weight. This allows us to keep bounds on by counting the edges that are already constrained to be monochromatic or dichromatic. When becomes too low or too high, we can prune the search and backtrack before fully defining , allowing (tolerably) efficient searches up to .

a.1 (n,d)=(5,2)

In the case , Lemma 3.1 implies that for every -HSF , we have . Because a random boolean function on variables has far fewer than dichromatic edges with high probability, most search branches are pruned early. The modified search confirmed that every -HSF with satisfies , and hence that is the unique -HSF.

a.2 (n,d)=(6,2)

The proof of Lemma 3.4 relies on the claim that for every -PTF