Systems and Networking Review articles

The GCT Program Toward the P vs. NP Problem

Exploring the power and potential of geometric complexity theory.

Posted Jun 1 2012

Ketan Mulmuley on Geometric Complexity Theory from CACM on Vimeo.

Introduction
Key Insights
Geometric Obstructions
The Flip
Why Go for Explicit Proofs?
Why Should GOH and FH Hold?
Frequently Asked Questions
Acknowledgments
References
Author
Footnotes
Figures
Sidebar: Formal Definition of the Varieties
Sidebar: The Strong Flip Theorem
Sidebar: Self-Referential Difficulty
Sidebar: The Decomposition

The GCT Program Toward the P vs. NP Problem, illustration

Geometric complexity theory (GCT) is an approach via algebraic geometry and representation theory toward the P vs. NP and related problems.^9,13,15,29 It was proposed in a series of papers^{4,18,19,20,21,22,24,25,26} and was developed further in Bürgisser and Ikenmeyer,⁷ Bürgisser et al.,⁸ and Landsberg et al.¹⁴ This article gives an informal overview of GCT. It is meant to be an update on the status of the P vs. NP problem reported in Fortnow.¹¹ See Mulmuley²³ for a more detailed, formal overview of GCT.

Key Insights

GCT provides an approach to the foundational questions of complexity theory via algebraic geometry and representation theory.
It reveals formidable explicit construction problems at the crossroads of algebraic geometry, representation theory, and complexity theory, and provides evidence that any approach to the foundational questions of complexity theory would have to resolve explicit construction problems of comparable difficulty. This law of conservation of difficulty rnay explain why P vs. NP and related problems, which look at the surface, have turned out to be so difficult.
It shows how to break the circle of self-reference around the arithmetic P vs. NP and related problems.

Let us begin by recalling an algebraic variant of the P vs. NP problem introduced in a seminal paper.²⁹ It can be formulated in a very concrete form as the permanent vs. determinant problem. Here the permanent of an n x n variable matrix X is defined just like the determinant but without signs.

Specifically,

where x_ij‘s denote the entries of X and σ ranges over all permutations of the integers from 1 to n. Let K, the base field or ring of computation, be , , , or a finite field F_p of p elements, p being an odd prime. We say that perm(X) can be linearly represented as the determinant of an m x m matrix if perm(X) = det(Y) for some m x m matrix Y whose entries are linear combinations (possibly nonhomogeneous) over K of the variable entries of X. The permanent vs. determinant conjecture in Valiant²⁹ is that perm(X) cannot be linearly represented as the determinant of an m x m matrix when m is small, that is, when m is polynomial in n, or more generally, when it is O(2^log^aⁿ) for some constant a > 0.

It is known^3,29 that this conjecture, when K is or F_p, and m is polynomial in n, is implied by a stronger (nonuniform) version^a of the P ≠ NP conjecture or even the weaker #P ≠ NC conjecture. Here, #P denotes the class of functions,^b like the number of satisfying assignments of a Boolean formula, that count the number of solutions of the problems in NP, and NC denotes the class of functions,^b like the determinant, that can be computed efficiently in parallel in polylogarithmic time using polynomially many processors. The implication of the permanent vs. determinant conjecture from the (nonuniform) #P vs. NC conjecture is based on the fact that the permanent is #P-complete²⁹ (in the spirit of the well-known NP-completeness) and that the determinant is (almost) NC-complete. It is also known that the permanent vs. determinant conjecture, when K is a large enough finite field F_p and m = O(2^clog²ⁿ) for some large enough constant c > 0, implies the #P NC conjecture. As such, the permanent vs. determinant conjecture is, strictly speaking, an algebraic analogue of the #P vs. NC conjecture, not the P vs. NP conjecture. There is also an analogous algebraic analogue of the P vs. NP conjecture^21,25 which, when K is a large enough finite field, implies the usual P ≠ NP conjecture. But its story is similar to that of the permanent vs. determinant conjecture. Hence, for simplicity, we only focus on the permanent vs. determinant conjecture here.

By the arithmetic case of this conjecture, we mean the case when K = , , or . This case for K = is implied by the case when K = F_p and also, as already mentioned, by the (nonuniform) #P vs. NC conjecture. The arithmetic case is easier than the case when K = F_p, because it avoids complications in algebra that arise in the case of finite fields.

Hence, let us first discuss the arithmetic case when K = , which implies the cases when K = or . The advantage of dealing with the arithmetic conjecture over , in contrast to the original Boolean conjectures, is that this arithmetic conjecture is a statement about multivariate polynomials over . Hence, we can use techniques from algebraic geometry, which is the study of the common zeroes of sets of multivariate polynomials. These techniques work best when the base field is algebraically closed of characteristic zero, such as . Since the permanent and the determinant are characterized by their symmetries, we can also use techniques from representation theory, which is the study of groups of symmetries. As such, the GCT approach that goes via algebraic geometry and representation theory is very natural in the arithmetic setting.

The articles by Mulmuley and Sohoni^25,26 reduce the arithmetic permanent vs. determinant conjecture to proving existence of geometric obstructions that are proof certificates of hardness of the permanent. The very existence of these obstructions for given n and m implies that the permanent of an n x n variable matrix cannot be linearly represented as the determinant of an m x m matrix. The geometric obstructions are objects that live in the world of algebraic geometry and representation theory. Their dimensions can be large, exponential in n and m. But they have short classifying labels. The basic strategy of GCT, called the flip,^21,22 is to construct the classifying label of some geometric obstruction explicitly in time polynomial in n and m when m is small. It is called the flip because it reduces the lower bound problem under consideration to the upper bound problem of constructing a geometric obstruction label efficiently. The flip basically means proving lower bounds using upper bounds. Its basic idea in a nutshell is (1) to understand the theory of upper bounds (algorithms) first and (2) to use this theory to prove lower bounds later. But one may wonder why we are going for explicit construction of obstructions, when proving existence of an obstruction even nonconstructively suffices in principle. This is because of the flip theorem in Mulmuley,^21,23 which says that in the problem under consideration we are essentially forced to construct some proof certificate of hardness explicitly.

The upper bound problems that arise in the context of the flip turn out to be formidable problems at the frontier of algebraic geometry. The flip theorem mentioned above also says that stronger versions of the permanent vs. determinant conjecture and a standard derandomization conjecture¹² in complexity theory imply together solutions to the upper bound problems in algebraic geometry that are akin to the ones that arise in the flip. Furthermore the article by Mulmuley²² gives evidence that even the upper bound problems that arise in the flip may be essentially implications of these conjectures in complexity theory. This suggests a law of conservation of difficulty, namely, that problems comparable in difficulty to the ones encountered in GCT would be encountered in any approach to the (nonuniform) P vs. NP problem (of which the arithmetic permanent vs. determinant conjecture over is an implication). This does not say that any approach to the P vs. NP problem has to necessarily go via algebraic geometry. But it does suggest that avoiding algebraic geometry may not be pragmatic since it would essentially amount to reinventing in some guise the wheels of this difficult field that have been developed over centuries.

There is also another reason why the explicit construction of geometric obstruction labels turns out to be hard. At the surface it seems that for such efficient construction one may need to compute the permanent itself efficiently, thereby contradicting the very hardness of the permanent that we are trying to prove. By the flip theorem in Mulmuley,^21,23 this self-referential difficulty akin to that in Gödel’s Incompleteness Theorem is also not specific to GCT. Any approach would have to cope with it. The article by Mulmuley²² shows how it can be tackled in GCT by decomposing the lower bound problem under consideration into subproblems without this difficulty. Conceptually, this is the main result of GCT in the arithmetic setting.

Finally, let us discuss the permanent vs. determinant conjecture over finite fields that implies the #P NC conjecture, the story for the algebraic variant of the P vs. NP problem in Mulmuley and Sohoni^21,25 that implies the usual (Boolean) P vs. NP conjecture being similar. Here, the GCT plan is to prove the arithmetic case via algebraic geometry over as outlined above first, and then extend this proof to finite fields by proving additional results in algebraic geometry over , or rather, algebraically closed fields of characteristic zero such as . At the surface, this plan may seem counter-intuitive. After all, how can one hope to prove statements about finite fields using algebraic geometry over ? A basic prototype for this plan is the analogue of the usual Riemann hypothesis for finite fields proved in Deligne¹⁰ using algebraic geometry over algebraically closed fields of characteristic zero such as . The proof of this result, a crowning achievement in mathematics, shows that difficult statements about finite fields can be proved using algebraic geometry over algebraically closed fields of characteristic zero. In the same spirit, the GCT approach in the arithmetic setting can be extended so that it applies to the usual (Boolean) #P vs. NC and P vs. NP conjectures. But this story is beyond the scope of this article. It will be described in a later paper.¹⁷ In this article, we confine ourselves to the arithmetic permanent vs. determinant problem, which captures the crux of the P vs. NP problem.

The rest of this article is organized as follows. We describe the notion of geometric obstructions for the arithmetic permanent vs. determinant problem. This is followed by a description of the flip strategy that goes toward explicit construction of geometric obstruction labels in polynomial time. We state the upper bound problems in algebraic geometry that arise in this context. We also describe the self-referential difficulty in the problem under consideration and how GCT tackles it by decomposing the problem into subproblems without this difficulty.

Geometric Obstructions

We now describe the GCT approach to the arithmetic permanent vs. determinant problem²⁹ over based on the notion of geometric obstructions (proof certificates of hardness).

The starting point of the approach is the classical result that the permanent and determinant are completely characterized by their symmetries in the following sense.²⁵

(D): Let Y be a variable m x m matrix. Then, det(Y) is the unique polynomial (up to a constant multiple) of degree m in the variable entries of Y such that, for any m x m invertible complex matrices A and B with det(A) det(B) = 1, det(Y) = det(AY* B), where Y* is Y or its transpose.

(P): Let X be a variable n x n matrix. Then, perm(X) is the unique polynomial (up to a constant multiple) of degree n in the variable entries of X such that, for any diagonal or permutation matrices A and B, perm(X) = perm(AX* B), where X* is X or its transpose, and the product of the entries of A is one, when A is diagonal, and similarly for B.

The goal is to solve the problem under consideration by exploiting these properties. Toward this end, the article²⁵ constructs algebraic varieties Δ[perm, n, m] and Δ[det, m] such that if perm(X), where X is an n x n variable matrix, can be linearly represented as the determinant of an m x m matrix, then

Here, by an algebraic variety we mean the set of common solutions of a system of multivariate polynomial equations over . These are generalizations of the usual curves and surfaces. For example, the set of common solutions in ⁴ of two polynomial equations

is a two-dimensional variety Z formed by intersecting the three-dimensional ellipsoid corresponding to the first equation with the three-dimensional paraboloid corresponding to the second equation. By the coordinate ring of a variety we mean the space of polynomial functions on it. This is obtained by restricting the polynomial functions on the ambient vector space containing the variety to the variety. For example, the coordinate ring of Z here is the space of polynomial functions on C⁴ restricted to Z.

The varieties Δ[det, m] and Δ[perm, n, m] are formally defined later. Intuitively, the points in the variety Δ[det, m] correspond to the functions in the arithmetic analogue of NC called VP²⁹ or the “limits” of such functions, and the points in Δ[perm, n,m] correspond to the functions in the arithmetic analogue of #P called VNP²⁹ or the “limits” of such functions. Since the permanent vs. determinant conjecture is the arithmetic analogue of the #P vs. NC conjecture, it thus suffices to show that the inclusion (1) does not hold when m is small.

The goal is to show using algebraic geometry and representation theory that the inclusion (1) is impossible, as conjectured in Mulmuley and Sohoni,²⁵ when m is polynomial in n. We call this the strong permanent vs. determinant conjecture. It implies the original conjecture and is almost equivalent to it in the sense that if (1) holds then perm(X) can be approximated infinitesimally closely by a linear representation of the form det(Y‘), with dim(Y‘) = m. The following is a partial result toward the above stronger conjecture.

Theorem¹⁴ The inclusion (1) is impossible if m ≤ n²/2.

This implies the earlier quadratic lower bound¹⁶ for the permanent but is a bit stronger.

As an aid to prove the strong permanent vs. determinant conjecture in general, Mulmuley and Sohoni²⁶ define the notion of a geometric obstruction to the inclusion (1). Informally, a geometric obstruction is a representation-theoretic object that lives on Δ[perm, n, m] but not on Δ[det, m]; see Figure 1. The very existence of such an obstruction serves as a guarantee that the inclusion as in (1) is not possible, because otherwise the obstruction would be living on Δ[det, m] as well.

To define geometric obstructions precisely, we need to recall some basic facts from representation theory. Let G = GL_k( ) be the general linear group of k x k complex invertible matrices. We call a vector space W a representation of G if there is a homomorphism from G to the group of invertible linear transformations of W. For example, ^k with the usual action of G is its standard representation. There are, of course, far more complex representations of G. Their building blocks were classified by Hermann Weyl.³⁰ He showed that irreducible (polynomial) representations of G are in one-to-one correspondence with nonnegative integer sequences (called partitions) λ = (λ₁,…,λ_l), where λ₁ ≥ λ₂ ≥ … λ_l ≥ 0, and l ≤ k. An irreducible representation of G in correspondence with λ is denoted by V_λ(G). It is called a Weyl module of G. For example, the standard representation ^k of G mentioned above is the Weyl module corresponding to the partition (1) consisting of just one integer 1. The Weyl module V_λ(G), when λ = r, is simply the space Sym^r(z₁,…, z_k) of all homogeneous polynomials of degree r in the variables z₁,…, z_k with the following action of G. Given a polynomial f( ) = f(z₁,…, z_k) ε Sym^r(z₁,…, z_k) and σ ε G, map f( ) to

Each finite-dimensional representation of G is like a complex building that can be decomposed into the building blocks—the Weyl modules. Fundamental significance of Weyl’s classification results from the complexity theoretic perspective is the following. The dimension of each Weyl module V_λ(K) is in general exponential in the bit length of λ. But it has a compact (polynomial size) specification, namely, the labeling partition λ. Existence of such compact specifications of irreducible representations of G plays a crucial role in what follows.

If W is a representation of G, then the elements of G act on W, moving its points around via invertible linear transformations. More generally, a group can similarly act on a variety too. As a simple example, consider the ellipsoid E ⊆ ³ with the equation x²₁ + x²₂ + x³₃/a = 0, a > 0. Let U be the unit circle. It becomes an additive group if we identify each point in U with its polar coordinate θ and let the usual addition of angles play the role of the group composition. The group U has a natural action on E: let θ ε U act on E by rotating E around the x₃ axis by the angle θ; see Figure 2. Let [E] be the coordinate ring of E. This is the space of polynomial functions on ³ restricted to E. Then, this action of U on E also makes [E] a representation of U: given θ ε U just map any polynomial function f( ) = f(x₁, x₂, x₃) on E to f(θ · ), where θ · E denotes the point obtained by rotating ε E around the x₃ axis by the angle θ.

Similarly, the group G = GL_k , with k = m², acts on the varieties Δ[det, m] and Δ[perm, n, m] moving their points around (Figure 1), and this action of G on the varieties makes their coordinate rings (the spaces of polynomial functions on them) representations of G. A formal definition of the action of G and the representation structures of the coordinate rings of Δ[det, m] and Δ[perm, n, m] are discussed later.

These representation structures turn out²⁶ to depend critically on the properties (D) and (P), respectively. Specifically, the properties (D) and (P) put strong restrictions as to which irreducible representations of G can occur as G-subrepresentations of these coordinate rings.

Formally, a geometric obstruction to the inclusion (1) for given n and m is an irreducible representation V_λ(G) of G (a Weyl module) that occurs as a G-subrepresentation in the coordinate ring of Δ[perm, n, m] but not in the coordinate ring of Δ[det, m]^c; see Figure 1. The partition λ here is called a geometric obstruction label. The existence of such an obstruction guarantees that the inclusion as in (1) is impossible, because otherwise the obstruction would occur as a G-subrepresentation in the coordinate ring of Δ[det, m] as well.

Thus, to solve the (strong) permanent vs. determinant conjecture, it suffices to show the following:

Geometric obstruction hypothesis (GOH).²⁶ A geometric obstruction exists when m is polynomial in n.

It is conjectured in Mulmuley²² that GOH, or rather its slightly relaxed form, is equivalent to the strong permanent vs. determinant conjecture.

The Flip

With the help of GOH, we have reduced the nonexistence problem under consideration to an existence problem. For general varieties, such an existence problem is hopeless. But we can hope to prove existence of a geometric obstruction using the characterization by symmetries provided by the properties (P) and (D). We turn to this story here.

The strategy is to construct, for any n and m polynomial in n, a geometric obstruction label λ explicitly in time polynominal in n and m by exploiting the properties (P) and (D). We call this strategy the flip, because it reduces the nonexistence problem under consideration to the problem of proving existence of a geometric obstruction, and furthermore, the lower bound problem is reduced to the upper bound problem of constructing a geometric obstruction label in polynomial time.

The following is a stronger and precise explicit form of GOH that says geometric obstructions can indeed be constructed explicitly.

Flip Hypothesis (FH).^22,23 The geometric obstruction family is explicit in the sense that it satisfies the following properties:

FH[Short]: A short geometric obstruction label λ, with bit length polynomial in n and m, exists if m is polynomial in n.

FH[Verification]: Given n, m, and a partition λ, it can be verified whether λ is a valid geometric obstruction label in time polynomial in n, m and the bit length of λ.

FH[Discovery and construction]: Given n and m, it can be decided whether a geometric obstruction exists in time polynomial in n and m. If an obstruction exists, one such geometric obstruction label λ can also be constructed in the same time. By FH[short], this discovery algorithm always succeeds if m is polynomial in n.

FH[Det]: For given m and λ, it can be verified whether V_λ(G) occurs as a G-subrepresentation in the coordinate ring^d of Δ[det, m] in time polynomial in m and the bit length of λ.

FH[Perm]: For given n, m and λ, it can be verified whether V_λ(G) occurs as a G-subrepresentation in the coordinate ring of Δ[perm, n, m] in time polynomial in n, m and the bit length of λ.

The flip strategy can now be elaborated further in three steps: (1) Prove FH[Det] and FH[Perm]. This clearly implies an efficient criterion for verifying a geometric obstruction label as in FH[Verification]. (2) Use this criterion to design an efficient algorithm for discovering an obstruction as in FH[Discovery]. (3) Prove that this discovery algorithm always succeeds if m is a polynomial in n. For this strategy to succeed, it is not enough if the verification and discovery algorithms are efficient only in theory. They should also have simple enough mathematical structure to carry out step (3). Otherwise, they have to be made simpler and simpler until (3) succeeds.

The best algorithms for verification and construction of a geometric obstruction label based on general-purpose algorithms in algebraic geometry and representation theory take triple exponential time in n and m in the worst case.

Later, we discuss why FH should hold. There is a huge gap between FH and what can be proved at present. Currently, the best algorithms for verification and construction of a geometric obstruction label based on general-purpose algorithms in algebraic geometry and representation theory take triple exponential time in n and m in the worst case. FH says this time bound can be brought down to a polynomial. This may seem impossible.

Why Go for Explicit Proofs?

If so, one may ask as to why we should go for explicit construction of obstructions when proving existence of obstructions even nonconstructively suffices in principle. The reason is provided by the strong flip theorem.²¹ It says that any proof of the arithmetic (strong) permanent vs. determinant conjecture can be converted into an explicit proof assuming a stronger form of a standard derandomization hypothesis¹² in complexity theory that is generally regarded as easier than the target lower bound. By an explicit proof, we mean that the proof also yields an algorithm for efficient construction of some proof certificate of hardness of the permanent, called an obstruction, that is analogous to the geometric obstruction above in the following sense: (1) its very existence for given n and m guarantees that the inclusion (1) is impossible, and (2) the family of obstructions satisfies analogues of FH[short], FH[verify], and FH. Thus, by the strong flip theorem, the strong permanent vs. determinant conjecture essentially forces an explicit proof, modulo derandomization.

There are similar flip theorems²¹ for other lower bound problems, such as the usual permanent vs. determinant and the arithmetic P vs. NP problems, and a certain average case stronger form of the Boolean P vs. NP problem. These results are the main reason why we are going toward explicit proofs, that is, toward explicit construction of obstructions, right from the beginning.

The derandomization hypothesis mentioned here is the following. Its importance is based on the fundamental result in Kabanets and Impagliazzo¹² that derandomization means proving circuit lower bounds. Let Y(X) be an m x m matrix, whose each entry is a complex linear combination (possibly nonhomogeneous) of the variable entries of X. The problem is to decide if det(Y(X)), for given Y(X), is an identically zero polynomial in the variable entries of X. There is a simple and efficient randomized algorithm for this test. Let A be a matrix obtained from X by substituting for each entry of X a large enough random integer of bit length polynomial in n and m. Evaluate det(Y(A)) modulo a large enough random integer b. If it is nonzero, then det(Y(X)) is certainly a nonzero polynomial. If it is zero, then det(Y(X)) is an identically zero polynomial with a high probability. This randomized test is a black-box test in the sense it only needs to know the value of det(Y(X)) for a given specialization of X to A. It does not need to know Y(X). The derandomization hypothesis mentioned earlier is essentially that this randomized black-box determinant identity test can be efficiently derandomized so as to get an efficiently deterministic black-box determinant identity testing algorithm. (The required hypothesis is actually a bit stronger.²¹) This derandomization hypothesis, which is somewhat different from the one in Kabanets and Impagliazzo,¹² is essentially equivalent to proving a determinantal lower bound for a multilinear function that can be evaluated in exponential time¹. This is generally regarded as easier than proving a determinantal lower bound for the permanent since #P is conjecturally smaller than EXP, the class of functions that can be computed in exponential time.

Why Should GOH and FH Hold?

The strong flip theorem^21,23 actually shows something much more. It shows that stronger forms of the permanent vs. determinant and derandomization conjectures together imply an analogue of FH in algebraic geometry of comparable difficulty. This reveals that formidable upper bound problems in algebraic geometry are hidden underneath the fundamental hardness and derandomization conjectures in complexity theory. This may explain why these conjectures, which look so elementary at the surface, have turned out to be so formidable. In view of the strong flip theorem, problems of comparable difficulty can be expected in any approach, even if the approach does not go via algebraic geometry. We refer to this as the “law of conservation of difficulty.”

The article by Mulmuley²² gives evidence based on the strong flip theorem and additional results in algebraic geometry, which suggests that FH itself may be in essence an implication of the strong permanent vs. determinant and derandomization conjectures together. At present, this is the main evidence for FH and, hence, GOH. Further evidence is provided by a recent article,⁷ which constructs explicit geometric obstructions in the analogous setting for the lower bound problem for matrix multiplication, albeit for a problem of very modest size. Explicit computation for any larger example is difficult at present due to the difficulty of the problems that arise.

The strong flip theorem for the permanent vs. determinant conjecture and analogous results in Mulmuley²¹ for other fundamental hardness conjectures in complexity theory, such as the arithmetic P vs. NP conjectures, show a fundamental difference between such hardness conjectures that are at least as hard as the derandomization conjectures and the known lower bound results in the restricted models of computation such as constant depth⁵ or monotone²⁷ circuits. The lower bounds in these restricted models are statements about the weakness of these models. In contrast, by the strong flip theorem, the permanent vs. determinant problem is a statement about the strength of the complexity class NC (or rather its arithmetic analogue²⁹ VP) for which the determinant is essentially complete. It does not say that NC (or rather VP) is small and weak, but rather that it is big and strong—strong enough to assert that “I am different from #P” (or rather its arithmetic analogue VNP²⁹), for the permanent is complete. Similarly, by an analogous flip theorem for the (arithmetic) P vs. NP problem, this problem is a statement about the strength of the complexity class P. It does not say that P is weak and small but rather that it is big and strong—strong enough to assert that “I am different from NP.”

It should also be remarked that FH will almost never hold for functions not characterized by their symmetries (in place of the determinant and the permanent), since the characterization by symmetries plays a crucial role in the proof of the strong flip theorem that forms the crux of the justification of FH. This is why the characterization by symmetries is so crucial for the flip strategy. It is indeed a fortunate coincidence that the fundamental complexity classes such as #P and NC have complete functions characterized by their symmetries.

Frequently Asked Questions

Can GCT be used to prove some modest lower bounds first?

Given the difficulty of the fundamental hardness conjectures, one may ask if GCT can be used to prove some modest lower bounds first. That is indeed so. Currently, the best known lower bounds in the context of the P vs. NC and strong permanent vs. determinant problems are both based on GCT. The first lower bound is a special case of the P ≠ NC conjecture proved in Mulmuley.¹⁸ It says that the P-complete max-flow problem cannot be solved in polylogarithmic time using polynomially many processors in the PRAM model without bit operations. This model is quite realistic and natural in contrast to the constant depth⁵ or monotone²⁷ circuit models used for proving lower bounds earlier. This lower bound is currently the only known super-polynomial lower bound that is a nontrivial implication of a fundamental separation conjecture like the P ≠ NC conjecture and holds unconditionally in a natural and realistic model of computation. Its proof is geometric and quasi-explicit. No combinatorial or elementary proof is known so far. This result was the beginning of the GCT approach to the fundamental hardness conjectures. The second lower bound based on GCT constructions, specifically the varieties Δ[det, m] and Δ[perm, n, m], is the quadratic lower bound¹⁴ stated earlier in the context of the strong permanent vs. determinant conjecture. It is a stronger form of the earlier quadratic lower bound¹⁶ for the usual permanent vs. determinant problem. The proof in Mignon and Ressayre¹⁶ is elementary and does not need GCT. The difference between the strong and usual versions of the permanent vs. determinant problem in Landsberg et al.¹⁴ and Mignon and Ressayre¹⁶ is akin to the difference between the tensor rank and usual versions of the lower bound problem for matrix multiplication.⁶

See also the lower bounds for matrix multiplication based on the fundamental work²⁸ that introduced invariant theory in complexity theory.

Are explicit proofs necessary?

By the strong flip theorem, we know that any proof of the strong permanent vs. determinant conjecture leads to an explicit proof modulo derandomization. This does not say that explicit proofs are necessary. There may be nonexplicit proofs that avoid derandomization altogether. But this does suggest that if derandomization is indeed easier than the fundamental hardness conjectures¹² as the complexity theory suggests, then even such nonexplicit proofs would essentially have the necessary mathematical ingredients to construct proof-certificates of hardness efficiently a posteriori. If so, it makes sense to go toward this efficient construction right from the beginning. This allows us to use the theory of algorithms—the main tool of complexity theory—in the study of the fundamental lower bounds. Indeed, it is unrealistic to expect that we can prove P ≠ NP without understanding the complexity class P and the theory of algorithms in depth first, as the flip strategy suggests.

The situation here may be compared to that for the well-known four color theorem.² In principle, this theorem may be proved nonconstructively. Yet, the fact remains that all known proofs of this theorem are explicit in the sense that they also yield efficient algorithms for finding a four coloring as a byproduct. The flip theorem suggests that the story of the fundamental hardness conjectures in complexity theory may be similar.

In this sense, these conjectures are fundamentally different from other conjectures in mathematics such as the Riemann Hypothesis. Since there is no analogous flip theorem for the Riemann Hypothesis, it may have a nonconstructive proof that gives no hint on how to test efficiently if the n-th zero of the Riemann zeta function lies on the critical line.

Is algebraic geometry necessary?

According to Valiant,²⁹ the arithmetic permanent vs. determinant conjecture over is implied by the #P vs. NC conjecture. By the strong flip theorem,^21,23 stronger forms of the fundamental hardness and derandomization hypotheses in the arithmetic setting imply an analogue of FH in algebraic geometry of comparable difficulty. We have already argued on the basis of these results as to why it is not pragmatic to avoid algebraic geometry, even though it is not formally necessary.

Another concrete evidence for the power of algebraic geometry even in the Boolean setting is provided by the proof of the special case of the P ≠ NC conjecture.¹⁸ It has to be emphasized here that, unlike the earlier lower bounds in the algebraic model,⁶ this lower bound is Boolean, not algebraic. This is because it is in terms of the bit length of the input, though the PRAM model in Mulmuley¹⁸ does not allow bit operations. At present, to our knowledge, this is the only nontrivial implication of a fundamental hardness conjecture that can be proved unconditionally in a natural and realistic model of computation. If we cannot prove even this easier implication of the P ≠ NC conjecture by elementary techniques, it seems unrealistic to expect that we can prove the far harder P ≠ NC (or NP) conjecture by elementary techniques.

When can we expect a hard lower bound?

The modest lower bounds based on GCT and the earlier modest lower bounds^3,6 are separated from the fundamental hardness conjectures that are at least as hard as derandomization by the circle of self-referential difficulty, see Figure 3. To break into this circle, we have to show that P contains formidable explicit construction problems in algebraic geometry and representation theory, such as the ones that arise in the strong flip theorem or FH. By the law of conservation of difficulty based on the strong flip theorem, comparable understanding of P is needed in any approach. Unfortunately, our current understanding of P is very modest. Until we understand P (the theory of algorithms) and geometry in the required depth, we may not expect any further lower bounds that are fundamentally different from the modest lower bounds discussed previously.

Conclusion

GCT has broken the circle of self-reference around the fundamental hardness conjectures in the arithmetic setting and, in the process, has revealed deep explicit construction and positivity problems at the crossroads of algebraic geometry, representation theory, and complexity theory hidden underneath the fundamental hardness conjectures in complexity theory. Given the formidable nature of these problems, this is undoubtedly only the beginning.

Acknowledgments

The author is grateful to Josh Grochow, Jimmy Qiao, Janos Simon, and the referees for helpful comments. The work on this paper was supported by NSF grant CCF-1017760.

Figures

Figure 1. A geometric obstruction.

Figure 2. An ellipsoid.

Figure 3. Division in the world of lower bounds by the circle of self-reference.

Sidebar: Formal Definition of the Varieties

For the interested readers, we now formally define the varieties Δ[det, m] and Δ[perm, n, m] and the action of G on them.

Let Y be an m x m variable matrix. Let X be an n x n submatrix of Y, say its lower-right n x n subminor. Let z be any entry of Y outside X. Let V be the vector space of homogeneous polynomials of degree m in the variable entries of Y. Thus, det(Y) is an element of V. Let [det, m] be the set of elements in V of the form det(Y‘), where Y‘ is an m x m matrix whose entries are complex homogeneous linear combinations of the variable entries of Y. Then, Δ[det, m] ⊆ V is the closure of [det, m] in V in the usual complex topology of V. It can be shown to be an algebraic variety. The variety Δ[perm, n, m] ⊆ V is constructed similarly using the homogeneous polynomial z^mn perm(X) ε V in place of the determinant. It can be shown²⁵ that these varieties have the required property mentioned in (1). Actually the varieties here are the affine cones of the projective varieties defined in Mulmuley and Sohoni.²⁵

The action of G = GL_k( ), k = m², on these varieties is defined as follows. First, observe that the space V is a representation of G by a natural action which, for any matrix σ ε G, maps any point (homogeneous polynomial) p(Y) ε V to the point p(σ⁻¹Y). Here, we think of Y as an m²-vector by straightening it, say, rowwise. It can be shown that Δ[det, m] and Δ[perm, n, m] are invariant under this action of G on V. This induces a natural action of G on these varieties as well. Under this action, each matrix in G acts on a variety by moving its points around and thereby inducing an automorphism. With this action, the coordinate ring of Δ[det, m], by which we mean the space of polynomial functions on V restricted to Δ[det, m], becomes a representation of G: just map a polynomial function f(v) to f(σ⁻¹·v) for any σ ε G and v ε Δ[det, m]. Here σ⁻¹·v denotes the point in Δ[det, m] obtained by letting σ⁻¹ ε G act on v. The coordinate ring of Δ[perm, n, m] is similarly a representation of G.

Sidebar: The Strong Flip Theorem

To state the strong flip theorem, we need a few definitions. Let Δ[det, m], Δ[perm, n, m], and V be as in the sidebar “Formal Definition of the varieties.”

By a global obstruction set for given n and small m polynomial in n, we mean a set S_n,m = {X₁,…,X_l} of nonnegative integral n x n matrices with the following property. Fix any point (homogeneous polynomial) p(Y) ε Δ[det, m] ⊆ V. Let p‘(X) denote the polynomial obtained from p(Y) by substituting zero for all variables in Y other than z and X, and 1 for z. Then, for any such p(Y) ε Δ[det, m], there exists a counterexample X_i ε S_n,m such that p‘(X_i) ≠ perm(X_i). Thus, S_n,m contains a counterexample against every point in Δ[det, m] that shows that the point does not specialize to perm(X). This guarantees that the inclusion as in (1) is not possible for given n and m. We say that S_n,m is small if l is polynomial in n.

We call a proof of the strong permanent vs. determinant conjecture extremely explicit if, for each n and small m polynomial in n, it shows existence of a set of bit strings called obstructions, which serve as proof certificates of hardness of perm(X), with the following properties E0E3.

E0 [Short]: For every n and small m polynomial in n, there exists a short obstruction of bit length polynomial in n.
E1 [Easy to decode]: Given any such short obstruction s for given n and small m polynomial in n, one can construct in time polynomial in n a small global obstruction set S_n,m(s). Thus, each short obstruction s denotes a small global obstruction set.
E2 [Easy to verify]: Given a bit string s and n and m, it can be decided whether s is the specification of an obstruction for n and m in time polynomial in n and m, and the bit length of s.
E3 [Easy to construct]: For each n and small m polynomial in n, a valid obstruction can be constructed in time polynomial in n.

There are some additional technical properties that an extremely explicit proof has to satisfy; see Mulmuley^21,23 for its details. The properties E0, E2, and E3 are analogues of the properties FH[Short], FH[Verification], and FH[Construction] of geometric obstruction labels (which we identify with geometric obstructions). The geometric obstruction labels also conjecturally satisfy the analogue of E1 for decoding (though this was not stated in the statement of FH).

We call a proof extremely explicit in a stronger N C-sense, if the various algorithms in the conditions E1E3 work in polylogarithmic time using polynomial number of processors instead of sequential polynomial time.

We say that a technique for proving the strong permanent vs. determinant conjecture is a flip if it leads to an extremely explicit proof of the conjecture. It is called a flip because it reduces the nonexistence problem to the problem of proving existence of obstructions and the lower bound problem to the upper bound problem of finding efficient algorithms to verify, construct, and decode obstructions.

For similar definitions of explicit proofs for other lower bound problems, such as the usual permanent vs. determinant and P vs. NP problems, see Mulmuley.^21,23

The Strong Flip Theorem.^21,23 Suppose the strong permanent vs. determinant conjecture holds and that black box determinant identity testing¹² can be derandomized (in a stronger form as specified in Mulmuley²¹). Then, the strong permanent vs. determinant conjecture has an extremely explicit proof in the stronger NC-sense. In particular, for any m polynomial in n, an explicit global obstruction set S_n,m can be constructed in time polynomial in n and, more strongly, in polylogarithmic time (in n) using polynomial number of processors.

The proof of the strong flip theorem depends critically on the characterization by symmetries of the permanent as per the property (P). Alternatively, one can also use downward self-reducibility of the permanent that has several other applications in complexity theory.³

Currently, the best algorithms for the construction of a global obstruction set S_n,m based on general purpose algorithms in algebraic geometry take triple exponential time in n and m in the worst case and the bit length of the constructed obstruction is double exponential in the worst case. Bringing this time down to polynomial may seem impossible at the surface. This situation is very similar to that for FH (see the remarks after the statement of FH). Thus, the difficulty of proving E0, E1, E2, and E3 for global obstruction sets is comparable to the difficulty of proving FH for geometric obstructions. The strong flip theorem says that the time bound can indeed be brought down to polynomial for geometric obstruction sets assuming the strong permanent vs. determinant conjecture and the derandomization hypothesis.

Sidebar: Self-Referential Difficulty

The strong flip theorem also reveals the self-referential difficulty in the permanent vs. determinant conjecture.

To see this, let us examine closely the properties E1E3 of an explicit proof that any proof of this conjecture can be converted into modulo derandomization by the strong flip theorem. Let S_n,m(s) be a global obstruction set denoted by an obstruction string s. To verify if S_n,m(s) indeed contains a counterexample against a given point (homogeneous polynomial) p(Y) ε Δ[det, m], we have to check if p‘(X) ≠ perm(X) for some X ε S_n,m(s). Assuming that the permanent is hard to compute, we cannot check efficiently if p‘(X) ≠ perm(X) for general X. Yet, by E2 (in the stronger NC-sense), it can be checked efficiently whether S_n,m(s) contains a counterexample against every p(Y) ε Δ_v[g, m] (even in parallel). This seems to contradict the very hardness of the permanent that we are trying to prove.

This self-referential paradox is only an apparent paradox, because assuming the permanent vs. determinant conjecture and an additional derandomization hypothesis we can construct an extremely explicit proof by the strong flip theorem. But this is a circular argument. The main difficulty is to make headway in the construction of an explicit proof without making an assumption in any guise that is as hard as or harder than the target lower bound assumption.

Analogous flip theorems²¹ reveal similar self-referential difficulty in other variants of the P vs. NP problem harder than derandomization, such as the arithmetic P vs. NP problem.²⁵ Intuitively, the apparent self-referential paradox arises because the P vs. NP conjecture, being a universal statement about mathematics that says that discovery is hard, can potentially make the discovery of its own proof hard.

The situation here is akin to (but far harder than) the situation for another universal statement about mathematics, Gödel’s Incompleteness Theorem. This result says that there are true statements that cannot be proved. This does not say that this universal statement itself cannot be proved. As we know now, it can be proved. But the crux of this proof is the resolution of this apparent self-referential paradox by the construction of a statement that says “I cannot be proved.” Similarly, the root difficulty in the P vs. NP (and the permanent vs. determinant) problem is the resolution of the apparent self-referential paradox in the construction of the statement that says “I am different from NP (#P).”

In view of this self-referential paradox, the main conceptual difficulty in proving the permanent vs. determinant conjecture is to break the circle of self-reference by decomposing the conjecture into subproblems without the self-referential difficulty.

Sidebar: The Decomposition

We now describe how the circle of self-reference is broken in GCT.

Toward this end, observe that FH[Det] does not have the self-referential difficulty in the sense that (1) m is not required to be a small function of n in its statement, and (2) it only depends on the properties of the determinant, and not on the relationship between the permanent and the determinant (or equivalently, between the complexity classes #P and NC). The case of FH[Perm] is similar.

FH[Det] and FH[Perm] together imply FH[verification], which says that geometric obstructions in GOH are easy to verify. We saw that the self-referential difficulty is the main obstacle to efficient verification of obstructions as needed in E2. Hence, once FH[Det] and FH[Perm] are proved, GOH does not have the self-referential difficulty in verification any more. This decomposes the strong permanent vs. determinant conjecture into three subproblems without the self-referential difficulty in verification, namely, FH[Det], FH[Perm], and GOH. Pictorially,

Here the solid arrow ← denotes the formal implication—this follows trivially since GOH itself implies the strong permanent vs. determinant conjecture. The dotted arrow … > indicates the evidence given in Mulmuley²² for the plausible converse based on the strong flip theorem. This decomposition breaks the circle of self-reference for verification. Intuitively, the circle is broken here because the task of verifying a geometric obstruction naturally breaks into two independent tasks, one depending only on the permanent (i.e., the complexity class #P) and the other only on the determinant (the complexity class NC). This is the fundamental difference between geometric obstructions and the global obstruction sets in the strong flip theorem.

The article by Mulmuley²² also describes an approach to prove FH assuming certain positivity hypotheses in algebraic geometry and representation theory. The first positivity hypothesis called PH1 basically says that, for given n, m, and λ, the number of copies of the Weyl module V_λ(G) that occur in the coordinate ring of Δ[det, m] (and similarly Δ[perm, n, m]) has a positive (#P) formula without alternating signs, akin to the usual positive formula for the permanent. We do not discuss other positivity hypotheses here. These hypotheses are again supported by the strong flip theorem, which suggests²² that these hypotheses too (like FH) may be in essence implications of the strong permanent vs. determinant and derandomization conjectures together. Furthermore, the self-referential difficulty is absent in these positivity hypotheses for the same reason that it is absent in FH[Det] and FH[Perm]. The decomposition theorem^22,23 decomposes the strong permanent vs. determinant conjecture in terms of these positivity hypotheses and a more refined form of GOH (called OH), which too is without the self-referential difficulty once the positivity hypotheses are proved. Unlike (3), this decomposition yields an approach to prove FH[Discovery] in addition to FH[Verification]. See Mulmuley^22,23 for its details.

The positivity hypotheses above turn out to be formidable because as explained in Mulmuley²² they encompass and go much further than the century-old plethysm problem in algebraic geometry and representation theory. Since in view of the strong flip theorem these may be essentially implications of the strong hardness and derandomization conjectures, problems of comparable difficulty can be expected in any approach. In this sense, positivity (like explicit construction) is a hidden root difficulty underneath the fundamental hardness conjectures of complexity theory. This provides yet another reason (in addition to the strong flip theorem) for why these conjectures have turned out to be so hard though they look so elementary at the surface.

The articles^4,19,20,22 suggest an approach to the positivity hypotheses via nonstandard quantum groups. But this story is beyond the scope of this article.

See Mulmuley²² for the GCT approach to the arithmetic P vs. NP problem.

Submit an Article to CACM

CACM welcomes unsolicited submissions on topics of relevance and value to the computing community.

You Just Read

The GCT Program Toward the P vs. NP Problem

View in the ACM Digital Library

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and full citation on the first page. Copyright for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or fee. Request permission to publish from permissions@acm.org or fax (212) 869-0481.

DOI

10.1145/2184319.2184341

June 2012 Issue

Published: June 1, 2012

Vol. 55 No. 6

Pages: 98-107

Table of Contents

Join the Discussion (0)

Become a Member or Sign In to Post a Comment

The Latest from CACM

Explore More

News Apr 18 2024

Keeping AI Out of Elections

Bennie Mols

Artificial Intelligence and Machine Learning

BLOG@CACM Apr 17 2024

Technical Marvels

Herbert Bruderer

Computer History

BLOG@CACM Apr 16 2024

The Value of Data in Embodied Artificial Intelligence

Shaoshan Liu

Artificial Intelligence and Machine Learning

Shape the Future of Computing

ACM encourages its members to take a direct hand in shaping the future of the association. There are more ways than ever to get involved.

Get Involved

Communications of the ACM (CACM) is now a fully Open Access publication.

By opening CACM to the world, we hope to increase engagement among the broader computer science community and encourage non-members to discover the rich resources ACM has to offer.

Learn More

Key Insights

Geometric Obstructions

The Flip

Why Go for Explicit Proofs?

Why Should GOH and FH Hold?

Frequently Asked Questions

Acknowledgments

Figures

Sidebar: Formal Definition of the Varieties

Sidebar: The Strong Flip Theorem

Sidebar: Self-Referential Difficulty

Sidebar: The Decomposition

The GCT Program Toward the P vs. NP Problem

DOI

June 2012 Issue

Related Reading

Join the Discussion (0)

Become a Member or Sign In to Post a Comment

Shape the Future of Computing

Communications of the ACM (CACM) is now a fully Open Access publication.