Abstract
We analyze the probability that a random m-dimensional linear subspace of \(\mathbb{R}^{n}\) both intersects a regular closed convex cone \(C\subseteq\mathbb{R}^{n}\) and lies within distance α of an m-dimensional subspace not intersecting C (except at the origin). The result is expressed in terms of the spherical intrinsic volumes of the cone C. This allows us to perform an average analysis of the Grassmann condition number for the homogeneous convex feasibility problem ∃x∈C∖0:Ax=0. The Grassmann condition number is a geometric version of Renegar’s condition number, which we have introduced recently in Amelunxen and Bürgisser (SIAM J. Optim. 22(3):1029–1041 (2012)). We thus give the first average analysis of convex programming that is not restricted to linear programming. In particular, we prove that if the entries of \(A\in\mathbb {R}^{m\times n}\) are chosen i.i.d. standard normal, then for any regular cone C, we have
. The proofs rely on various techniques from Riemannian geometry applied to Grassmann manifolds.
Similar content being viewed by others
References
D. Amelunxen, Geometric analysis of the condition of the convex feasibility problem, Ph.D. thesis, Univ. Paderborn (2011).
D. Amelunxen, P. Bürgisser, Intrinsic volumes of symmetric cones, arXiv:1205.1863.
D. Amelunxen, P. Bürgisser, A coordinate-free condition number for convex programming, SIAM J. Optim. 22(3), 1029–1041 (2012).
A. Belloni, R.M. Freund, A geometric analysis of Renegar’s condition number, and its interplay with conic curvature, Math. Program., Ser. A 119(1), 95–107 (2009).
T. Bonnesen, W. Fenchel, Theorie der Konvexen Körper (Springer, Berlin, 1974). Berichtigter Reprint.
W.M. Boothby, An Introduction to Differentiable Manifolds and Riemannian Geometry, 2nd edn. Pure and Applied Mathematics, Vol. 120 (Academic Press, Orlando, 1986).
S. Boyd, L. Vandenberghe, Convex Optimization (Cambridge University Press, Cambridge, 2004).
P. Bürgisser, Smoothed analysis of condition numbers, in Foundations of Computational Mathematics, Hong Kong, 2008. London Math. Soc. Lecture Note Ser., Vol. 363 (Cambridge University Press, Cambridge, 2009), pp. 1–41.
P. Bürgisser, F. Cucker, Condition: The Geometry of Numerical Algorithms. Grundlehren der Mathematischen Wissenschaften, Bd. 349 (Springer, Berlin, 2013).
P. Bürgisser, F. Cucker, M. Lotz, The probability that a slightly perturbed numerical analysis problem is difficult, Math. Comp. 77(263), 1559–1583 (2008).
I. Chavel, Riemannian Geometry, 2nd edn. Cambridge Studies in Advanced Mathematics, Vol. 98 (Cambridge University Press, Cambridge, 2006). A modern introduction.
Z. Chen, J.J. Dongarra, Condition numbers of Gaussian random matrices, SIAM J. Matrix Anal. Appl. 27(3), 603–620 (2005) (electronic).
D. Cheung, F. Cucker, A new condition number for linear programming, Math. Program., Ser. A 91(1), 163–174 (2001).
F. Cucker, J. Peña, A primal-dual algorithm for solving polyhedral conic systems with a finite-precision machine, SIAM J. Optim. 12(2), 522–554 (2001/2002) (electronic).
J.W. Demmel, The probability that a numerical analysis problem is difficult, Math. Comp. 50, 449–480 (1988).
M.P. do Carmo, Riemannian Geometry. Mathematics: Theory & Applications (Birkhäuser Boston, Boston, 1992). Translated from the second Portuguese edition by Francis Flaherty.
M. Epelman, R.M. Freund, A new condition measure, preconditioners, and relations between different measures of conditioning for conic linear systems, SIAM J. Optim. 12(3), 627–655 (2002) (electronic).
H. Federer, Geometric Measure Theory. Die Grundlehren der mathematischen Wissenschaften, Bd. 153 (Springer, New York, 1969).
S. Filipowski, On the complexity of solving feasible linear programs specified with approximate data, SIAM J. Optim. 9(4), 1010–1040 (1999). Dedicated to John E. Dennis Jr. on his 60th birthday.
R.M. Freund, F. Ordóñez, On an extension of condition number theory to nonconic convex optimization, Math. Oper. Res. 30(1), 173–194 (2005).
R.M. Freund, J.R. Vera, Condition-based complexity of convex optimization in conic linear form via the ellipsoid algorithm, SIAM J. Optim. 10(1), 155–176 (1999) (electronic).
R.M. Freund, J.R. Vera, Some characterizations and properties of the “distance to ill-posedness” and the condition measure of a conic linear system, Math. Program., Ser. A 86(2), 225–260 (1999).
F. Gao, D. Hug, R. Schneider, Intrinsic volumes and polar sets in spherical space, Math. Notae 41, 159–176 (2003), (2001/2002). Homage to Luis Santaló. Vol. 1 (Spanish).
S. Glasauer, Integralgeometrie konvexer Körper im sphärischen Raum, Thesis, Univ. Freiburg i. Br. (1995).
S. Glasauer, Integral geometry of spherically convex bodies, Diss. Summ. Math. 1(1–2), 219–226 (1996).
G.H. Golub, C.F. Van Loan, Matrix Computations, 4th edn. Johns Hopkins Studies in the Mathematical Sciences (Johns Hopkins University Press, Baltimore, 2013).
S. Helgason, Differential Geometry, Lie Groups, and Symmetric Spaces. Pure and Applied Mathematics, Vol. 80 (Academic Press/Harcourt Brace Jovanovich, New York, 1978).
R.A. Horn, C.R. Johnson, Matrix Analysis (Cambridge University Press, Cambridge, 1990). Corrected reprint of the 1985 original.
R. Howard, The kinematic formula in Riemannian homogeneous spaces, Mem. Amer. Math. Soc. 106(509), vi + 69 (1993).
D.A. Klain, G.-C. Rota, Introduction to Geometric Probability. Lezioni Lincee (Cambridge University Press, Cambridge, 1997). [Lincei Lectures]
S. Kobayashi, K. Nomizu, Foundations of Differential Geometry, vol. I (Interscience/Wiley, New York/London, 1963).
F. Morgan, Geometric Measure Theory, 2nd edn. (Academic Press, San Diego, 1995). A beginner’s guide.
M. Moszyńska, Selected Topics in Convex Geometry (Birkhäuser Boston, Boston, 2006). Translated and revised from the 2001 Polish original.
J. Peña, Understanding the geometry of infeasible perturbations of a conic linear system, SIAM J. Optim. 10(2), 534–550 (2000) (electronic).
J. Peña, A characterization of the distance to infeasibility under block-structured perturbations, Linear Algebra Appl. 370, 193–216 (2003).
J. Peña, J. Renegar, Computing approximate solutions for convex conic systems of constraints, Math. Program., Ser. A 87(3), 351–383 (2000).
J. Renegar, Some perturbation theory for linear programming, Math. Program., Ser. A 65(1), 73–91 (1994).
J. Renegar, Incorporating condition measures into the complexity theory of linear programming, SIAM J. Optim. 5(3), 506–524 (1995).
J. Renegar, Linear programming, complexity theory and elementary functional analysis, Math. Program., Ser. A 70(3), 279–351 (1995).
R. Schneider, Convex Bodies: The Brunn-Minkowski Theory. Encyclopedia of Mathematics and Its Applications, Vol. 44 (Cambridge University Press, Cambridge, 1993).
R. Schneider, W. Weil, Stochastic and Integral Geometry. Probability and Its Applications (New York) (Springer, Berlin, 2008).
S. Smale, Complexity theory and numerical analysis, in Acta Numerica, 1997. Acta Numer., Vol. 6 (Cambridge University Press, Cambridge, 1997), pp. 523–551.
M. Spivak, Calculus on Manifolds. A Modern Approach to Classical Theorems of Advanced Calculus (Benjamin, New York/Amsterdam, 1965).
M. Spivak, A Comprehensive Introduction to Differential Geometry. Vol. I, 3rd edn. (Publish or Perish, Houston, 1999).
R.P. Stanley, Log-concave and unimodal sequences in algebra, combinatorics, and geometry, in Graph Theory and Its Applications: East and West, Jinan, 1986. Ann. New York Acad. Sci., Vol. 579 (N.Y. Acad. Sci., New York, 1989), pp. 500–535.
J.A. Thorpe, Elementary Topics in Differential Geometry. Undergraduate Texts in Mathematics (Springer, New York, 1994).
J.R. Vera, Ill-posedness and the complexity of deciding existence of solutions to linear programs, SIAM J. Optim. 6(3), 549–569 (1996).
J.R. Vera, On the complexity of linear programming under finite precision arithmetic, Math. Program., Ser. A 80(1), 91–123 (1998).
J.C. Vera, J.C. Rivera, J. Peña, Y. Hui, A primal-dual symmetric relaxation for homogeneous conic systems, J. Complex. 23(2), 245–261 (2007).
H. Weyl, On the volume of tubes, Am. J. Math. 61(2), 461–472 (1939).
Acknowledgements
We are grateful to one of the anonymous referees for criticism that has led to a substantial improvement of the paper’s presentation.
This work has been supported by the DFG Research Training Group on Scientific Computation GRK 693 (PaSCo GK) and DFG grants BU 1371/2-2, AM 386/1-1, and AM 386/1-2.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Michael J. Todd.
Dedicated to Mike Shub, on the occasion of his 70th birthday.
Appendices
Appendix A: Averaging the Twisted Characteristic Polynomial
Recall the setting of Sect. 4.2. Let \(A\in\mathbb{R}^{k\times k}\) and \(\varphi\colon\mathbb{R} ^{k}\to\mathbb{R} ^{k},x\mapsto Ax\). Using the notation \(\operatorname{ch}_{\ell}(A,t) = \operatorname{ch}_{Y_{0}}(\varphi,t)\), where \(Y_{0} =\mathbb{R} ^{\ell}\times0\), we have
(The reason is that \(QY_{0}\in\operatorname{Gr}_{k,\ell}\) is uniform random when Q∈O(k) is uniformly chosen at random.) This equality allows us to prove Theorem 4.1 with basic matrix calculus. The main idea is to use the multilinearity of the determinant and the invariance of the coefficients σ i (A) under similarity transformations, to show that the coefficients of \(\mathbb{E}[\operatorname{ch}_{\ell}(Q^{\mathrm{T}}AQ,t)]\) are linear combinations of the σ i (A). We then compute the coefficients of these linear combinations by choosing for A scalar multiples of the identity matrix.
In the following, we use the notation [k]:={1,…,k}, and we denote by \(\binom{[k]}{i}\) the set of all i-element subsets of [k]. For \(A\in\mathbb{R}^{k\times k}\) and \(J\in\binom {[k]}{i}\), we denote by \(\operatorname{pm}_{J}(A) := \det(A_{J})\) the Jth principal minor of A, where A J denotes the submatrix of A obtained by selecting the rows and columns of A whose indices lie in J. It is well known (cf. for example [28, Theorem 1.2.12]) that σ i (A) is the sum of all principal minors of A of size i, i.e.,
For 0≤ℓ≤k, the ℓth leading principal minor \(\operatorname{lpm}_{\ell}(A) := \operatorname{pm}_{[\ell]}(A)\) is defined as the principal minor for J=[ℓ]. Note that \(\operatorname{lpm}_{\ell}(A) = \det_{\mathbb {R}^{\ell}\times 0}(\varphi)\).
Lemma A.1
Let
\(A\in\mathbb{R}^{k\times k}\), and let
Q∈O(k) be chosen uniformly at random. Then, for all
, we have
Proof
For the first equality let J={j 1,…,j ℓ }, j 1<⋯<j ℓ , and let π be any permutation of [k] such that π(i)=j i for all i=1,…,ℓ. If M π denotes the permutation matrix corresponding to π, i.e., M π e i =e π(i), then \(A_{J}=(M_{\pi}^{\mathrm{T}} A M_{\pi})_{[\ell]}\), and therefore \(\operatorname{pm}_{J}(A)=\operatorname{lpm}_{\ell}( M_{\pi}^{\mathrm{T}} A M_{\pi})\). This implies
where we have used the fact that right multiplication by the fixed element M π leaves the uniform distribution on O(k) invariant. This implies

which was to be shown. □
Proof of Theorem 4.1
Statement (4.4) follows from (A.1) and Lemma A.1.
For proving (4.5), we start with a general observation. By multilinearity, we write the determinant of a matrix with columns v 1,…,v i−1,v i +α⋅e i ,v i+1,…,v n in the form
Using this repeatedly, we obtain
By (4.3) we can express the twisted characteristic polynomial as
Applying (A.3), we expand this to obtain
where \(c_{1},c_{2}\colon2^{[k]}\to\mathbb{N}\) are some integer-valued functions on the power set of [k]. Averaging the twisted characteristic polynomial with the help of Lemma A.1 yields
for some rational constants \(\tilde{d}_{ij}\). It remains to prove that the \(\tilde{d}_{ij}\) coincide with the d ij defined in Theorem 4.1.
To compute the \(\tilde{d}_{ij}\), we choose A=sI
k
. Then, \(\operatorname{ch}_{\ell}(s I_{k},t) = (s-t)^{\ell}\cdot(1+s t)^{k-\ell}\). Since , and Q
T
sI
k
Q=sI
k
, we get
Let us expand the first term:
where again we interpret \(\binom{n}{m}=0\) if m<0 or m>n, i.e., the above summation over i, j in fact only runs over the rectangle determined by the inequalities \(0\leq\frac{i-j}{2}+\frac{\ell}{2}\leq\ell\) and \(0\leq k-\frac{i+j+\ell}{2}\leq k-\ell\). (See Fig. 1 for an illustration of the change of summation.) Note that the reverse substitution is given by \(\lambda=\frac {i-j+\ell}{2}\) and \(\mu=k-\frac{i+j+\ell}{2}\).
Illustration of the change of summation in (A.5) (k=8, ℓ=5)
Comparing the coefficients of the above two expressions for (s−t)ℓ⋅(1+st)k−ℓ reveals that indeed \(\tilde{d}_{ij}=d_{ij}\) as defined in Theorem 4.1. This completes the proof of (4.5).
For the last claim (4.6) in Theorem 4.1, we define the positive characteristic polynomial
by replacing −t by t in (4.3). By the same reasoning as for (A.4), we get
with the same function \(c_{2}\colon2^{[k]}\to\mathbb{N}\) as in (A.4). Moreover, by arguing as in the proof of (4.5) before, we show that
Now assume that A is positive semidefinite. Then each of its principal minors is nonnegative, i.e., \(\operatorname{pm}_{J}(A)\geq0\) for all J⊆[k]. Therefore, if t≥0, we get from (A.4) and (A.6) that
Taking into account (A.7) completes the proof of Theorem 4.1. □
Appendix B: Proof of Some Technical Estimations
Recall that the symbol is used to mark simple estimates, which are easily checked with a computer algebra system.
Proof of Lemma 5.2
(1) We make a case distinction by the parity of ℓ. Using Γ(x+1)=x⋅Γ(x), for odd ℓ we obtain
Using additionally \(\varGamma(x+\frac{1}{2})<\sqrt{x}\cdot\varGamma(x)\), for even ℓ we get
(2) As for the second estimate, we distinguish the cases i≥2k and i≤2k. From 0≤k≤m−1 and 0≤i−k≤n−m−1, we get
For i≥2k we thus have
and for i≤2k
(3) The I-functions have been estimated in [10, Lemma 2.2] in the following way. Let \(\varepsilon:=\sin(\alpha)=\frac{1}{t}\). For i<n−2,
and for i=n−2, assuming n≥3,

With these estimates we obtain

For \(\varepsilon<n^{-\frac{3}{2}}\) we thus get

Similarly, we derive
For \(\varepsilon<\frac{1}{m}\) we thus have

which proves the assertion. □
Rights and permissions
About this article
Cite this article
Amelunxen, D., Bürgisser, P. Probabilistic Analysis of the Grassmann Condition Number. Found Comput Math 15, 3–51 (2015). https://doi.org/10.1007/s10208-013-9178-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10208-013-9178-4
Keywords
- Convex programming
- Perturbation
- Condition number
- Average analysis
- Spherically convex sets
- Grassmann manifold
- Tube formulas