Optimal Differentially Private PCA and Estimation for Spiked Covariance Matrices

Cai, T. Tony; Xia, Dong; Zha, Mengyue

arXiv:2401.03820 (math)

[Submitted on 8 Jan 2024 (v1), last revised 27 Sep 2024 (this version, v2)]

Abstract: Estimating a covariance matrix and its associated principal components is a fundamental problem in contemporary statistics. While optimal estimation procedures have been developed with well-understood properties, the increasing demand for privacy preservation introduces new complexities to this classical problem. In this paper, we study optimal differentially private Principal Component Analysis (PCA) and covariance estimation within the spiked covariance model. We precisely characterize the sensitivity of eigenvalues and eigenvectors under this model and establish the minimax rates of convergence for estimating both the principal components and covariance matrix. These rates hold up to logarithmic factors and encompass general Schatten norms, including spectral norm, Frobenius norm, and nuclear norm as special cases. We propose computationally efficient differentially private estimators and prove their minimax optimality for sub-Gaussian distributions, up to logarithmic factors. Additionally, matching minimax lower bounds are established. Notably, compared to the existing literature, our results accommodate a diverging rank, a broader range of signal strengths, and remain valid even when the sample size is much smaller than the dimension, provided the signal strength is sufficiently strong. Both simulation studies and real data experiments demonstrate the merits of our method.

Subjects:	Statistics Theory (math.ST); Information Theory (cs.IT); Methodology (stat.ME); Machine Learning (stat.ML)
Cite as:	arXiv:2401.03820 [math.ST]
	(or arXiv:2401.03820v2 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.2401.03820

Submission history

From: Dong Xia
[v1] Mon, 8 Jan 2024 11:18:14 UTC (71 KB)
[v2] Fri, 27 Sep 2024 14:15:12 UTC (1,538 KB)
