On Sparse Linear Regression in the Local Differential Privacy Model
Type
ArticleAuthors
Wang, Di
Xu, Jinhui
Date
2020-11-25Online Publication Date
2020-11-25Print Publication Date
2021-02Submitted Date
2005-04-19Permanent link to this record
http://hdl.handle.net/10754/666352
Metadata
Show full item recordAbstract
In this paper, we study the sparse linear regression problem under the Local Differential Privacy (LDP) model. We first show that polynomial dependency on the dimensionality p of the space is unavoidable for the estimation error in both non-interactive and sequential interactive local models, if the privacy of the whole dataset needs to be preserved. Similar limitations also exist for other types of error measurements and in the relaxed local models. This indicates that differential privacy in high dimensional space is unlikely achievable for the problem. With the understanding of this limitation, we then present two algorithmic results. The first one is a sequential interactive LDP algorithm for the low dimensional sparse case, called Locally Differentially Private Iterative Hard Thresholding (LDP-IHT), which achieves a near optimal upper bound. This algorithm is actually rather general and can be used to solve quite a few other problems, such as (Local) DP-ERM with sparsity constraints and sparse regression with non-linear measurements. The second one is for the restricted (high dimensional) case where only the privacy of the responses (labels) needs to be preserved. For this case, we show that the optimal rate of the error estimation can be made logarithmically dependent on p (i.e., log p) in the local model, where an upper bound is obtained by a label-privacy version of LDP-IHT. Experiments on real world and synthetic datasets confirm our theoretical analysis.Citation
Wang, D., & Xu, J. (2020). On Sparse Linear Regression in the Local Differential Privacy Model. IEEE Transactions on Information Theory, 1–1. doi:10.1109/tit.2020.3040406Additional Links
https://ieeexplore.ieee.org/document/9269994/ae974a485f413a2113503eed53cd6c53
10.1109/TIT.2020.3040406