preprintarXiv (Cornell University)Oct 18, 2017GREEN OA

Learning Differentially Private Recurrent Language Models

Indexed inarxivdatacite

Abstract

We demonstrate that it is possible to train large recurrent language models with user-level differential privacy guarantees with only a negligible cost in predictive accuracy. Our work builds on recent advances in the training of deep networks on user-partitioned data and privacy accounting for stochastic gradient descent. In particular, we add user-level privacy protection to the federated averaging algorithm, which makes "large step" updates from user-level data. Our work demonstrates that given a dataset with a sufficiently large number of users (a requirement easily met by even small internet-scale datasets), achieving differential privacy comes at the cost of increased computation, rather than in…

Citation impact

671
total citations
FWCI
Percentile
References
0
Citations per year

Authors

4

Topics & keywords

Keywords
  • Differential privacy
  • Computer science
  • Computation
  • Language model
  • Stochastic gradient descent
  • The Internet
  • Scale (ratio)
  • Work (physics)
No related works found for this paper.