TR2012-064

A Framework for Privacy Preserving Statistical Analysis on Distributed Databases


    •  Lin, B.-R.; Wang, Y.; Rane, S., "A Framework for Privacy Preserving Statistical Analysis on Distributed Databases", IEEE International Workshop on Information Forensics and Security (WIFS), ISBN: 978-1-4673-2286-7, December 2012, pp. 61-66.
      BibTeX Download PDF
      • @inproceedings{Lin2012dec,
      • author = {Lin, B.-R. and Wang, Y. and Rane, S.},
      • title = {A Framework for Privacy Preserving Statistical Analysis on Distributed Databases},
      • booktitle = {IEEE International Workshop on Information Forensics and Security (WIFS)},
      • year = 2012,
      • pages = {61--66},
      • month = dec,
      • isbn = {978-1-4673-2286-7},
      • url = {http://www.merl.com/publications/TR2012-064}
      • }
  • MERL Contact:
  • Research Areas:

    Information Security, Multimedia


Alice and Bob are mutually untrusting curators who possess separate databases containing information about a set of respondents. This data is to be sanitized and published to enable accurate statistical analysis, while retaining the privacy of the individual respondents in the databases. Further, an adversary who looks at the published data must not even be able to compute statistical measures on it. Only an authorized researcher should be able to compute marginal and joint statistics. This work is an attempt toward providing a theoretical formulation of privacy and utility for problems of this type. Privacy of the individual respondents is formulated using differential privacy. Privacy of the marginal and joint statistics on the distributed databases is formulated using a new model called distributional differential privacy. Finally, a constructive scheme based on randomized response is presented as an example mechanism that satisfies the formulated privacy requirements.