Computer Engineering, 2017, Vol. 43, Issue 4: 160-165, 6. DOI: 10.3969/j.issn.1000-3428.2017.04.027
An Enhanced Differential Privacy Data Release Algorithm
Abstract
To improve the classification accuracy of released data at the same level of privacy protection, an enhanced differential privacy data release algorithm named GiniDiff is proposed, based on the DiffGen algorithm. The algorithm first fully generalizes the original dataset. In each iteration it selects a specialization scheme using the exponential mechanism and assigns the specialized records to new equivalence classes by building a decision tree. It then uses the Laplace mechanism to add noise to the equivalence-class counts and generates the dataset for release. Because the algorithm uses Gini-index gain as the utility measure for candidate specialization schemes, together with a reasonable privacy budget allocation and dynamic calculation of budget consumption, the utility of the released dataset is effectively improved. Experimental results show that the algorithm outperforms DiffGen in classification accuracy and that its classification accuracy is close to the ideal level.
Keywords
differential privacy/data release/decision tree/Gini-index gain/exponential mechanism/Laplace mechanism
Classification
Information Technology and Security Science
Cite this article
SUN Kui, ZHANG Zhiyong, ZHAO Changwei. An Enhanced Differential Privacy Data Release Algorithm[J]. Computer Engineering, 2017, 43(4): 160-165, 6.
Funding
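National Natural Science Foundation of China (61370220)
Program for Innovative Research Teams in Science and Technology in Universities of Henan Province (15IRTSTHN010)
Key Science and Technology Program of Henan Province (142102210425)
Key Basic Research Program of Science and Technology Research, Education Department of Henan Province (13A520240, 14A520048)
Research and Innovation Capacity Cultivation Fund of Henan University of Science and Technology (2013ZCX022)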
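
Illustrative sketch

The building blocks named in the abstract (Gini-index gain as the utility score, the exponential mechanism for choosing a specialization scheme, and Laplace noise on equivalence-class counts) can be illustrated with a minimal Python sketch. This is not the GiniDiff implementation: the function names, the `sensitivity` argument, and the per-step `epsilon` are assumptions made for illustration, and the paper's privacy budget allocation and dynamic budget consumption are not reproduced here.

```python
import math
import random


def gini(counts):
    """Gini impurity of a class-count vector: 1 - sum(p_i^2)."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return 1.0 - sum((c / total) ** 2 for c in counts)


def gini_gain(parent_counts, child_counts):
    """Impurity drop when a parent group is split into child groups."""
    n = sum(parent_counts)
    if n == 0:
        return 0.0
    weighted = sum(sum(c) / n * gini(c) for c in child_counts)
    return gini(parent_counts) - weighted


def exponential_mechanism(candidates, scores, epsilon, sensitivity, rng):
    """Sample a candidate with probability proportional to
    exp(epsilon * score / (2 * sensitivity)).

    `sensitivity` is supplied by the caller (assumption: the value
    used in the paper for Gini-index gain is not stated in the abstract).
    """
    top = max(scores)  # shift scores so the largest exponent is 0
    weights = [math.exp(epsilon * (s - top) / (2.0 * sensitivity)) for s in scores]
    return rng.choices(candidates, weights=weights, k=1)[0]


def laplace_noise(scale, rng):
    """Laplace(0, scale) sample, drawn as the difference of two
    independent exponential variables with mean `scale`."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def noisy_count(true_count, epsilon, rng):
    """Add Laplace noise to a count; a count query has sensitivity 1,
    so the noise scale is 1 / epsilon."""
    return true_count + laplace_noise(1.0 / epsilon, rng)
```

In a full algorithm of this kind, `gini_gain` would score each candidate specialization, `exponential_mechanism` would pick one per iteration, and `noisy_count` would be applied once to each final equivalence class, with the total privacy budget divided among these steps.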