Privacy-preserving Classification Mining Based on Probability Theory


Abstract: In existing privacy-preserving classification mining algorithms based on data perturbation, the perturbed data remain correlated with the original data, so private data are not fully protected. In addition, the perturbation algorithm and the classification algorithm are tightly coupled, which makes these methods unsuitable for practical use. To address these problems, a privacy-preserving classification mining algorithm based on probability theory is proposed. Perturbation yields a data set that is independent of, and identically distributed with, the original data, so the perturbed data are no longer correlated with the original data and any classification algorithm can be applied to the perturbed data directly.
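The abstract does not spell out the perturbation procedure, so the following is only an illustrative sketch of the general idea, not the authors' algorithm. It assumes numeric attributes and models each attribute within a class as an independent Gaussian (both are assumptions made here), then releases fresh random draws in place of the original records. The released records depend on the originals only through the fitted parameters, and they follow the original distribution only approximately. The function names `perturb`, `fit_centroids`, and `predict` are hypothetical, and a nearest-centroid classifier stands in for the decision tree named in the keywords.

```python
import random
import statistics


def perturb(records, labels, rng):
    """Replace every record with a fresh draw from a per-class Gaussian fit.

    Assumption: attributes are numeric and treated as independent Gaussians
    within each class; the paper's actual estimator is not in the abstract.
    """
    by_class = {}
    for rec, lab in zip(records, labels):
        by_class.setdefault(lab, []).append(rec)

    new_records, new_labels = [], []
    for lab, recs in by_class.items():
        columns = list(zip(*recs))  # one tuple of values per attribute
        means = [statistics.fmean(col) for col in columns]
        stdevs = [statistics.pstdev(col) for col in columns]
        for _ in recs:  # release as many records as the class originally had
            new_records.append([rng.gauss(m, s) for m, s in zip(means, stdevs)])
            new_labels.append(lab)
    return new_records, new_labels


def fit_centroids(records, labels):
    """Train a nearest-centroid classifier: one mean vector per class."""
    sums, counts = {}, {}
    for rec, lab in zip(records, labels):
        acc = sums.setdefault(lab, [0.0] * len(rec))
        for i, value in enumerate(rec):
            acc[i] += value
        counts[lab] = counts.get(lab, 0) + 1
    return {lab: [v / counts[lab] for v in acc] for lab, acc in sums.items()}


def predict(centroids, rec):
    """Return the label whose centroid is closest in squared Euclidean distance."""
    return min(
        centroids,
        key=lambda lab: sum((c - x) ** 2 for c, x in zip(centroids[lab], rec)),
    )


# Synthetic two-class data set, used only for illustration.
rng = random.Random(0)
original = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(200)]
original += [[rng.gauss(5, 1), rng.gauss(5, 1)] for _ in range(200)]
labels = ["a"] * 200 + ["b"] * 200

# The classifier only ever sees the released (perturbed) records.
released, released_labels = perturb(original, labels, rng)
model = fit_centroids(released, released_labels)

# Evaluate on the original records to see how well the model carries over.
accuracy = sum(predict(model, r) == l for r, l in zip(original, labels)) / len(original)
print(f"accuracy on original records: {accuracy:.3f}")
```

Because the classifier is trained on the released records without any reconstruction step, the perturbation and classification stages stay decoupled, which is the property the abstract emphasizes; any other off-the-shelf classifier could be substituted for the nearest-centroid stand-in.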

Authors: Li Guang, Wang Yadong, Su Xiaohong

Affiliation: Department of Computer Science and Engineering, Harbin Institute of Technology

Source: Computer Engineering (《计算机工程》), CAS, CSCD, 2012, No. 3, pp. 12-13, 18 (3 pages)

Funding: National "863" Program of China (2007AA02Z329)

Keywords: data mining; privacy preserving; data perturbation; random noise; decision tree

Classification number: TP391.4 [Automation and Computer Technology: Computer Application Technology]



