Published by Sharon Barker. Modified over 9 years ago.
1
Privacy Preserving Data Mining
Benjamin Fung
bfung(at)cs.sfu.ca
2
Privacy Preserving Data Mining
What is data mining?
– Non-trivial extraction of implicit, previously unknown, and potentially useful information from large data sets or databases [W. Frawley, G. Piatetsky-Shapiro, and C. Matheus, 1992]
What is privacy preserving data mining?
– Study of achieving some data mining goals without sacrificing the privacy of the individuals
3
Scenario (Information Sharing)
A data owner wants to release a person-specific data table to another party (or the public) for the purpose of classification analysis without sacrificing the privacy of the individuals in the released data.
[Diagram: Data owner → Person-specific data → Data recipients]
4
Privacy Threat
If a combination of (Education, Sex) values is so specific that few people match it, releasing the table can link a unique individual or a small group of individuals to sensitive information.

Education   Sex  Age  Class  # of Recs.
9th         F    30   0G3B   3
10th        M    32   0G4B   4
11th        F    35   2G3B   5
12th        F    37   3G1B   4
Bachelors   F    42   4G2B   6
Bachelors   F    44   4G0B   4
Masters     M    44   4G0B   4
Masters     F    44   3G0B   3
Doctorate   F    44   1G0B   1
Total: 34

[Diagram labels: Data recipients, Adversary]

Education   Sex  Diagnosis      …
Bachelors   F    Depression     …
Bachelors   M    Heart disease  …
Masters     F    Depression     …
Masters     F    Heart disease  …
Doctorate   F    Knee injury    …
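The linking risk described on this slide can be checked by counting how many records share each (Education, Sex) combination. The sketch below is only an illustration: the function names and the threshold `k` are hypothetical and do not come from the slides, and the record counts are copied from the table above.

```python
from collections import Counter

# Rows copied from the slide's table: (Education, Sex, Age, number of records).
RECORDS = [
    ("9th", "F", 30, 3),
    ("10th", "M", 32, 4),
    ("11th", "F", 35, 5),
    ("12th", "F", 37, 4),
    ("Bachelors", "F", 42, 6),
    ("Bachelors", "F", 44, 4),
    ("Masters", "M", 44, 4),
    ("Masters", "F", 44, 3),
    ("Doctorate", "F", 44, 1),
]


def group_sizes(records):
    """Count how many records share each (Education, Sex) combination."""
    sizes = Counter()
    for education, sex, _age, count in records:
        sizes[(education, sex)] += count
    return sizes


def risky_groups(records, k):
    """Return the combinations matched by fewer than k records.

    k is a hypothetical threshold chosen by the data owner; the slides
    only say "a unique or a small number of individuals".
    """
    return {group: n for group, n in group_sizes(records).items() if n < k}


if __name__ == "__main__":
    for group, n in risky_groups(RECORDS, k=2).items():
        print(group, n)
```

With `k=2`, the only combination flagged is (Doctorate, F), which matches a single record. An adversary who knows a female doctorate holder is in the table can therefore link her to the one matching row.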
5
Solution: Generalization

Original table:

Education   Sex  Age  Class  # of Recs.
9th         F    30   0G3B   3
10th        M    32   0G4B   4
11th        F    35   2G3B   5
12th        F    37   3G1B   4
Bachelors   F    42   4G2B   6
Bachelors   F    44   4G0B   4
Masters     M    44   4G0B   4
Masters     F    44   3G0B   3
Doctorate   F    44   1G0B   1

Generalized table (Masters and Doctorate replaced by Grad School):

Education    Sex  Age  Class  # of Recs.
9th          F    30   0G3B   3
10th         M    32   0G4B   4
11th         F    35   2G3B   5
12th         F    37   3G1B   4
Bachelors    F    42   4G2B   6
Bachelors    F    44   4G0B   4
Grad School  M    44   4G0B   4
Grad School  F    44   4G0B   4
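The generalization step on this slide can be sketched as a value mapping followed by a merge of rows that become identical. This is a minimal illustration, not the authors' algorithm: the `TAXONOMY` mapping and the `generalize` function are hypothetical names, and the Class column is assumed to hold counts for two class labels G and B (so `4G2B` becomes `4, 2`).

```python
from collections import defaultdict

# Rows copied from the slide: (Education, Sex, Age, G count, B count).
# Assumption: a Class value such as "4G2B" means 4 records of class G, 2 of class B.
ROWS = [
    ("9th", "F", 30, 0, 3),
    ("10th", "M", 32, 0, 4),
    ("11th", "F", 35, 2, 3),
    ("12th", "F", 37, 3, 1),
    ("Bachelors", "F", 42, 4, 2),
    ("Bachelors", "F", 44, 4, 0),
    ("Masters", "M", 44, 4, 0),
    ("Masters", "F", 44, 3, 0),
    ("Doctorate", "F", 44, 1, 0),
]

# Hypothetical one-level taxonomy: both graduate degrees map to one parent value.
TAXONOMY = {"Masters": "Grad School", "Doctorate": "Grad School"}


def generalize(rows, taxonomy):
    """Replace Education values via the taxonomy and merge identical rows."""
    merged = defaultdict(lambda: [0, 0])
    for education, sex, age, g_count, b_count in rows:
        key = (taxonomy.get(education, education), sex, age)
        merged[key][0] += g_count
        merged[key][1] += b_count
    # Dicts keep insertion order, so the output follows the input order.
    return [(e, s, a, g, b) for (e, s, a), (g, b) in merged.items()]


if __name__ == "__main__":
    for education, sex, age, g, b in generalize(ROWS, TAXONOMY):
        print(f"{education:<12}{sex:<4}{age:<5}{g}G{b}B  {g + b}")
```

After the mapping, the Masters F and Doctorate F rows collapse into a single Grad School F row covering 4 records, so the single doctorate holder is no longer distinguishable on (Education, Sex), while the total of 34 records is unchanged.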
6
References
1. K. Wang, B. C. M. Fung, and P. S. Yu. Template-Based Privacy Preservation in Classification Problems. In Proc. of the 5th IEEE International Conference on Data Mining (ICDM 2005), Houston, TX, USA, November 27-30, 2005.
2. K. Wang, B. C. M. Fung, and G. Dong. Integrating Private Databases for Data Analysis. In Proc. of the 2005 IEEE International Conference on Intelligence and Security Informatics (ISI 2005), pages 171-182, Atlanta, GA, USA, May 19-20, 2005.
3. B. C. M. Fung, K. Wang, and P. S. Yu. Top-Down Specialization for Information and Privacy Preservation. In Proc. of the 21st IEEE International Conference on Data Engineering (ICDE 2005), pages 205-216, Tokyo, Japan, April 5-8, 2005.
For more information, visit http://www.cs.sfu.ca/~bfung