Privacy Preserving Data Mining Benjamin Fung bfung(at)cs.sfu.ca.


1 Privacy Preserving Data Mining Benjamin Fung bfung(at)cs.sfu.ca

2 Privacy Preserving Data Mining

What is data mining?
–Non-trivial extraction of implicit, previously unknown, and potentially useful information from large data sets or databases [W. Frawley, G. Piatetsky-Shapiro, and C. Matheus, 1992]

What is privacy preserving data mining?
–The study of achieving data mining goals without sacrificing the privacy of the individuals

3 Scenario (Information Sharing)

A data owner wants to release a person-specific data table to another party (or the public) for classification analysis without sacrificing the privacy of the individuals in the released data.

[Diagram: the data owner releases person-specific data to the data recipients]

4 Privacy Threat

If a description on (Education, Sex) is so specific that few people match it, releasing the table will link a unique individual or a small number of individuals with sensitive information.

Education    Sex   Age   Class   # of Recs.
9th          F     30    0G3B    3
10th         M     32    0G4B    4
11th         F     35    2G3B    5
12th         F     37    3G1B    4
Bachelors    F     42    4G2B    6
Bachelors    F     44    4G0B    4
Masters      M     44    4G0B    4
Masters      F     44    3G0B    3
Doctorate    F     44    1G0B    1
                         Total:  34

[Diagram labels: Data recipients, Adversary]

Education    Sex   Diagnosis       …
Bachelors    F     Depression      …
Bachelors    M     Heart disease   …
Masters      F     Depression      …
Masters      F     Heart disease   …
Doctorate    F     Knee injury     …
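To make the threat concrete, the following is a minimal Python sketch that counts how many records share each (Education, Sex) description in the slide's table and flags the descriptions matched by fewer than a chosen number of records. The function names and the threshold parameter `k` are hypothetical and do not come from the slides; the rows are copied from the table above.

```python
from collections import Counter

# Rows copied from the slide: (Education, Sex, Age, Class, number of records).
TABLE = [
    ("9th", "F", 30, "0G3B", 3),
    ("10th", "M", 32, "0G4B", 4),
    ("11th", "F", 35, "2G3B", 5),
    ("12th", "F", 37, "3G1B", 4),
    ("Bachelors", "F", 42, "4G2B", 6),
    ("Bachelors", "F", 44, "4G0B", 4),
    ("Masters", "M", 44, "4G0B", 4),
    ("Masters", "F", 44, "3G0B", 3),
    ("Doctorate", "F", 44, "1G0B", 1),
]


def group_sizes(rows, key_columns=(0, 1)):
    """Count records per combination of the chosen columns.

    The default columns (0, 1) are Education and Sex, the description
    discussed on the slide.
    """
    sizes = Counter()
    for row in rows:
        key = tuple(row[i] for i in key_columns)
        sizes[key] += row[-1]  # last field is the record count
    return sizes


def risky_groups(rows, k, key_columns=(0, 1)):
    """Return descriptions matched by fewer than k records.

    k is a hypothetical threshold chosen for illustration only.
    """
    sizes = group_sizes(rows, key_columns)
    return {key: n for key, n in sizes.items() if n < k}


if __name__ == "__main__":
    for key, n in risky_groups(TABLE, k=4).items():
        print(key, n)
```

In this table only one record matches (Doctorate, F), so anyone holding a second table with the same description could tie its Diagnosis value to a single individual.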

5 Solution: Generalization

Before generalization:

Education    Sex   Age   Class   # of Recs.
9th          F     30    0G3B    3
10th         M     32    0G4B    4
11th         F     35    2G3B    5
12th         F     37    3G1B    4
Bachelors    F     42    4G2B    6
Bachelors    F     44    4G0B    4
Masters      M     44    4G0B    4
Masters      F     44    3G0B    3
Doctorate    F     44    1G0B    1

After generalization:

Education    Sex   Age   Class   # of Recs.
9th          F     30    0G3B    3
10th         M     32    0G4B    4
11th         F     35    2G3B    5
12th         F     37    3G1B    4
Bachelors    F     42    4G2B    6
Bachelors    F     44    4G0B    4
Grad School  M     44    4G0B    4
Grad School  F     44    4G0B    4
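The step from the first table to the second can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' algorithm: the mapping `GENERALIZE` contains only the one replacement visible on the slide (Masters and Doctorate become Grad School), the helper names are hypothetical, and the Class field is assumed to hold per-class record counts (for example, `3G0B` read as 3 records of class G and 0 of class B), which is consistent with the record totals in each row.

```python
from collections import defaultdict

# Rows copied from the slide: (Education, Sex, Age, Class, number of records).
TABLE = [
    ("9th", "F", 30, "0G3B", 3),
    ("10th", "M", 32, "0G4B", 4),
    ("11th", "F", 35, "2G3B", 5),
    ("12th", "F", 37, "3G1B", 4),
    ("Bachelors", "F", 42, "4G2B", 6),
    ("Bachelors", "F", 44, "4G0B", 4),
    ("Masters", "M", 44, "4G0B", 4),
    ("Masters", "F", 44, "3G0B", 3),
    ("Doctorate", "F", 44, "1G0B", 1),
]

# Assumption: only the replacement shown on the slide is modelled here.
GENERALIZE = {"Masters": "Grad School", "Doctorate": "Grad School"}


def parse_class(label):
    """Split a label such as '3G0B' into (3, 0). Assumed count format."""
    good, bad = label[:-1].split("G")
    return int(good), int(bad)


def generalize(rows, mapping):
    """Replace Education values via mapping and merge rows that become identical."""
    merged = defaultdict(lambda: [0, 0, 0])  # [G count, B count, records]
    for education, sex, age, cls, n in rows:
        key = (mapping.get(education, education), sex, age)
        g, b = parse_class(cls)
        merged[key][0] += g
        merged[key][1] += b
        merged[key][2] += n
    return [
        (education, sex, age, f"{g}G{b}B", n)
        for (education, sex, age), (g, b, n) in merged.items()
    ]


if __name__ == "__main__":
    for row in generalize(TABLE, GENERALIZE):
        print(row)
```

After the merge, the single (Doctorate, F) record is no longer distinguishable from the three (Masters, F) records, while the total number of records is unchanged.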

6 References

1. K. Wang, B. C. M. Fung, and P. S. Yu. Template-Based Privacy Preservation in Classification Problems. In Proc. of the 5th IEEE International Conference on Data Mining (ICDM 2005), Houston, TX, USA, November 27-30, 2005.
2. K. Wang, B. C. M. Fung, and G. Dong. Integrating Private Databases for Data Analysis. In Proc. of the 2005 IEEE International Conference on Intelligence and Security Informatics (ISI 2005), pages 171-182, Atlanta, GA, USA, May 19-20, 2005.
3. B. C. M. Fung, K. Wang, and P. S. Yu. Top-Down Specialization for Information and Privacy Preservation. In Proc. of the 21st IEEE International Conference on Data Engineering (ICDE 2005), pages 205-216, Tokyo, Japan, April 5-8, 2005.

For more information, visit http://www.cs.sfu.ca/~bfung

