Presentation is loading. Please wait.

Presentation is loading. Please wait.

Data Mining (Student Presentation) Samira Roshan_Asma Akbari Mehr 87-88.

Similar presentations


Presentation on theme: "Data Mining (Student Presentation) Samira Roshan_Asma Akbari Mehr 87-88."— Presentation transcript:

1 Data Mining (Student Presentation) Samira Roshan_Asma Akbari Mehr 87-88

2

3

4 hiddenThere is often information hidden in the data that is not readily evident Human analysts may take weeks to discover useful information Much of the data is never analyzed at all Number of analysts Total new disk (TB) since 1995 The Data Gap Gap

5 Data collected and stored at enormous speeds (GB/hour) Traditional techniques infeasible for raw data Data mining may help scientists

6

7

8 DATA Base Target Data Transformed Data Patterns and Rules

9

10 Classification Regression Collaborative Filtering Clustering Association rules Deviation detection

11 ClassifierDecision rules Salary > 5 L Prof. = Exec New applicants data Many approaches: Statistics, Decision Trees, Neural Networks,...

12 Unsupervised learning when old data with class labels not available e.g. when introducing a new product.

13 Given set T of groups of items Example: set of item sets purchased MilkCerealRice TeaRiceBread ChipsBreadcheese......

14 The use of data, particularly about people, for data mining has serious ethical implications. When applied to people discriminate.

15 Data mining (or simple analysis) on people may come with a profile that would raise controversial issues of – Discrimination – Privacy – Security Examples: – Should males between 18 and 35 from countries that produced terrorists be singled out for search before flight? – Can people be denied mortgage based on age, sex, race? – Women live longer. Should they pay less for life insurance?

16

17

18 Instances Instances: the individual, independent examples of a concept Attributes Attributes: measuring aspects of an instance We will focus on nominal and numeric ones

19 number of nuclei (values: 1,2) number of tails (values: 1,2) color (values: light, dark) wall (values: thin, thick) Lethargia Burpoma Healthy

20

21 # Color LightDark Lethargi a 32 Burpom a 12 Healthy 22 # Tails 12 Lethargi a 50 Burpom a 03 Healthy 22 # Nucleus 12 Lethargi a 41 Burpom a 03 Healthy 22 # Membrance ThinThick Lethargia 32 Burpoma 21 Healthy 31

22 # ColorLightDark Lethargi a 32 Burpom a 12 Healthy22 # Tails12 Lethargi a 50 Burpom a 03 Healthy22 # Nucleus 12 Lethargi a 41 Burpom a 03 Healthy22 # Membrance ThinThick Lethargia32 Burpoma21 Healthy31

23 Tails

24 # ColorLightDark Lethargi a 32 Burpom a 00 Healthy02 # Nucleus 12 Lethargi a 41 Burpom a 00 Healthy02 # Membrance ThinThick Lethargia32 Burpoma00 Healthy02

25 Tails Nucleu s Lethargia

26 Tails Nucleu s Lethargia Color Nucleu s Healthy Burpoma Lethargia Healthy

27 If # Tails = 1 then If # Nucleus = 1 then class = Lethargia else If color = light then class = Lethargia else class = Healthy else If # Nucleus = 1 then class = Healthy else class = Burpom

28

29 Resources http://office.microsoft.com/ http://www.wisegeek.com/what-is-a- relational-database.htmhttp://www.wisegeek.com/what-is-a- relational-database.htm http://www.cs.toronto.edu/ avaisman/cscd3 4summer/ccsc343s.htm www.cl.cam.ac.uk/Teaching/current/Databases/ www.cs.uh.edu/~ceick/6340/dw-olap.ppt


Download ppt "Data Mining (Student Presentation) Samira Roshan_Asma Akbari Mehr 87-88."

Similar presentations


Ads by Google