Download presentation
Presentation is loading. Please wait.
Published byDanielle Daniel Modified over 11 years ago
1
Data Mining (Student Presentation) Samira Roshan_Asma Akbari Mehr 87-88
4
hiddenThere is often information hidden in the data that is not readily evident Human analysts may take weeks to discover useful information Much of the data is never analyzed at all Number of analysts Total new disk (TB) since 1995 The Data Gap Gap
5
Data collected and stored at enormous speeds (GB/hour) Traditional techniques infeasible for raw data Data mining may help scientists
8
DATA Base Target Data Transformed Data Patterns and Rules
10
Classification Regression Collaborative Filtering Clustering Association rules Deviation detection
11
ClassifierDecision rules Salary > 5 L Prof. = Exec New applicants data Many approaches: Statistics, Decision Trees, Neural Networks,...
12
Unsupervised learning when old data with class labels not available e.g. when introducing a new product.
13
Given set T of groups of items Example: set of item sets purchased MilkCerealRice TeaRiceBread ChipsBreadcheese......
14
The use of data, particularly about people, for data mining has serious ethical implications. When applied to people discriminate.
15
Data mining (or simple analysis) on people may come with a profile that would raise controversial issues of – Discrimination – Privacy – Security Examples: – Should males between 18 and 35 from countries that produced terrorists be singled out for search before flight? – Can people be denied mortgage based on age, sex, race? – Women live longer. Should they pay less for life insurance?
18
Instances Instances: the individual, independent examples of a concept Attributes Attributes: measuring aspects of an instance We will focus on nominal and numeric ones
19
number of nuclei (values: 1,2) number of tails (values: 1,2) color (values: light, dark) wall (values: thin, thick) Lethargia Burpoma Healthy
21
# Color LightDark Lethargi a 32 Burpom a 12 Healthy 22 # Tails 12 Lethargi a 50 Burpom a 03 Healthy 22 # Nucleus 12 Lethargi a 41 Burpom a 03 Healthy 22 # Membrance ThinThick Lethargia 32 Burpoma 21 Healthy 31
22
# ColorLightDark Lethargi a 32 Burpom a 12 Healthy22 # Tails12 Lethargi a 50 Burpom a 03 Healthy22 # Nucleus 12 Lethargi a 41 Burpom a 03 Healthy22 # Membrance ThinThick Lethargia32 Burpoma21 Healthy31
23
Tails
24
# ColorLightDark Lethargi a 32 Burpom a 00 Healthy02 # Nucleus 12 Lethargi a 41 Burpom a 00 Healthy02 # Membrance ThinThick Lethargia32 Burpoma00 Healthy02
25
Tails Nucleu s Lethargia
26
Tails Nucleu s Lethargia Color Nucleu s Healthy Burpoma Lethargia Healthy
27
If # Tails = 1 then If # Nucleus = 1 then class = Lethargia else If color = light then class = Lethargia else class = Healthy else If # Nucleus = 1 then class = Healthy else class = Burpom
29
Resources http://office.microsoft.com/ http://www.wisegeek.com/what-is-a- relational-database.htmhttp://www.wisegeek.com/what-is-a- relational-database.htm http://www.cs.toronto.edu/ avaisman/cscd3 4summer/ccsc343s.htm www.cl.cam.ac.uk/Teaching/current/Databases/ www.cs.uh.edu/~ceick/6340/dw-olap.ppt
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.