Presentation is loading. Please wait.

Presentation is loading. Please wait.

By: Megan Ryan Phillip Striggow Jonathan Zamora Marquenon Franklin

Similar presentations


Presentation on theme: "By: Megan Ryan Phillip Striggow Jonathan Zamora Marquenon Franklin"— Presentation transcript:

1 By: Megan Ryan Phillip Striggow Jonathan Zamora Marquenon Franklin
Data Mining By: Megan Ryan Phillip Striggow Jonathan Zamora Marquenon Franklin

2 What is Data Mining?

3 Data Mining Data (also known as Knowledge Discovery)
Process of extracting, analyzing, converting and summarizing data. From jumbled-up data from large databases to useful information. Combining methods from statistics and artificial intelligence with database management.

4 The Importance Important tool to transform data into information.
Profiling practices Marketing Surveillance Scientific discovery Businesses organizations. Organize data track right info wages employees get at work.

5 Advantages of Data Mining
Can unearth facts about customers Lend automation benefits to existing hardware and software Enhances efficiency and saves money Market/retail helps to foretell products that customers would like to buy especially identifying patterns. Helps law enforcement profiling

6 Disadvantages Privacy issues Security issues
hackers hack sensitive data Security issues some companies don't have sufficient or reliable security systems to protect the information. Misuse of information/inaccurate information

7 Who uses Data Mining?

8 Private and Public Sectors
The medical community help predict the effectiveness of a procedure or medicine The healthcare industry to re-route health claims to best-of-breed providers identify characteristics of most successful employees determine specific promotions to send to those recipients Pharmaceutical firms research of chemical compounds and genetic material Insurance and banking detect fraud and assist in risk assessment Many retailers and grocery stores most sold items

9 Aiding in Federal Government Cases
Law help detect patterns of crimes government disperse their resources to where they need to go help make us safer as a whole if we know how criminals act

10 Development of Games and Video Games
Provider information on what the consumer wants how to implement the user into the game knows how the game is going to prevail creates a pattern and how the games can be developed or rejected Aid in more sales and increase the popularity of these games

11 Wal-Mart and Data Mining
Record sent directly to the suppliers Supplier send products that are being sold most Good inventory Adequate warehouse supply Wal-Mart and the supplier know what the consumer wants

12 Microsoft Fight against phishing
Technology in financial company websites and E- commerce sites characteristics of the site IP address, DNS, IP address

13 Conclusion: who uses data mining?
Huge asset to several industries provide great insight into things that can be prevented Security purposes Government Law enforcement Commercial Successful businesses

14 What technology is used for Data Mining?

15 Technology used in Data Mining
Three major technologies that makes data mining possible Massive data collection Powerful multiprocessor computers Data Mining algorithms

16 Massive data collection
Explosive growth of database warehouses September 2005-Oracle powered the world’s largest commercial database(100 TB) (Winter Corporation Top Ten Program) Tripled the previous world’s largest database(29.2 TB) by Oracle in 2003 As of today, Oracle is not even in the top three largest databases in the world Shows how massive data collection has evolved-makes sense to use data mining to get useful information

17 (source: www.focus.com)
By the numbers 2011 top 3 largest databases in the world World Data Centre for Climate 220 terabytes of web data 6 petabytes of additional data National Energy Research Scientific Computing Center 2.8 petabytes of data Sprint 323 terabytes of information 1.9 trillion phone call records (source:

18 Powerful multiprocessor computers
Parallel multiprocessing Computers with more than one CPU or processor core that simultaneously execute programs Makes programs run faster Can go through huge data at a very high speed

19 (source: www.unusualbookmarklets.com)
By the numbers 2011 Fastest clocked CPU Intel Celeron D-347 CPU cooled with liquid nitrogen 8.20 GHz processing speed 2011 Fastest desktop CPU in the market AMD Phenom III X4 980 3.7 GHz processing speed 2011 Fastest laptop CPU in the market Intel Core i7-2920XM Processor Extreme Edition 2.5 GHz processing speed (source:

20 Data mining algorithms
Used in analyzing data Composed of well known mathematical algorithms and techniques Integrated in software and hardware used for data mining

21 Types used in Data Mining
Classification algorithms - predict one or more discrete variables, based on the other attributes in the dataset Regression algorithms - predict one or more continuous variables, such as profit or loss, based on other attributes in the dataset Segmentation algorithms - divide data into groups, or clusters, of items that have similar properties Association algorithms - find correlations between different attributes in a dataset Sequence analysis algorithms - summarize frequent sequences or episodes in data, such as a Web path flow

22 Is it ethical to use Data Mining?

23 Is it Ethical? Goes both ways…depending on perspective


Download ppt "By: Megan Ryan Phillip Striggow Jonathan Zamora Marquenon Franklin"

Similar presentations


Ads by Google