Data Mining with Big data

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

© 2013 IBM Corporation Complexity must be conquered (Data is more than “just” big) Laura Haas IBM Almaden Research Center 1.
SEVENPRO – STREP KEG seminar, Prague, 8/November/2007 © SEVENPRO Consortium SEVENPRO – Semantic Virtual Engineering Environment for Product.
Sponsored by the U.S. Department of Defense © 2005 by Carnegie Mellon University 1 Pittsburgh, PA Dennis Smith, David Carney and Ed Morris DEAS.
Big Data and Predictive Analytics in Health Care Presented by: Mehadi Sayed President and CEO, Clinisys EMR Inc.
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Chapter 9 DATA WAREHOUSING Transparencies © Pearson Education Limited 1995, 2005.
Integration and Insight Aren’t Simple Enough Laura Haas IBM Distinguished Engineer Director, Computer Science Almaden Research Center.
Supported in part by the National Science Foundation – ISS/Digital Science & Technology Analysis of the Open Source Software development community using.
Meta Learning and Active Learning: Meta Learning and Active Learning: Collaborative Knowledge Discovery in Distributed Systems Dr Yonghong Peng Department.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Introduction Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Introduction Facebook How does Facebook use your data? Where do you think.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
Mining Large Data at SDSC Natasha Balac, Ph.D.. A Deluge of Data Astronomy Life Sciences Modeling and Simulation Data Management and Mining Geosciences.
BIGDATA AND DATASCIENCE By Sigma Analytics and Computing.
Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University.
Speaker: Oscar Corcho Building Semantic Sensor Webs and Applications ESWC 2011 Tutorial 29 May 2011.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
1 26 October 2013 Observation and Reflection on Official Statistics against Big Data Challenge Yuan Pengfei Research Institute of Statistical Sciences.
MIS – 3030 Business Technologies Social Media & Conversation Big Data.
DATA MINING IN CLOUD COMPUTING Xxxxxxx -11/27/ DSCI 5240.
Data Mining By Dave Maung.
The Web-DL Environment for Building Digital Libraries from the Web P. Calado 1, M. Gonçalves 2, E. Fox 2, B. Ribeiro-Neto 1, A. Laender 1, A. da Silva.
Graph Data Analytics Arka Mukherjee, Ph.D. Global IDs Resolving Complexity at an Enterprise Scale.
Copyright © 2012, SAS Institute Inc. All rights reserved. ANALYTICS IN BIG DATA ERA ANALYTICS TECHNOLOGY AND ARCHITECTURE TO MANAGE VELOCITY AND VARIETY,
Big Data and Large Scale Data Analysis Andrew Mead School of Life Sciences 23 rd October 2013.
Big Data Analytics Large-Scale Data Management Big Data Analytics Data Science and Analytics How to manage very large amounts of data and extract value.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
+ Big Data IST210 Class Lecture. + Big Data Summary by EMC Corporation ( More videos that.
Cloud Computing & Big Data Group 9 Femme L H Sabaru | Aditya Gisheila N P | Aninda Harapan | Harry | Andrew Khosugih.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Data Mining with Big Data. Abstract Big Data concerns large-volume, complex, growing data sets with multiple, autonomous sources. With the fast development.
IoT Meets Big Data Standardization Considerations
Data Mining with Big Data IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014 Xiangyu Cai ( )
Project number: ENVRI and the Grid Wouter Los 20/02/20161.
Waqas Haider Bangyal. 2 Source Materials “ Data Mining: Concepts and Techniques” by Jiawei Han & Micheline Kamber, Second Edition, Morgan Kaufmann, 2006.
Social Information Processing March 26-28, 2008 AAAI Spring Symposium Stanford University
Hemera KickOff October 5th, 2010 Working Group B5 Efficient management of very large volumes of information for data- intensive applications Gabriel Antoniu,
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Katy Börner Teaching & Research Teaching & Research Katy Börner
Big Data analytics in the Cloud Ahmed Alhanaei. What is Cloud computing?  Cloud computing is Internet-based computing, whereby shared resources, software.
Data Mining Techniques Applied in Advanced Manufacturing PRESENT BY WEI SUN.
BIG DATA BIGDATA, collection of large and complex data sets difficult to process using on-hand database tools.
 The web is referred to as a “massive collection of web pages stored on millions of computers across the world that are linked by the Internet” (Chowdhury,
AUTONOMIC COMPUTING B.Akhila Priya 06211A0504. Present-day IT environments are complex, heterogeneous in terms of software and hardware from multiple.
Oscar E. Cariceo, MSW, NSWM Chile Chapter
Data Analytics 1 - THE HISTORY AND CONCEPTS OF DATA ANALYTICS
SNS COLLEGE OF TECHNOLOGY
MIS2502: Data Analytics Advanced Analytics - Introduction
Data Mining, Data Science, Big Data
Profiling based unstructured process logs
Themes in Geosciences.
CSE5544 Final Project Interactive Visualization Tool(s) for IEEE Vis Publication Exploration and Analysis Team Name: Publication Miner Team Members:
CSE5544 Final Project Interactive Visualization Tool(s) for IEEE Vis Publication Exploration and Analysis Team Name: Publication Miner Team Members:
StYLiD: Structured Information Sharing with User-defined Concepts
Introduction C.Eng 714 Spring 2010.
Web Information retrieval
دانشگاه شهیدرجایی تهران
Data Warehousing and Data Mining
Computing/Modeling with Big Data
تعهدات مشتری در کنوانسیون بیع بین المللی
INNOvation in TRAINING BUSINESS ANALYSTS HAO HElEN Zhang UniVERSITY of ARIZONA
Batyr Charyyev.
Data Mining.
Executive visibility to critical business assets
Big DATA.
Data Analysis and R : Technology & Opportunity
Advanced Geospatial Techniques: Aiding Earth Observation Applications
Presentation transcript:

Data Mining with Big data By: Pouya Otarod Spring 2014

What is …… ? Data Mining computational process of discovering patterns in large data sets Big Data it is the term for a collection of data sets so large and complex that it becomes difficult to process data has exponential growth, both structured and unstructured

How much Data does exist? 2.5 quintillion bytes of data are created EVERY DAY IBM: 90 percent of the data in the world today were produced with past two years Forms of Data????

Big Data Examples October 4th, 2012, the first presidential debate Flicker and its photos

Problem…! Data has grown tremendously This large amount of data is beyond the of software tools to manage Exploring the large volume of data and extracting useful information and knowledge is a challenge, and sometimes, it is almost infeasible

HACE Theorem Heterogeneous, Autonomous, Complex, Evolving Big data starts with large volume, heterogeneous, autonomous sources with distributed and decentralized control, and seeks to explore complex and evolving relationships among data These are characteristics of Big Data This is theorem to model Big Data characteristics

Huge Data with heterogeneous and diverse dimensionality represent huge volume of data Autonomous sources with distributed and decentralized control main characteristics of Big Data Complex and evolving relationships

Data Mining Challenges with Big Data Big Data Mining Platform Dig Data Semantics and Application Knowledge Information Sharing and Data Privacy Domain and Application Knowledge Big Data Mining Algorithm Local Learning and Model Fusion for Multiple Information Sources mining from Sparse, Uncertain, and Incomplete Data Mining Complex and Dynamic Data

Thanks for you attentions !