Charles Tappert Seidenberg School of CSIS, Pace University

Slides:



Advertisements
Similar presentations
Nokia Technology Institute Natural Partner for Innovation.
Advertisements

Setting Big Data Capabilities Free How to Make Business on Big Data? Stig Torngaard, Partner Platon.
Big Data Management and Analytics Introduction Spring 2015 Dr. Latifur Khan 1.
Big Data and Predictive Analytics in Health Care Presented by: Mehadi Sayed President and CEO, Clinisys EMR Inc.
CloudSocial Mobility Big data Social connections, mobility, cloud delivery and pervasive information are converging in a powerful way. This convergence.
Observation Pattern Theory Hypothesis What will happen? How can we make it happen? Predictive Analytics Prescriptive Analytics What happened? Why.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Architecting for the Internet of Things
Global Cognitive Computing Market
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
Conceptual Modeling of the Healthcare Ecosystem Eng. Andrei Vasilateanu.
Introduction to Data Science Kamal Al Nasr, Matthew Hayes and Jean-Claude Pedjeu Computer Science and Mathematical Sciences College of Engineering Tennessee.
Big Data Use Cases in the cloud Peter Sirota, GM Elastic
Basic Marketing Research Customer Insights and Managerial Action
Rapid Mobile Development Enterprises are having a tough time keeping up with the demand for mobile apps. With these growing demands, businesses are expecting.
Examining Earth Science GATE Unit Exercise Murray Lewis.
Ecommerce … or electronic commerce refers to systems that support electronically executed business transactions. B2C B2B C2C In this section: Ecommerce.
© 2011 IBM Corporation Smarter Software for a Smarter Planet The Capabilities of IBM Software Borislav Borissov SWG Manager, IBM.
T.L. Kennedy Secondary School
Big Data Adoption Drivers Sources: US Data: IDC 2012 Vertical IT & Communications Survey. N = 4177 LatAm Data: PRELIMINARY RESULTS from IDC Latin America.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
BIGDATA AND DATASCIENCE By Sigma Analytics and Computing.
Management Information Systems
The Information System Opportunity
Big Data. What is Big Data? Big Data Analytics: 11 Case Histories and Success Stories
Reference: An Overview of Business Intelligence Technology, Communications of The ACM, August VOL 54 NO.8
Data Science and Big Data Analytics Chap1: Intro to Big Data Analytics
$20 billion business 230 million printers sold  170 million inkjet  60 million laserjet #1 in ink, laser, scanners, designjet, print servers Consistent.
Big Data Analytics Large-Scale Data Management Big Data Analytics Data Science and Analytics How to manage very large amounts of data and extract value.
CSE 102 Introduction to Computer Engineering What is Computer Engineering?
Associate of Applied Science Degree Information & Telecommunication Technology Scott edu
Computer New Student Orientation. Overview Our degree programs Jobs in the Computing Field Student Projects Faculty Research.
Marv Adams Chief Information Officer November 29, 2001.
Big Data – Big Opportunity Mohammad Khansari ITRC President Jan 2015 ITRC, Tehran, Iran.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
What we know or see What’s actually there Wikipedia : In information technology, big data is a collection of data sets so large and complex that it.
OMIS 694, Big Data Analytics
IoT Meets Big Data Standardization Considerations
Big Data Analytics Platforms. Our Team NameApplication Viborov MichaelApache Spark Bordeynik YanivApache Storm Abu Jabal FerasHPCC Oun JosephGoogle BigQuery.
MAR Capability Overview Deck Protean Analytics.
Axis AI Solves Challenges of Complex Data Extraction and Document Classification through Advanced Natural Language Processing and Machine Learning MICROSOFT.
BUSINESS INTELLIGENCE & ADVANCED ANALYTICS DISCOVER | PLAN | EXECUTE JANUARY 14, 2016.
1 Global Hadoop Market Forecast 2014 –2020 Global Hadoop Market Forecast 2014 –2020 Occams Business Research & Consulting.
Advanced Analytics Turin April, Index 2 ■ Advanced Analytics Approach –Architecture Overview –Methodology –Professional Skills ■ Impacted Areas.
Big Data analytics in the Cloud Ahmed Alhanaei. What is Cloud computing?  Cloud computing is Internet-based computing, whereby shared resources, software.
Chapter 8: Web Analytics, Web Mining, and Social Analytics
© 2007 IBM Corporation IBM Software Strategy Group IBM Google Announcement on Internet-Scale Computing (“Cloud Computing Model”) Oct 8, 2007 IBM Confidential.
© 2013 TM Forum | 1 V Catalysts and Innovation Projects Rapid Technology Innovation Projects The Hub at Management World 2013.
FACULTY EXTERNSHIP OPPORTUNITIES IN DATA SCIENCE AND DATA ANALYTICS Facilitated by: FilAm Software Technology, Clark Freeport Zone Ecuiti, San Francisco,
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
Wake Technical Community College “Wake Tech” Largest community college in NC 70,000+ students a year attending.
Data Analytics (CS40003) Introduction to Data Lecture #1
Big Data in Technical and Vocational Education (TVE)
CNIT131 Internet Basics & Beginning HTML
Data Analytics 1 - THE HISTORY AND CONCEPTS OF DATA ANALYTICS
Curriculum and Career preparation
Business Intelligence Minor
7 Big Ideas of Computing:
Data Quality: Practice, Technologies and Implications
TESTING OF BIG DATA & PREDICTIVE ANALYTICS
Introduction Data Mining for Business Analytics.
Experiences with Business Analytics Curriculum Implementation
OMIS 665, Big Data Analytics
What are your Career Options?
Big Data.
 Deep Analytical Talent  Data Savvy Professionals  Technology and Data Enablers.
INNOvation in TRAINING BUSINESS ANALYSTS HAO HElEN Zhang UniVERSITY of ARIZONA
Charles Tappert Seidenberg School of CSIS, Pace University
The Most In-Demand Skills for Cloud Computing.
Big DATA.
Presentation transcript:

Charles Tappert Seidenberg School of CSIS, Pace University Data Science and Big Data Analytics Chap 9: Advanced Analytical Theory and Methods: Text Analysis Charles Tappert Seidenberg School of CSIS, Pace University

Data Analytics Lifecycle Data Analytics Lifecycle Overview Phase 1: Discovery Phase 2: Data Preparation Phase 3: Model Planning Phase 4: Model Building Phase 5: Communicate Results Phase 6: Operationalize Case Study: GINA

2.1 Data Analytics Lifecycle Overview Huge volume of data Not just thousands/millions, but billions of items Complexity of data types and structures Varity of sources, formats, structures Speed of new data creation and grow High velocity, rapid ingestion, fast analysis

2.2 Phase 1: Discovery Mobile sensors Social media – 700 Facebook updates/sec in2012 Video surveillance Video rendering Smart grids Geophysical exploration Medical imaging Gene sequencing – more prevalent, less expensive

2.3 Phase 2: Data Preparation image

2.4 Phase 3: Model Planning image

2.6 Phase 5: Communicate Results Structured – defined data type, format, structure Transactional data, OLAP cubes, RDBMS, CVS files, spreadsheets Semi-structured Text data with discernable patterns – e.g., XML data Quasi-structured Text data with erratic data formats – e.g., clickstream data Unstructured Data with no inherent structure – text docs, PDF’s, images, video

2.7 Phase 6: Operationalize image

2.8 Case Study: Global Innovation Network and Analysis (GINA) image

1.2 State of the Practice in Analytics Business Intelligence (BI) versus Data Science Current Analytical Architecture Drivers of Big Data Emerging Big Data Ecosystem and a New Approach to Analytics

Business Intelligence (BI) versus Data Science image

Business Intelligence (BI) versus Data Science image

Current Analytical Architecture image

Current Analytical Architecture image

Drivers of Big Data image

Emerging Big Data Ecosystem and a New Approach to Analytics Four main groups of players Data devices Games, smartphones, computers, etc. Data collectors Phone and TV companies, Internet, Gov’t, etc. Data aggregators – make sense of data Websites, credit bureaus, media archives, etc. Data users and buyers Banks, law enforcement, marketers, employers, etc.

Emerging Big Data Ecosystem and a New Approach to Analytics image

1.3 Key Roles for the New Big Data Ecosystem image

Three Key Roles of the New Big Data Ecosystem Deep analytical talent Advanced training in quantitative disciplines – e.g., math, statistics, machine learning Data savvy professionals Savvy but less technical than group 1 Technology and data enablers Support people – e.g., DB admins, programmers, etc.

Three Recurring Data Scientist Activities Reframe business challenges as analytics challenges Design, implement, and deploy statistical models and data mining techniques on Big Data Develop insights that lead to actionable recommendations

Profile of Data Scientist Five Main Sets of Skills image

Profile of Data Scientist Five Main Sets of Skills Quantitative skill – e.g., math, statistics Technical aptitude – e.g., software engineering, programming Skeptical mindset and critical thinking – ability to examine work critically Curious and creative – passionate about data and finding creative solutions Communicative and collaborative – can articulate ideas, can work with others

1.4 Examples of Big Data Analytics Retailer Target Uses life events: marriage, divorce, pregnancy Apache Hadoop Open source Big Data infrastructure innovation MapReduce paradigm, ideal for many projects Social Media Company LinkedIn Social network for working professionals Can graph a user’s professional network 250 million users in 2014

Focus of Course Focus on quantitative disciplines – e.g., math, statistics, machine learning Provide overview of Big Data analytics In-depth study of a several key algorithms