Big Data Open Source Software and Projects Introduction I590 Data Science Curriculum August 20 2014 Geoffrey Fox

Presentation transcript:

Big Data Open Source Software and Projects Introduction I590 Data Science Curriculum August Geoffrey Fox School of Informatics and Computing Digital Science Center Indiana University Bloomington

INTRODUCTION Stress Programming Expertise Python and Java

Introduction I
This course studies software used in many commercial activities to study Big Data.
The backdrop for the course is the ~120 software subsystems illustrated at
We will describe the software architecture represented by this collection, which we term HPC-ABDS (High Performance Computing enhanced Apache Big Data Stack).
– A paper discussing this can be found at
– and presentations at
– and
– data-uses-and-proposed-architecture-integrating-high-performance-computing-and-the-apache-stack
Copies of this material may be found at

Introduction II
The course covers the following material:
a) The cloud computing architecture underlying ABDS and how it contrasts with HPC.
b) The software architecture with its different layers at abds.org/kaleidoscope/, covering the broad functionality and rationale for each layer.
c) Application examples.
d) Selected software systems, about 10% of those in the Kaleidoscope, which have already been deployed on FutureGrid systems using OpenStack and Chef recipes.
e) Each student will choose one other open source member of the Kaleidoscope and deploy it as in d).
f) The main activity of the course will be building a significant project using multiple HPC-ABDS subsystems combined with user code and data.
g) Teams of up to 3 students can be formed, with a corresponding increase in scope in activities e) and f).
Grading will be based on participation (10%), ABDS deployment (30%) and Project (60%). The class will interact through postings on a Google community group. The online section will also interact with Google Hangout or equivalent. We will use FutureSystems (FutureGrid) facilities; cloud computing experience is helpful but not essential. Good working experience with Java is required, and Python will be used.
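
The grading split stated above (participation 10%, ABDS deployment 30%, Project 60%) can be sketched as a weighted sum. This is only an illustration: the 0-100 scale for each component, the function name `course_grade`, and the example scores are assumptions, not part of the course material.

```python
# Weights come from the grading statement on the slide.
# The 0-100 component scale and the example scores are hypothetical.
WEIGHTS = {
    "participation": 0.10,
    "abds_deployment": 0.30,
    "project": 0.60,
}


def course_grade(scores):
    """Return the weighted grade for component scores on a 0-100 scale."""
    missing = set(WEIGHTS) - set(scores)
    if missing:
        raise ValueError(f"missing component scores: {sorted(missing)}")
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


if __name__ == "__main__":
    # Hypothetical student: full participation, 80 on deployment, 90 on project.
    example = {"participation": 100, "abds_deployment": 80, "project": 90}
    print(round(course_grade(example), 2))
```

Because the Project carries 60% of the weight, a change of 10 points there moves the final grade by 6 points, twice the effect of the same change in the deployment score.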

DIGITAL DATA AND CLOUD BACKDROP

Gartner Emerging Technology Hype Cycle

Six Business Era Models in the Digital Business Development Path
As set out on the Gartner road map to digital business, there are six progressive business era models that enterprises can identify with today and to which they can aspire in the future. The last 3 are in the Emerging Technologies Hype Cycle.
Stage 1: Analog
Stage 2: Web
Stage 3: E-Business
Stage 4: Digital Marketing
Stage 5: Digital Business
Stage 6: Autonomous

Digital Business Development Stage 4: Digital Marketing
The Digital Marketing stage sees the emergence of the Nexus of Forces (mobile, social, cloud and information).
– Enterprises in this stage focus on new and more sophisticated ways to reach consumers, who are more willing to participate in marketing efforts to gain greater social connection, or product and service value.
– Buyers of products and services have more brand influence than previously, and they see their mobile devices and social networks as preferred gateways.
– Enterprises at this stage grapple with tapping into buyer influence to grow their business.
Digital Marketing tech includes: Software-Defined Anything; Volumetric and Holographic Displays; Neurobusiness; Data Science; Prescriptive Analytics; Complex Event Processing; Big Data; In-Memory DBMS; Content Analytics; Hybrid Cloud Computing; Gamification; Augmented Reality; Cloud Computing; NFC; Virtual Reality; Gesture Control; In-Memory Analytics; Activity Streams; Speech Recognition.

Digital Business Development Stage 5: Digital Business
Digital Business is the first post-nexus stage on the road map and focuses on the convergence of people, business and things.
– The Internet of Things and the concept of blurring the physical and virtual worlds are strong concepts in this stage.
– Physical assets become digitalized and become equal actors in the business value chain alongside already-digital entities, such as systems and apps.
– 3D printing takes the digitalization of physical items further and provides opportunities for disruptive change in the supply chain and manufacturing.
– The ability to digitalize attributes of people (such as the health vital signs) is also part of this stage.
– Even currency (which is often thought of as digital already) can be transformed (for example, cryptocurrencies).
– Enterprises seeking to go past the Nexus of Forces technologies (stage 4) to become a digital business should look to these additional technologies.
Digital Business tech includes: Bioacoustic Sensing; Digital Security; Smart Workspace; Connected Home; 3D Bioprinting Systems; Affective Computing; Speech-to-Speech Translation; Internet of Things; Cryptocurrencies; Wearable User Interfaces; Consumer 3D Printing; Machine-to-Machine Communication Services; Mobile Health Monitoring; Enterprise 3D Printing; 3D Scanners; Consumer Telematics.

Digital Business Development Stage 6: Autonomous
Autonomous represents the final post-nexus stage.
– This stage is defined by an enterprise's ability to leverage technologies that provide humanlike or human-replacing capabilities.
– Using autonomous vehicles to move people or products, or using cognitive systems to write texts or answer customer questions, are examples that mark the Autonomous stage.
– Enterprises seeking to reach this stage to gain competitiveness should consider these technologies on the Hype Cycle.
Autonomous stage tech includes: Virtual Personal Assistants; Human Augmentation; Brain-Computer Interface; Quantum Computing; Smart Robots; Biochips; Smart Advisors; Autonomous Vehicles; Natural-Language Question Answering.

REAL WORLD BIGDATA

My research focus is Science Big Data, but note: largest science ~100 petabytes = total. Science should take notice of commodity. Converse not clearly true? Note 7 ZB (7 × 10^21 bytes) is about a terabyte (10^12 bytes) for each person in the world.
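
The per-person figure above can be checked with a few lines of arithmetic. This is a rough sketch, not part of the original slides: it assumes decimal (SI) units and a world population of about 7 billion, and the name `terabytes_per_person` is made up for the example.

```python
# Decimal (SI) byte units.
TERABYTE = 10**12
ZETTABYTE = 10**21

# Assumption, not from the slides: roughly 7 billion people in the world.
WORLD_POPULATION = 7 * 10**9


def terabytes_per_person(total_bytes, population):
    """Return the average number of terabytes of data per person."""
    return total_bytes / population / TERABYTE


if __name__ == "__main__":
    total = 7 * ZETTABYTE
    print(terabytes_per_person(total, WORLD_POPULATION))
```

Under these assumptions, 7 × 10^21 bytes divided by 7 × 10^9 people gives 10^12 bytes each, which matches the slide's "about a terabyte" per person.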

Hundreds Of Retail Stores Are Closing. No more malls?

Where Are Shoppers Going?

Online! We Are Here

E-Commerce Is Driving Nearly All Retail Growth In US

1 In 20 Retail Dollars Are Already Online

Even online groceries taking off

BASIC TRENDS AND JOBS

Note that translates NOW into smaller devices; in the PAST it translated into faster devices of the same form factor.

Jobs

Jobs v. Countries

McKinsey Institute on Big Data Jobs There will be a shortage of talent necessary for organizations to take advantage of big data. By 2018, the United States alone could face a shortage of 140,000 to 190,000 people with deep analytical skills as well as 1.5 million managers and analysts with the know-how to use the analysis of big data to make effective decisions. At IU, Informatics aimed at 1.5 million jobs. Computer Science covers the 140,000 to 190,