Metis Data Science Meetup:

Presentation transcript:

Metis Data Science Meetup: Analyzing IBM Watson IoT Events Using Spark & Jupyter Notebooks
Saswata Sengupta, Nov 30, 2016

Agenda
+ Overview
  - Introduction to IBM Watson IoT Platform
  - Introduction to Node-RED
+ IBM and Spark
  - Spark as a Service
  - IBM Data Science Experience
  - Python, Pandas and DataFrame
  - Jupyter notebook and visualization
+ Recorded demo and walkthrough

IBM Watson IoT Platform and Node-RED

What is IBM Watson IoT Platform?
The IBM Internet of Things service lets your apps communicate with and consume data collected by your connected devices, sensors, and gateways. IBM recipes make it easy to get devices connected to the IBM Internet of Things cloud. Apps can then use the real-time and REST APIs to communicate with your devices and consume the data you've set them up to collect. The service is available as part of the IBM Bluemix cloud offering. IBM | Spark

Connect your devices securely to the cloud
Before your apps can get to work, you need to get your devices connected. IBM has a set of verified instructions, or 'recipes', for connecting devices, sensors and gateways from a variety of partners and individuals.

Build an app that talks to your devices
Communication between your devices and the cloud happens via the open, lightweight MQTT protocol. For example, you might have a sensor that collects and sends humidity readings every minute. IBM REST and real-time APIs allow you to quickly pull that device data into your apps for further analysis.
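To make the humidity-sensor example concrete, here is a minimal Python sketch of what a device might prepare before publishing over MQTT. The client ID layout (d:org:type:id) and the topic layout (iot-2/evt/.../fmt/...) are assumptions based on the conventions commonly described for the Watson IoT Platform, so check them against the current platform documentation. The organization, device names, and the "d" wrapper in the payload are hypothetical, and no MQTT client library is used here.

```python
import json
import time


def device_client_id(org_id, device_type, device_id):
    """Build a device client ID (assumed 'd:<org>:<type>:<id>' layout)."""
    return f"d:{org_id}:{device_type}:{device_id}"


def event_topic(event_id, fmt="json"):
    """Build the topic a device publishes an event to (assumed layout)."""
    return f"iot-2/evt/{event_id}/fmt/{fmt}"


def humidity_payload(humidity_pct, timestamp=None):
    """Serialize one reading; the 'd' wrapper is an assumed convention."""
    ts = time.time() if timestamp is None else timestamp
    body = {"d": {"humidity": humidity_pct, "ts": int(ts)}}
    return json.dumps(body, sort_keys=True)


# Hypothetical organization and device names, for illustration only.
client_id = device_client_id("myorg1", "humidity-sensor", "greenhouse-01")
topic = event_topic("status")
payload = humidity_payload(47.5, timestamp=1480500000)

print(client_id)
print(topic)
print(payload)
```

An MQTT client would then connect with this client ID and publish the payload to the topic once a minute.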

What is Node-RED?
Node-RED provides a browser-based flow editor that makes it easy to wire together flows using the wide range of nodes in the palette. Flows can then be deployed to the runtime in a single click. JavaScript functions can be created within the editor using a rich text editor. A built-in library allows you to save useful functions, templates or flows for re-use. The Node-RED Starter pack is available as part of the IBM Bluemix cloud offering.

IBM and Spark

Spark Ecosystem
[Diagram: business applications and business intelligence sit on top of Apache Spark (Spark SQL, Streaming, MLlib for machine learning, GraphX), which draws on Hadoop, databases, mainframes and data warehouses.]

Spark is complementary to Hadoop, but much faster, with in-memory performance.

IBM's 3 pillars for Apache Spark
1. Spark as a component of BigInsights
2. Spark-as-a-Service on Bluemix
3. IBM products on Spark: SPSS, Commerce, Security, Power, Healthcare, and many more

Why does Spark matter to a business?
1. Spark makes it easier to access and work with all data
2. Spark lets you develop line-of-business applications faster
3. Spark learns from data and delivers in real time
- Enables new data-based use cases
- All data: internal/external, structured/unstructured
- Real-time insights from all data sources
- Automates analytics with machine learning
- Clients that lead in data lead in their industry
[Diagram labels: Data Science, Design, Development]

IBM has the largest investment in Spark of any company in the world
- IBM Spark Technology Center
- Top committer/contributor
- 300+ inventors
- Commitment to educate 1 million data scientists
- Contributed SystemML
- Founding member of AMPLab
- Partnerships in the ecosystem

IBM Data Science Experience: Python, Pandas and Jupyter Notebooks

IBM Data Science Experience
Now you can create value faster using the best of open source and IBM together. Built for data scientists by data scientists, the IBM Data Science Experience is a new cloud-based, social workspace that helps data professionals consolidate, create and collaborate across multiple open source tools such as R, Python, and Pandas with its DataFrame. Data Science Experience uses Apache Spark as a service for distributed analytic workload management.

Python, Pandas and DataFrame
Python is a widely used high-level, general-purpose, interpreted, dynamic programming language. Its design philosophy emphasizes code readability, and its syntax allows programmers to express concepts in fewer lines of code than is possible in languages such as C++ or Java. Data engineers and data scientists have been using Python with Pandas, NumPy and SciPy to build analytical models. It is a viable alternative to the R programming language. Pandas is a Python library for data manipulation and analysis. A DataFrame is a 2-dimensional labeled data structure with columns of potentially different types. It is similar to a spreadsheet or SQL table, or a dict of Series objects.
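The "dict of Series" description can be illustrated with a short sketch. It assumes pandas is installed; the device names and readings are made up for illustration and do not come from the demo.

```python
import pandas as pd

# Hypothetical readings: each column is a Series with its own type.
readings = {
    "device_id": pd.Series(["sensor-a", "sensor-b", "sensor-a"]),
    "humidity": pd.Series([41.5, 39.0, 42.1]),
    "online": pd.Series([True, True, False]),
}

# A DataFrame built from a dict of Series: one labeled column per key.
df = pd.DataFrame(readings)

# Table-style operation: average humidity per device.
mean_by_device = df.groupby("device_id")["humidity"].mean()

print(df.dtypes)
print(mean_by_device)
```

Each column keeps its own type (text, float, boolean), which is what distinguishes a DataFrame from a plain 2-dimensional numeric array.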

What Is A Jupyter Notebook?
In this case, "notebook" or "notebook documents" denote documents that contain both code and rich text elements, such as figures, links and equations. Because of this mix of code and text elements, these documents are the ideal place to bring together an analysis description and its results, and they can be executed to perform the data analysis in real time.
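As a rough illustration of that mix, a notebook file is a JSON document holding a list of cells, each marked as text or code. The sketch below builds a minimal two-cell notebook with the standard library only. It follows the version 4 notebook format as commonly documented; the cell contents are hypothetical.

```python
import json

# A minimal notebook: one rich-text (markdown) cell and one code cell.
notebook = {
    "nbformat": 4,
    "nbformat_minor": 0,
    "metadata": {},
    "cells": [
        {
            "cell_type": "markdown",
            "metadata": {},
            "source": ["# Humidity analysis\n", "Average reading across samples."],
        },
        {
            "cell_type": "code",
            "execution_count": None,  # not yet executed
            "metadata": {},
            "outputs": [],  # filled in when the cell is run
            "source": ["readings = [41.5, 39.0, 42.1]\n", "sum(readings) / len(readings)"],
        },
    ],
}

# Serialize as it would be stored in an .ipynb file, then read it back.
serialized = json.dumps(notebook, indent=1)
cell_types = [cell["cell_type"] for cell in json.loads(serialized)["cells"]]

print(cell_types)
```

Saving the serialized text to a file with an .ipynb extension should give a document that Jupyter can open, with the description and the code side by side.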

IBM Watson IoT Event Processing and Analysis Using Spark and Notebooks

Architecture Components
In this demo we will learn ways to explore historical events captured on the IBM Watson IoT Platform.
- IBM's Watson IoT Platform provides an integrated cloud-and-edge analytics programming model that allows control and optimization over the data flowing between edge devices and the cloud.
- Apache Spark, through Spark Streaming (an extension of the core Spark API), enables scalable, high-throughput, fault-tolerant processing of live data streams.
- The Jupyter Notebook is a web application that allows you to create and share documents that contain live code, equations, visualizations and explanatory text.
- Node-RED is a visual tool for wiring the Internet of Things. It will be used to connect to the IoT Platform and store the data in Cloudant.
- Cloudant is a non-relational, distributed database service. It is based on the Apache-backed CouchDB project and the open source BigCouch project. Cloudant will be used as the datastore.
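The kind of historical analysis the demo targets can be sketched with plain Python before moving to Spark. The example below assumes events have already been read out of the datastore as JSON-like documents. The field names (deviceId, timestamp, payload.d.temp) and all values are hypothetical and are not taken from the demo notebook.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean

# Hypothetical stored events; the document shape is an assumption.
events = [
    {"deviceId": "greenhouse-01", "timestamp": "2016-11-30T10:05:00", "payload": {"d": {"temp": 21.0}}},
    {"deviceId": "greenhouse-01", "timestamp": "2016-11-30T10:35:00", "payload": {"d": {"temp": 23.0}}},
    {"deviceId": "greenhouse-01", "timestamp": "2016-11-30T11:10:00", "payload": {"d": {"temp": 24.0}}},
    {"deviceId": "greenhouse-02", "timestamp": "2016-11-30T10:20:00", "payload": {"d": {"temp": 19.5}}},
]


def hourly_mean_temp(docs):
    """Average temperature per (device, hour) bucket."""
    buckets = defaultdict(list)
    for doc in docs:
        ts = datetime.fromisoformat(doc["timestamp"])
        hour = ts.replace(minute=0, second=0, microsecond=0)
        buckets[(doc["deviceId"], hour)].append(doc["payload"]["d"]["temp"])
    return {key: mean(values) for key, values in sorted(buckets.items())}


hourly = hourly_mean_temp(events)
for (device, hour), avg in hourly.items():
    print(device, hour.isoformat(), avg)
```

With Spark, the same grouping would typically be expressed on a distributed DataFrame, so that it scales beyond what fits in a single notebook process.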

Demo steps are available here: https://developer.ibm.com/recipes/tutorials/timeseries-data-analysis-of-iot-events-by-using-jupyter-notebook/
Shared notebook: https://apsportal.ibm.com/analytics/notebooks/4987cadc-0a92-4a32-aad2-eadfdb3f5440/view?access_token=cd5199032caf7a85429f689b5b673f947b4dbe39fb8c2904322a35600ae3f04f
You will need to create a datascience.ibm.com account and a Bluemix account (it is free).

Thank You
Please feel free to contact me at Saswata.sengupta@ibm.com