BigML “Warren Buffett is one of the best learning machines on this earth. The turtles which outrun the hares are learning machines. If you stop learning.

Slides:



Advertisements
Similar presentations
Creating the Ultimate Online Customer-Service Experience Stefan Beeli, Vice President ESP Computer Services Choosing the proper level of Technology A look.
Advertisements

Data Mining: A Closer Look
An MSDN Subscription acts as a lifeline for software developers, testers, architects, IT professionals, database engineers, and others. It provides priority.
Peter Myers Bitwise Solutions Pty Ltd. Predictive Analytics PresentationExplorationDiscovery Passive Interactive Proactive Business Insight Canned.
Basic Data Mining Techniques
Business Intelligence, Data Mining and Data Analytics/Predictive Analytics By: Asela Thomason IS 495 Summer 2015.
Configuration and E-commerce Invited talk, IFORS July 2002, Edinburgh, Scotland Jesper Møller IT University, Denmark [
A Genetic Algorithm-Based Approach for Building Accurate Decision Trees by Z. Fu, Fannie Mae Bruce Golden, University of Maryland S. Lele, University of.
FPDS- NG Reports Overview December 16, Today’s Goals Provide an overview of the FPDS-NG reporting capability Demonstrate each of the reporting tools.
By Anthony W. Hill & Course Technology 1 Product Evaluation Strategies and Support Standards Beisse.
Avalanche Internet Data Management System. Presentation plan 1. The problem to be solved 2. Description of the software needed 3. The solution 4. Avalanche.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
The CRISP Data Mining Process. August 28, 2004Data Mining2 The Data Mining Process Business understanding Data evaluation Data preparation Modeling Evaluation.
McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. 1.
Virtual Classes Provides an Innovative App for Education that Stimulates Engagement and Sharing Content and Experiences in Office 365 MICROSOFT OFFICE.
GCSE ICT Systems Analysis. Systems analysis Systems analysis is the application of analytical processes to the planning, design and implementation of.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Monday, February 22,  The term analytics is often used interchangeably with:  Data science  Data mining  Knowledge discovery  Extracting useful.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
BIG DATA. The information and the ability to store, analyze, and predict based on that information that is delivering a competitive advantage.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
In an increasingly competitive industry is certified by a recognized provider as Microsoft exam will dramatically improve your chances busy. Microsoft.
A Suite of Products that allow you to Predict Outcomes, Prescribe Actions and Automate Decisions.
Cloud Analytics Platforms Christian Frey. About AIDA Our mission is to advance knowledge in data analytics through research, education and outreach Our.
Public Cloud Market to Global Analysis and Forecasts by Services, Applications No of Pages: 150 Publishing Date: Jan 2017 Single User PDF: US$ 3900.
Social Media Analytics Market to Global Analysis and Forecast by Type, Application and Vertical No of Pages: 150 Publishing Date: Jan 2017 Single.
Social Media Analytics Market to Global Analysis and Forecast by Type, Application and Vertical No of Pages: 150 Publishing Date: Feb 2017 Single.
“Scorecards and PokemonGO”
Data Platform and Analytics Foundational Training
Katie Goldstein Christopher Bilz Brian Northern Michael Yang
How Artificial Intelligence is Changing the Supply Chain
Introduction to Tableau
Group Assignment Moore Medical Corporation
as presented on that date, with special formatting removed
Data Based Decision Making
ExamUnion CCIE Data Center V2.0 Exam
ANOMALY DETECTION FRAMEWORK FOR BIG DATA
MIS2502: Data Analytics Advanced Analytics - Introduction
BRK3288-Discover data-driven apps that learn and adapt
Webparts360: A Low-Code App Development Tool That Enables Non-Programmers to Build Business Solutions for Microsoft Office 365 Quickly, Easily OFFICE 365.
AF1: Thinking Scientifically
It’s All About Me From Big Data Models to Personalized Experience
Microsoft Ignite /22/2018 3:27 PM BRK2121
Data Mining 101 with Scikit-Learn
The BI Squad Thomas Fernandes Trevor Richardson Austin Stone
SocialBoards Self-Service, Multichannel Support Ticket Notifications in Microsoft Office 365 Groups Help Customer Care Teams to Provide Better Care OFFICE.
OpenWorld 2018 How to Create Chatbots with OMCe
Cloud Analytics Platforms
MetaShare, Powered by Azure, Gives SharePoint a User-Friendly, Intuitive User Interface and Added App Features with No Added Administrative Tasks OFFICE.
It’s About Time – ScheduleMe Outlook Add-In for Office 365 Enables Users to Schedule Meetings Easily with People Outside of Your Organization Partner Logo.
MIS5101: Data Analytics Advanced Analytics - Introduction
Dive into Predictive Maintenance using Cortana Intelligence Suite
InLoox PM Web App product presentation
Section 14.1 Section 14.2 Identify the technical needs of a Web server
Machine Learning Interpretability
Course Introduction CSC 576: Data Mining.
Yooba File Sync: A Microsoft Office 365 Add-In That Syncs Sales Content in SharePoint Online to Yooba’s Sales Performance Management Solution OFFICE 365.
Reportin Integrates with Microsoft Office 365 to Provide an End-to-End Platform for Financial Teams That Simplifies Report Creation and Management OFFICE.
Amazon Machine Learning
Business Intelligence
MIS2502: Data Analytics Classification Using Decision Trees
Welcome! Knowledge Discovery and Data Mining
AZ-900 Exam Braindumps
Welcome to Microsoft Azure for Research Training!
MS-900 MS-101 Dumps PDF 2019
Azure Machine Learning
THE ASSISTIVE SYSTEM SHIFALI KUMAR BISHWO GURUNG JAMES CHOU
Getting Started with Microsoft Azure Machine Learning
Presentation transcript:

BigML “Warren Buffett is one of the best learning machines on this earth. The turtles which outrun the hares are learning machines. If you stop learning in this world, the world rushes right by you.” Lucas Remmerswaal

What is BigML? www.bigml.com A predictive analytics tool that is offered through software-as-a-service (SaaS). In particular, it uses machine learning algorithms to create decision trees www.bigml.com

About BigML “The service can be used in production mode or development mode. Development mode is free but limited in the size of tasks that can be completed. Production mode is a paid mode and credits can be purchased ad hoc in blocks or on a subscription basis. This is a familiar pattern from other cloud based services like storage or compute servers” Source: http://machinelearningmastery.com/bigml-review-discover-the-clever-features-in-this-machine-learning-as-a-service-platform/

Interesting Fact BigML was not evaluated by Gartner: “Among those evaluated by Gartner but excluded from the analysis: BigML, Business-Insight, Dataiku, Dato, H2O.ai, MathWorks, Oracle, Rapid Insight, Salford Systems, Skytree and TIBCO.”

Industry Competition “The primary goal of many machine learning models is to make accurate predictions from unseen data.” How well does BigML do in this respect? Source: https://blog.bigml.com/2012/08/27/machine-learning-throwdown-part-4-predictions/

https://bigml.com/pricing Pros / Cons Pros Easy of use (no need to write code to make predictions) Ability to store models locally and make predictions offline Functionality (fast predictions offline) Cons No batch predictions (one per API call) Individual predictions through the API are very slow Cost ($45,000/year for 1 server (8cores) Source: https://blog.bigml.com/2012/08/27/machine-learning-throwdown-part-4-predictions/ https://bigml.com/pricing

BigML & MIST 5620 One assignment BigML Project (Group) – 10 points Gets you familiarized with BigML. You will first have training, answer a few questions about the 2012 presidential election and churn, and also use a dataset to create a predictive machine learning model of your choice. Finally, you will turn a pdf file of your work

BigML & MIST 5620 Step 1: Register to use BigML Its use is free, but you need to register here Step 2: Learn about BigML Features can be learned here. More details and a complete example is available here. Read more about BigML here. At each node in a decision tree, BigML provides information about how confident the prediction is correct at that node. To understand the meaning of confidence, read an article here

BigML & MIST 5620 Step 3: Your Project Description Part A. In the 2012 Presidential Election, Obama and the Democrats received considerable recognition for the use of analytics to understand the electorate and get out the vote. Use the 2012 Presidential Election Winner dataset in BigML to answer the following questions: Which single variable best predicts whether a person voted for Obama? Which path has the highest confidence level in predicting whether a person voted for Obama or Romney?

BigML & MIST 5620 Step 3: Your Project Description Part B. Churn is a major problem for telecommunications firms. It is not unusual for 20 percent of a company’s customers to not renew their contracts. Because of this, Telcos are using analytics to identify customers that are most likely to churn so they can intervene to try to influence these customers to stay, such as providing attractive offers to renew or the promise of better service through a new cell tower. Use the Telco churn dataset to develop a model to predict churn. Exclude any variables that are unlikely to be related to churn and provide the logic behind your thinking. Also check for any variables that need to be recategorized from numerical to categorical and discuss why. What is the single most important variable in predicting churn?

BigML & MIST 5620 Step 3: Your Project Description Part C. Using one of the other datasets provided on the BigML site, create a model, and then develop questions that you answer using the Prediction feature. In designing your model, first build it using part of the data (say 80%) and then test it using the remaining data. How good is your model? Provide a discussion.

BigML & MIST 5620 Step 3: Your Project Data & Timeline You will use data sources provided on the BigML site. The datasets are small but will give you the experience and skills needed to take on larger projects You have nearly one month to turn in your pdf file on eLc (due date is Dec 2 at 11:59pm) see guidelines here

Conclusion BigML is a BI software designed to help you incorporate machine learning into your organization It offers a variety of analyses (ranging from regression to cluster analysis and anomaly detection) in ways that are accessible and easy for users with no technical knowledge Good accuracy (~.96) It can be expensive though and it lacks batch production