Understanding the field & setting expectations.  Personal  International  UNT Alumni (Mathematics)  Academic  Economics & Mathematics  Professional.

Slides:



Advertisements
Similar presentations
Nokia Technology Institute Natural Partner for Innovation.
Advertisements

Career Opportunities in Statistical Computing. Two Perspectives on Careers in Statistical Computing 1.Software development opportunities at SAS 2.Emerging.
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Chapter 9 Business Intelligence Systems
Lecture Microsoft Access and Relational Database Basics.
Decision Support Chapter 10. Overview Databases are really information technology Decision Support is a business application that actually uses databases.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Chapter 14 The Second Component: The Database.
Financial Engineering Club Career Path and Prep. Entry Level Career Paths Type 1: Research based Background: Physics, Electrical Engineering, Applied.
Building Knowledge-Driven DSS and Mining Data
Data Analytics Program at Drake Brad C. Meyer, Chair Information Management and Business Analytics.
Data Mining – Intro.
Data mining By Aung Oo.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Data Mining: A Closer Look
Lecture-8/ T. Nouf Almujally
A Primer on Jobs in Industry Nitin Jindal Department of Computer Science University of Illinois at Chicago.
Cody Clifton KU GSO/AWM joint meeting April 29, 2014.
Building Efficient Time Series Similarity Search Operator Mijung Kim Summer Internship 2013 at HP Labs.
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, 2.
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved. Decision Support Systems Chapter 10.
DATA MINING Prof. Sin-Min Lee Surya Bhagvat CS 157B – Spring 2006.
OBJECT ORIENTED SYSTEM ANALYSIS AND DESIGN. COURSE OUTLINE The world of the Information Systems Analyst Approaches to System Development The Analyst as.
Analytics for Smart Decisions The Wonderful World of Big Data, Business Analytics and Business Intelligence - Demystified.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
SQL Jan 20,2014. DBMS Stores data as records, tables etc. Accepts data and stores that data for later use Uses query languages for searching, sorting,
Big Data: Industry Needs Data Scientists Data Analysts Data Infrastructure Engineers Developers (all kinds) 2-3:30, August 10, 2015 Room 261 RSC.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Site Technology TOI Fest Q Celebration From Keyword-based Search to Semantic Search, How Big Data Enables That?
© 2006 Pearson Education Canada Inc. 3-1 Chapter 3 Database Management PowerPoint Presentation Jack Van Deventer Ward M. Eagen.
Computers and Careers Graduation Project By: John Vonelli.
Mining of Massive Datasets Edited based on Leskovec’s from
Relational Database Systems Bartosz Zagorowicz. Flat Databases  Originally databases were flat.  All information was stored in a long text file, called.
Table of Contents Introduction Why Data Analytics Data Analytics Terminology Predictive Analytics Data Analytics challenges Data Analytics Platform Data.
D ATA S CIENTISTS Who are they and what do they do?
Big Data Yuan Xue CS 292 Special topics on.
1 Seattle University Master’s of Science in Business Analytics Key skills, learning outcomes, and a sample of jobs to apply for, or aim to qualify for,
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
Database Processing Chapter "No, Drew, You Don’t Know Anything About Creating Queries.” Copyright © 2015 Pearson Education, Inc. Operational database.
Introduction.  Instructor: Cengiz Örencik   Course materials:  myweb.sabanciuniv.edu/cengizo/courses.
What is BIG DATA? Why learn HADOOP? &. What is Big Data? Big data is a data that is to large, complex and dynamic for any conventional data to capture,
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
Book web site:
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Computer Information Technology
Oracle Advanced Analytics
Systems Analysis and Design in a Changing World, Fifth Edition
Big Data is a Big Deal!.
Careers in data science – opportunities & challenges
Data Mining in Higher Education
ANOMALY DETECTION FRAMEWORK FOR BIG DATA
Big Data A Quick Review on Analytical Tools
MIS2502: Data Analytics Advanced Analytics - Introduction
Introduction to R Programming with AzureML
Systems Analysis – ITEC 3155 Evaluating Alternatives for Requirements, Environment, and Implementation.
Coding - The Ultimate Survival Skill
Dane Stubben QuintilesIMS Database Manager
Primary Purpose: To provide apprenticeship, certificate, technical diploma, and associate degree programs that: Respond to district workforce needs Prepare.
Parallel Analytic Systems
TIM TAYLOR AND JOSH NEEDHAM
Course Introduction CSC 576: Data Mining.
MIS2502: Data Analytics Introduction to Advanced Analytics
Second Major in Data Science & Analytics (DSA)
McGraw-Hill Technology Education
Big DATA.
Welcome! Knowledge Discovery and Data Mining
MIS2502: Data Analytics Introduction to Advanced Analytics and R
Presentation transcript:

Understanding the field & setting expectations

 Personal  International  UNT Alumni (Mathematics)  Academic  Economics & Mathematics  Professional  Academic Research, Hilton, Ansira, Sabre

 Analytics: Discovery and communication of meaningful patterns in Data  Data Science: The novel application of algorithms and statistical techniques to solve business problems.  Reality: Different meanings at different companies  The culture of the company determines the nature of work that you do  A relatively new field  Most Companies are in the process of defining their analytics strategy  Titles common to the field:  Data Scientist, Analytics Consultant, Statistical Modeler, Risk Analyst, Statistician.

 Forecasting  “Predictive Analytics”: Classification  Logistic Regression, SVM, Random Forest, Gradient Boosting  Fraud, Customer Acquisition  Customer Retention/ Churn Modeling  Who is likely to leave for a competitor  Recommendation Engines  Netflix Challenge  Customer Choice Modeling  What will people buy  Multinomial Logit Model  Optimization  Market Mix Modeling  Clustering/ Market Basket Analysis

 Most Companies house their data in relational databases  Oracle, Teradata, IBM DB2, Microsoft SQL  SQL queries used to retrieve data  SQL: a basic entry level requirement to work in this field  Most of tasks require significant amounts of time and energy combining tables and data  Hadoop -An open source distributed framework for storing and processing large amounts of data  Petabytes  Java based  Map-Reduce  Pig, Hive-SQL syntax-Facebook, Impala-SQL syntax, Spark  Spark – UTD offers a Spark Course  HTML  JSON

 Statistical Programming Languages  R- Open Source, easy to learn, unparalleled no. of packages and functionality, Memory Limitations.  SAS – Very Common in Businesses but losing popularity, expensive, losing market share to R, handles large data sets well.  Python – Versatile, reasonable no. of packages, R’s biggest competitor.  Matlab – More common in Engineering field.  General Programming Languages  JAVA – Not knowing java has cost me at least 4 jobs.  C/ C++ - For writing faster R programs  Scala – Spark more common among people on the forefront of development

 Search for positions you are overqualified for.  More likely to sponsor you  State your status as soon as possible  Some companies have policies against hiring international students.  myvisajobs.com  See companies that are sponsoring  See salaries for negotiation purposes  Others.

 SQL  Fundamental Requirement  Experience with Large Data Sets  10k records is no large  SAS/ R  Take courses  Free courses at UNT  Very Strong in at least one area (Optimization, Forecasting, Classification)  Specialize in something  Linux Experience  Get exposure  JAVA  Learn it.  Multiple Projects (At least 3)- Code Research Paper, Apply a technique to company data, participate in Kaggle, do internship.

 Universities  UTD – School of Management/ Operations Research  OSU (Oklahoma) – Analytics and Data Mining Programs  UNT-Economics  SMU- Statistics  Economics, Mathematics, Statistics, Operations Research, Computer Science, Engineering.  Companies  AT&T, Sabre, Epsilon, Amazon,  AnalyticRecruiting.com (lots of Phone Interviews), Kforce.com (Very Promising and takes care of Visa issues)

 Kaggle.com  The Home of Data Science  Company recruiting & Pays winners  Many Kaggle winners manage Analytics teams  Compete! Get recognized.  Internships are extremely important  AT&T, Sabre, Epsilon, Amazon, Santander, Capital One in Plano  Companies prefer to hire Mathematicians  Never accept first offer  Jumping around vs. Staying at one company  They always divide by 2  Dallas R user group- Network  Meetup.com – Network  Informs local chapter

 The Elements of Statistical Learning: Data Mining, Inference and Prediction.  The Art of R Programming  The Theory and Practice of Revenue Management