Download presentation
Presentation is loading. Please wait.
Published byLizbeth Garrison Modified over 7 years ago
1
Data Scientist Ahsan Alam (2014-CS-314) Fahad Subzwari (2014-CS-327)
Muhammad Saeed (2014-CS-326) Shoaib Ahmed (2014-CS-298) fahad
2
Contents Data Scientists History Professional Importance
Data Science in Pak / India Types of Data Scientists Data Scientist Statistics (globally) Skills needed Summary
3
Data Science Data science is the study of information and how it can be turned into a valuable resource. Mining data to identify patterns to help an organization in business strategies.
4
Data Scientist Data scientist is a person that excels at analyzing data. A high-ranking professional with the training to make discoveries in the world of big data. Is sometimes used as a synonym for data analyst
5
History Discovered by Jonathan Goldman in 2006
At LinkedIn, the business networking site Found patterns that predicted networks for a given profile Can be termed the first data scientist The term “data scientist” coined by D.J Patil and Jeff Hammerbacher in 2008
6
Professional Importance
Termed the ‘Sexiest’ job of the 21st Century by Harvard Business Review 2016’s hottest profession by Glassdoor, a recruiting site Garnered Glassdoor’s top “job score” ranking and "career opportunity" score. average salary is $118,709 versus $64,537 (programmer) McKinsey, a management consulting company estimates shortage of 140,000 – 190,000 data scientists by 2018
7
Continued… Facebook: $133,841 Apple: $149,963 Airbnb: $117,229
Data Scientist salaries at the hottest tech companies Facebook: $133,841 Apple: $149,963 Airbnb: $117,229 Twitter: $134,861 Microsoft: $119,129 LinkedIn: $138,798 IBM: $110,823
8
Data Science Education in Pakistan
IBM and Information Technology University (ITU) has joined hands to introduce data science education in the country. For the purpose, IBM will be providing services of software, academic content, and their systems. Data Science - It provides modern art computational and statistical techniques to extract business value from a rapidly expanding volume of data. fahad IBM and Information Technology University (ITU) has joined hands to introduce data science education in the country. For the purpose, IBM will be providing services of software, academic content, and their systems. According to the Express Tribune, data science is a new field for the students. It provides modern art computational and statistical techniques to extract business value from a rapidly expanding volume of data. Data Science Lab at the ITU was new in the field. The ITU has set up a 10-member team, which includes graduates from top universities across the world. Dr. Asim Karim, who is currently on sabbatical from the Lahore University of Management Sciences (LUMS), is also part of the lab. “IBM offers members of the academic initiative programme the opportunity to receive extended access to the IBM Bluemix and other cloud-based services; IBM Academic Initiative Programme worked with universities and corporations around the world to support work on data science and advanced analytics. For this reason, IBM is partnering with the Data Science Lab at the ITU,” stated an official from the IBM. The PITB and the ITU share the common leadership as Umar Saif, chairperson of the PITB, is also vice chancellor of the university. As a result, the Data Science Lab has access to the data of various departments across the province. The lab is also focused on development and support of the Citizen’s Feedback Model. ITU is also starting a lot of new programs, very soon. These include Nanotechnology programs, Mechanical Engineering, Material Sciences, Industrial Engineering, Automotive Engineering. The university is one of the top-ranked technology universities in Pakistan.
9
Top 5 Big Data Training Institutes in India
SimpliLearn - Introduction to Big Data & Hadoop. BigDataTraining.In - Hadoop Training from Bigdatatraining claims to the leading global talent corporation. Blue Ocean Learning - Big Data Hadoop Development training program. Jigsaw Academy - Wiley Big Data Specialist. iClass Bangalore - The Big Data Training. shoaib Top 5 Big Data Training Institutes in India These Big Data Training Institutes have been selected on the basis of curriculum, topics covered and the scope of opportunities it provides. 1. SimpliLearn :-Introduction to Big Data & Hadoop – This course covers all the modules of big data beginning by the introduction of hadoop and big data, hadoop architecture, MapReduce, R-Hadoop, PIG, HIVE, HBase, Mahout, ZooKeeper, Flume & Sqoop. This course is good enough to give a holistic overview of these tools. 2. BigDataTraining.In :- Hadoop Training from Bigdatatraining claims to the leading global talent corporation. The course provided here covers Hadoop v2 yarn, Microsoft HDInsights, Apache Solr, Apache Tez, Apache Storm, working on cloud servers powered by AWS Cloud and 15+ PoC projects. Furthermore, to ensure uninterrupted flow of knowledge, they provide a 24/7 technical support for escalation of queries. This training is being offered in modes such as weekends trainings, Fastrack Trainings and Online Trainings. 3. Blue Ocean Learning Big Data Hadoop Development training program: gives you a hands on experience on writing a map reduce program, common mapreduce algorithms, overview of hadoop, Sqoop and Flume, HBase, NoSQL and also provides some real time provides using hadoop to gain practical experience. 4. Jigsaw Academy (Wiley Big Data Specialist) : This course is provided by . This course is best suited for candidates keen to enter big data analytics industry and wants to build their career in data science. The course duration is 16 months ( 10 hours / week). This course emphasizes on big data Hadoop and its related modules using R programming. It covers the essentials of big data such as Hadoop, MapReduce, HDFS, Hive, Sqoop, Flume, Pig, Impala and other big data technologies. Apart from regular teaching, the candidate will also receive case studies, projects, assignments to get a practical sight of big data. 5. iClass Bangalore: The Big Data Training in Bangalore & Best IBM Big Data Hadoop training institute offers the course of Big Data Analytics with Data Science. It includes the introduction to big data covering topics such as data science, data analyst, role of daya scientist which is then supported by case studies across multiple domains ranging from finance, retail defense, research, healthcare etc. It also covers the stages of data mining and tools of data preparation briefly. It also cover the classification algorithms such as K means, hierarchical clustering, naive bayes etc and data visualization in R. In this article, I’ve covered the 5 best training courses in India on the basis of their curriculum. You can select these courses on the basis of your location and your choice. These courses will enable you with the necessary skills useful to take the first step in the field of data science and big data. The salary of big data is quite attractive. The entry level position earns way better than an IT professional with an experience of more than 1 year. If you think you are into the same shoes and thinking of making a career move, this is the right time. Grab any of the courses listed above and embark upon the journey of your transformation.
10
Data Science Summits Data Science Summit
is the premier event for data scientists and developers, learn from industry innovators and academic experts in data science, applied machine learning Held on July 12-13, 2016 in San Fransisco Data Science Summit by dsf (Data Science Foundation) (India) 3rd Annual International summit to be held on September 23, 2016 in Kolkata
11
Types of Data Scientist Jobs
Data Analyst Data Engineer Statistician General shoaib
12
A Data Scientist is a Data Analyst
Tasks include pulling data out of MySQL databases Becoming a master at Excel tables Producing basic data visualizations
13
Please Wrangle Our Data! (Data Engineer)
sets up the data infrastructure loves to play around with databases and large – scale processing systems
14
We Are Data. Data Is Us. (Statistician)
focuses on producing great data-driven products gets useful insights from data harvests the data and turns it into information and knowledge
15
Non-Data Companies Who Are Data-Driven (General)
A data scientist joins an established team of other data scientists generalists fills a specific niche where the team is lacking, such as data visualization or machine learning shoaib
16
Data Scientist Statistics (Globally)
Degrees producing the most data scientists Backgrounds of data scientists Top 10 countries with the most data scientists Companies that hire the most data scientists ahsan
20
ahsan
21
Skills Needed Basic Tools Basic Statistics Machine Learning
a statistical programming language, like R or Python, and a database querying language like SQL Basic Statistics a basic understanding of statistics, should be familiar with statistical tests, distributions, maximum likelihood estimators etc Machine Learning be familiar with machine learning methods, things like k-nearest neighbors, random forests, ensemble methods, understand when it is appropriate to use different techniques saeed
22
Continued… Multivariable Calculus and Linear Algebra Data Munging
basic multivariable calculus or linear algebra Data Munging how to deal with imperfections in data, missing values, inconsistent string formatting Data Visualization & Communication describing your findings or techniques to audiences, be familiar with data visualization tools like ggplot and d3.js
23
Continued… Software Engineering Thinking Like A Data Scientist
strong software engineering background Thinking Like A Data Scientist a (data-driven) problem solver
25
Facts & Figures According to marketsandmarkets.com, the advanced analytics market will be worth $29.53 Billion by 2019. Wired.com points to a report by Glassdoor that the average salary of a data scientist is $118,709. Randstand reports that pay hikes in the analytics industry is 50% more than IT The US alone will need 190,000 Data Scientists by 2018. Average salary for Data Scientists - $120,000 a year. According to marketsandmarkets.com, the advanced analytics market will be worth $29.53 Billion by Wired.com points to a report by Glassdoor that the average salary of a data scientist is $118,709 Randstand reports that pay hikes in the analytics industry is 50% more than IT
26
Summary “Data Scientist” is the pinnacle rank in an analytics organization. Glassdoor has ranked Data Scientist first in the 25 Best Jobs for 2016. Good data scientists are scarce and in great demand. As a data scientist you will be required to understand the business problem. And finally make recommendations. saeed “Data Scientist” is the pinnacle rank in an analytics organization. Glassdoor has ranked Data Scientist first in the 25 Best Jobs for Needless to say, good data scientists are scarce and in great demand. As a data scientist you will be required to understand the business problem, design the analysis, collect and format the required data, apply algorithms or techniques using the correct tools, and finally make recommendations backed by data.
27
Reference Searchbusinessanalytics - searchbusinessanalytics.techtarget.com/definition/Data-scientist SearchCIO - searchcio.techtarget.com/definition/data-science Mashable - mashable.com/2014/12/25/data-scientist/#.i1ecNQXhZqZ Data Science Summit - datascience-summit.com Import IO - import.io/post/from-masters-to-microsoft-7-charts-that-plot-the-path-of-todays-data-scientists saeed “Data Scientist” is the pinnacle rank in an analytics organization. Glassdoor has ranked Data Scientist first in the 25 Best Jobs for Needless to say, good data scientists are scarce and in great demand. As a data scientist you will be required to understand the business problem, design the analysis, collect and format the required data, apply algorithms or techniques using the correct tools, and finally make recommendations backed by data.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.