A Powerful Python Library for Data Analysis BY BADRI PRUDHVI BADRI PRUDHVI.

Slides:



Advertisements
Similar presentations
I HAVE GAINED THIS FROM PHILOSOPHY : THAT I DO WITHOUT BEING COMMANDED WHAT OTHERS DO ONLY FROM FEAR OF THE LAW. -A RISTOTLE.
Advertisements

Little Used, but Powerful Features with GP Cathy Fregelette, CPA, PMP Practice Manager BroadPoint Technologies September 20, 2012.
PRE-SCHOOL QUANT WORKSHOP II R THROUGH EXCEL. NEW YORK TIMES INFOGRAPHICS GALARY The Jobless Rate for People Like You Home Prices in Selected Cities For.
Ann Arbor ASA ‘Up and Running’ Series: SPSS Prepared by volunteers of the Ann Arbor Chapter of the American Statistical Association, in cooperation with.
Sarah Reonomy OSCON 2014 ANALYZING DATA WITH PYTHON.
SADC Course in Statistics Producing Good Tables In Excel Module B2 Sessions 4 & 5.
Version 4 for Windows NEX T. Welcome to SphinxSurvey Version 4,4, the integrated solution for all your survey needs... Question list Questionnaire Design.
Presenter notes: This Microsoft Excel presentation is a prepackaged solution for basic Excel training. You may use the presentation as-is or customize.
XP New Perspectives on Microsoft Access 2002 Tutorial 71 Microsoft Access 2002 Tutorial 7 – Integrating Access With the Web and With Other Programs.
Working With Large Datasets in Corporate Settings Ed Bassin
Working with Tables Lesson 5 of Introduction to ArcGIS for Emergency Managers.
6-1 DATABASE FUNDAMENTALS Information is everywhere in an organization Information is stored in databases –Database – maintains information about various.
OFC304 Excel 2003 Overview: XML Support Joseph Chirilov Program Manager.
STATISTICS Microsoft Excel “Frequency Distribution”
1 Overview of Databases. 2 Content Databases Example: Access Structure Query language (SQL)
P366: Lecture #1 Use of Excel for analysis Lei Chen, MD Jan 6, 2002.
XP 1 Excel Tables Purpose of tables – Process data in a group – Used to facilitate calculations – Used to enhance readability of output Types of tables.
DAY 9: EXCEL CHAPTER 6 Tazin Afrin September 17,
Populating an XML instance document with data from Excel 1.Create an instance document skeleton containing at least 2 elements (with attribute tags) 2.Import.
CPS120: Introduction to Computer Science Lecture 19 Introduction to SQL.
United Nations Educational, Scientific and Cultural Organization IDAMS Internationally Developed Data Analysis and Management Software package 2004.
R Brown-Bag Seminar 2: Hands-on Antony Karanja Ndungu Research Methods Group-ICRAF.
Why use a Database B8 B8 1.
 Integrated set of data files  allows easy cross-referencing of related data items  promotes integrity  eliminates redundancy.
United Nations Economic Commission for Europe Statistical Division The Importance of Databases in the Dissemination Process Steven Vale, UNECE.
Excel: Pivot Tables Exploring Computer Science Lesson Supplemental.
XML Databases by Sebastian Graf Hier beginnt mein toller Vortrag.
XP. Objectives Sort data and filter data Summarize an Excel table Insert subtotals into a range of data Outline buttons to show or hide details Create.
By N.Gopinath AP/CSE Cognos Impromptu. What is Impromptu? Impromptu is an interactive database reporting tool. It allows Power Users to query data without.
Microsoft Excel Spreadsheet Software
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
CSC350: Learning Management Systems COMSATS Institute of Information Technology (Virtual Campus)
Comparison of different output options from Stata
Database Management System. DBMS A software package that allows users to create, retrieve and modify databases. A database is a collection of related.
DATA MINING Pandas. Python Data Analysis Library A library for data analysis of (mostly) tabular data Gives capabilities similar to Excel and SQL but.
Tables Microsoft Word Tables A table is a grid of rows and columns that intersect to form cells. A table is a grid of rows and columns that intersect.
Tables Microsoft Word Tables A is a grid of rows and columns that intersect to form cells. A is a grid of rows and columns that intersect to form cells.
PREPARED BY: PN. SITI HADIJAH BINTI NORSANI. LEARNING OUTCOMES: Upon completion of this course, students should be able to: 1. Understand the structure.
Analytics Plus Product Overview. Introduction Analytics Plus is a self-service Business Intelligence and advanced analytics software. On-premise reporting.
Week 5 – Nov 4, 2015 Data Analysis. Class today Last week & Homework review SQL: Purpose and functions Data Cleaning Pivot Tables, Power Pivots and Power.
PivotTables Use a PivotTable report to analyze and summarize your data. MAN Micro-computers & Their Applications.
By Martha Nelson Digital Learning Specialist Excel Pivot Tables.
The Power of Excel for Ad Hoc Reporting FAB 5013 Brian Caviggiola Controller - Indiana Wesleyan University Mike Ionno Senior.
IT for quantitative information Working with Spreadsheets 1.
GROUP GOAL Learn and understand python programing language Libraries: Pandas Numpy SKlearn Use machine learning algorithms Decision trees Random Forests.
Python for data analysis Prakhar Amlathe Utah State University
AP CSP: Cleaning Data & Creating Summary Tables
Add More Zing to your Dashboards – Creating Zing Plot Gadgets
Exploring Computer Science Lesson Supplemental
Tutorial 5: Working with Excel Tables, PivotTables, and PivotCharts
Microsoft Office Illustrated
By Dr. Madhukar H. Dalvi Nagindas Khandwala college
Lesson 13 - Cleaning Data Lesson 14 - Creating Summary Tables
Data Tables, Indexes, pandas
Python for Quant Finance
Advanced Excel Helen Mills OME-RESA.
Tutorial 8 Objectives Continue presenting methods to import data into Access, export data from Access, link applications with data stored in Access, and.
Lecture 12: Data Wrangling
Microsoft Excel PowerPivot & Power View
Spss.
Analytics Plus Product Overview 1.
1.
Analytics: Its More than Just Modeling
Introduction To Structured Query Language (SQL)
商業智慧實務 Practices of Business Intelligence
Tutorial 7 – Integrating Access With the Web and With Other Programs
Donald Donais Minnesota SharePoint Users Group – April 2019
Python for Data Analysis
Collecting, Analyzing, and Visualizing Data with Python Part I
Assignment resource Working with Excel Tables, PivotTables, and Pivot Charts Fairhurst pp The commands on these slides work with the Week 2 Excel.
Presentation transcript:

A Powerful Python Library for Data Analysis BY BADRI PRUDHVI BADRI PRUDHVI.

Contents Contents:  Introduction  Key Features  Core Operations  Sample Dataset Analysis in Pandas

Introduction Why Learn Pandas?  If you like Python…It’s a better Python.  It’s a smoother path than raw numpy  Very easy to do Data Analysis Why do you need pandas?  When working with tabular or structured data (like R data frame, SQL table, Excel spreadsheet,...):  Import data  Clean up messy data  Explore data, gain insight into data  Process and prepare your data for analysis  Analyze your data (together with statsmodels,...)

Introduction   Software library written for the Python programming language.  Mainly used for data manipulation and analysis.  Core Data Structures:  Series  Data Frames.  Offers data structures and operations for manipulating :  Numerical tables  Time series.

Key features  Data Frame object for data manipulation with integrated indexing  Fast, easy and flexible input/output for a lot of different data formats  Merging and joining (concat, join)  Powerful time series manipulation (resampling, timezones,..)  Easy plotting  Data alignment  integrated handling of missing data  Reshaping and pivoting of data sets  Label-based slicing, fancy indexing, and sub setting of large data sets

Key features (Contd..)  Data structure column insertion and deletion  Grouping : groupby functionality  Hierarchical axis indexing to work with high-dimensional data in a lower-dimensional data structure  Time series-functionality :  Date range generation,  Frequency conversion,  Moving window statistics,  Moving window linear regressions,  Date shifting and lagging

Core Operations  Create  Select  Insert  Map  Join  Sort  Clean  Bin  View  Update  Filter  Append  Group  Summarize  Conform  Rotate

Let’s see some Cool stuff !!! Analyzing Sample Data using Pandas in Python

Thanks for listening…. Any Questions?