Download presentation
Presentation is loading. Please wait.
1
Excel for data presentation
© Susan Mowers Spring 2016
2
HANDS-ON OPEN PRESENTATION:
Save to your desktop under a folder called “EXCEL”
3
Overview Working with GSS, related crime and Census data
Descriptive statistics 101 – Bring it home!!! Introducing Excel as your data transformer Elements of an Excel transformation Reference questions Crime and victimization profiles (any topic*) *Could include industry / economic profiles Population profiles (GSS today, and touching on the Census/NHS) Data linking – neighbourhood, population effects on outcomes Practice exercises for descriptive statistics: Extract, organize, manipulate, present (and archive) data Joining new variables at the community level Pros and cons of multiple worksheets More Excel magic … macros, pivot tables, colectica, preserve Excel for data sharing, Quantitative human data is very likely to include qualitative data. Often self reported. Quantitative analysis has two aspects: Descriptive statistics (Description or summarization of or showing the data at hand), E.g., According to the data from the 2006 Census, women are twice than men as likely to be single parents than men if they live in Ontario. Inferential analysis is about predictions.
4
Learning outcomes Practice using Excel for your Beyond 20/20, Odesi and other STC outputs – Excel is more than cross-tabulations and pie-charts! Learn how to do basic data transformations for presentations, ouputing or visualisations Learn how researchers study neighbourhood/social effect (data joinining with Excel) Awareness of and best practices for data labeling Capture your files and work –for working effectively and being able to back up your research results
5
Introducing Excel Benefits of Excel? Universality
Most students have Excel. (Note: latest version supports over a million rows) Some professors support it Many Excel add-ons: like Haver, colectica, and even other people’s Excel macro workbooks Many Excel tutorials and fairly intuitive! Excel supports labels, portable/open output file formats, many others, like .dbf for GIS!!! Go further with GitHub, macros (and shareable NEW macro workbooks.xlsm see [LINK] tricky – archive your visual basic instead), and/or, colectica for best research practices Excel preserves data labels Exploit good and well-labeled data!!! WE ARE DOING THIS … !! SPSS = graphical – pull-down menus + syntax code to re-run … you and your supervisor or instructor or readers of your research articles can re-run your statistics Research replication Non-SPSS file formats ? Conversion options Can you import or transfer to SPSS .sav ? SPSS supports some « codebook information » Non-SPSS file formats? Conversion! Check to see if the original file has data labels? Note that .sav is a standard SPSS data file. .por is a portable SPSS file Labels are essential for your descriptive statistics which is partly about showing your statistics, either through statistical tables or in graphs!
6
OUR Research question # 1:
Have property crime rates increased relative to violent crime in Ottawa in the past 15 years?
7
Crime tables … Uniform crime survey
CANSIM Extract to B20/20 Crime statistics, by detailed offences, annual (Number) *Terminated*, 1977 to Description | StatCan page [LINK] Incident-based crime statistics, by detailed violations and police services, Ontario, annual (Number), to 2015 Description | StatCan page [LINK] There is also an archived version on Odesi. Notes: Citation on the StatCan page documentation (remember File Summary) Useful to look at these detailed versions to see what detailed crimes fall under property crime and violent crime, and which are excluded. These tables always break down We will only take the Rate per 10000 population Follow Download tab in CANSIM Extract as per question (I added Kingston) and save B2020 original, extract and Excel extract This is B2020’s default chart:
8
HANDS-ON Download and save Excel CANSIM table extract xls [LINK] Prepare your columns and rows for graphing: One header row only, one header column (years) followed by data Start Line Graph Modify your Line Graph with a chart title, axis labels and grid lines,
9
Let’s create a line graph
Delete rows 2 and 5 Copy and paste to row 21: Rate per 100,000 population Delete row 1 Create one header row, e.g.: Do you have only one header row, then data? Good. Select A1-C17 Do a line graph: Click on Insert Under Chart Tools to top right, you will see the Design tab is activated Click on Add Chart Element on far left and add axis titles and chart titles. You may want to change styles as well. You can do other things. You can graph year of year change, calculate 15 year change, (see [LINK], graph recession lines Annual
10
OUR Research question 2:
Does getting older have a positive correlation on the perception that the local police treat people fairly for all Canadians born in Canada versus visible minorities born outside Canada?
11
HANDS-ON (VERY QUICK) Open Beyond 20/20 CANSIM table extract xls [LINK] Prepare your data to be GIS ready Move Geography to rows and change label to geocode
12
Crime / Victimization data … Extract statistics from Odesi
Public-use microdata file Create custom tabulation in Odesi Statistics Canada. (2016). General social survey (GSS), 2013, cycle 28, Victimization - Safety [Public use microdata file]. Ottawa, Ontario: Statistics Canada [producer], Ottawa, Ontario: Data Liberation Initiative, Statistics Canada [distributor]. Retrieved from Why might RTRA be even more interesting for our question? Follow Download tab in CANSIM Extract as per question (I added Kingston) and save B2020 original, extract and Excel extract This is B2020’s default chart:
13
Hands-on Do a bar graph of your percentage table &
Open Odesi Excel table [LINK] In the yellow highlighted columns … Calculate Percentages (note, you will first need to total all repondents born in Canada Do a bar graph of your percentage table & Insert your frequencies as labels
14
DATA MANIPULATIONS THROUGH CALCULATIONS
Start by downloading an Odesi tabulation with raw numbers Create Bar graphs to show the relationship between two categorical variables We will first do our re-code here for all respondents born in Canada using the “sum” feature for columns B and E Then we will calculate percentages in the yellow highlighted columns, as =X job/total Copy and paste special percentages (values only) to green area Insert the total population size in the labels in red Do a bar graph
15
HANDS-ON (VERY QUICK) Open Excel file with postal code related data [LINK] As time permits, open a new worksheet and do the following join
16
Table join for social / neighbourhood effect research
17
Where to start with Data? REverse of Data Lifecycle
18
Ease, speed and functionality of the applications
Managing your data for multiple applications Strong data viewer Visualization potential, e.g., mapping and many more Retain full wording of variables Organize your data for great tables and charts! Intended for about numbers or less (and with short labels) Beyond 20/20 Low High Excel Low to medium Odesi Medium High (through export) Very customizable © Susan Mowers 2016
19
Interoperability
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.