November 8th, 2017 Matthew Davis and John Fink


Do More with Digital Scholarship: Building Data Sets Using Social Media

What is a social media data set?
What is a data set? A data set is a collection of similar information, sharing a structure, that covers a fixed period of time. Examples:
Incoming student transcripts for the years 2010-2016
Medical records for a research trial
Records of bequests kept in a parish register or courthouse
What is a social media data set? A social media data set is a collection of records of entries on social media sites like Twitter or Facebook over a period of time. The structure comes from the internal structures the particular social media site uses to organize its data, generally exposed to the world through an API.

What is an API?
API stands for Application Programming Interface.
It is the set of rules that software programs follow in order to communicate with each other.
There are a lot of APIs out there, and if you’re going to work with computer programs and data you will be using more than one. You probably already are without realizing it.
For our purposes, we’re going to work primarily with Web APIs.

What’s a Web API?
A Web API is basically a set of commands that you use, via regular HTTP methods, to interact with a web service.
Disqus https://github.com/abhishekbanthia/Public-APIs#music-discovery
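To make "a set of commands sent over regular HTTP methods" concrete, here is a minimal Python sketch of the two halves of a typical Web API call: building the request URL and decoding the JSON that comes back. The address `https://api.example.com`, the `/posts` path, the parameter names, and the sample response are all made up for illustration and do not belong to any real service.

```python
import json
from urllib.parse import urlencode


def build_url(base, path, params):
    """Join a base address, a resource path, and query parameters into one URL."""
    return f"{base.rstrip('/')}/{path.lstrip('/')}?{urlencode(params)}"


def parse_body(body):
    """Decode the JSON text that most Web APIs send back."""
    return json.loads(body)


# Hypothetical endpoint and parameters, for illustration only.
url = build_url("https://api.example.com", "/posts", {"forum": "news", "limit": 5})
print(url)

# A made-up response body standing in for what a server might return.
sample = '{"posts": [{"id": 1, "message": "hello"}]}'
print(parse_body(sample)["posts"][0]["message"])
```

Against a real service, the URL would be sent with an HTTP GET (for example with `urllib.request.urlopen`), and the service's own documentation would define the paths and parameters.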

Some Caveats
Because you’re using a structure that the social media company has provided, you are limited to the information they’re willing to give you access to.
Social media companies make their money off of data. These APIs are designed for business purposes, not for academic use, and are structured accordingly.
Some APIs may not be publicly accessible or may be experimental. If you try to grab too much information from these, the company can and will shut you down.
Not all the data you retrieve will be valid! Social media companies have no incentive to filter fake accounts out of their results. The responsibility to verify the utility of the data you retrieve rests on you as the researcher. Be ready to do a lot of sifting.

Facebook (taken from https://towardsdatascience
To access the Facebook API outside of Facebook’s web interface, you have to have a developer’s token. This is because Facebook assumes you’re building an app to work with their service.
Go to developers.facebook.com and create an account there. You may find it takes you directly to the apps page. If this is the case, skip the next step.
Go to the “My apps” drop-down in the top right corner and select “add a new app”. Choose a display name and a category and then “Create App ID”.
Once you’ve created the account, go to developers.facebook.com/tools/explorer. You will see “Graph API Explorer” below “My Apps” in the top right corner.
From the “Graph API Explorer” drop-down, select your app.
Then, select “Tools and Support.”
Click on “Access Token Tool” or navigate to developers.facebook.com/tools/accesstoken.
Select “Debug” corresponding to “User Token.”
Go to “Extend Token Access.” This will ensure that your token does not expire every two hours.
Note that you may need to grant permissions on your application in order to get an access token.
Once you have an active token, navigate back to developers.facebook.com/tools/explorer
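Once you have a token, the same kind of request the Graph API Explorer makes can be assembled in a script. The sketch below is a rough illustration, not Facebook's official client: it builds an unversioned Graph API URL for a node such as `me`, passing the token as the `access_token` query parameter. The function names and the placeholder token are assumptions made for this example, and which fields you can actually read depends on the permissions granted to your app.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

GRAPH_ROOT = "https://graph.facebook.com"


def graph_url(node, fields, token):
    """Build a Graph API URL asking for the given fields of one node."""
    query = urlencode({"fields": ",".join(fields), "access_token": token})
    return f"{GRAPH_ROOT}/{node}?{query}"


def fetch(url):
    """Send the request and decode the JSON reply (needs a valid token and network access)."""
    with urlopen(url) as response:
        return json.load(response)


# "YOUR_TOKEN" is a placeholder; paste the token from the Access Token Tool instead.
print(graph_url("me", ["id", "name"], "YOUR_TOKEN"))
```

Calling `fetch(graph_url("me", ["id", "name"], token))` with a real token should return the same JSON you see in the Explorer, which you can then save to a .json file.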

Live demo: Graph API Explorer, https://developers.facebook.com/tools/explorer

Twitter
Twitter is an entirely different animal from Facebook in terms of what information is available and how it’s organized, but the way you gain access to collect it is much the same.
Navigate to https://apps.twitter.com/ and select “Create New App.” Note that Twitter requires you to have your phone registered with them to create developer tokens.
Fill out and submit the form.
Click on “Keys and Access Tokens” on the resulting page. These are what you need to access Twitter’s information via a tool.
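The “Keys and Access Tokens” page lists four values: a consumer key, a consumer secret, an access token, and an access token secret. One common way to keep them out of your scripts is to store them in environment variables and load them at run time. The sketch below assumes that approach; the variable names are hypothetical, chosen for this example rather than required by Twitter or any tool.

```python
import os

# Hypothetical environment variable names for the four Twitter credentials.
KEY_NAMES = ("CONSUMER_KEY", "CONSUMER_SECRET", "ACCESS_TOKEN", "ACCESS_TOKEN_SECRET")


def load_keys(env=os.environ):
    """Return the four credentials as a dict, or raise if any is missing or empty."""
    missing = [name for name in KEY_NAMES if not env.get(name)]
    if missing:
        raise KeyError("missing credentials: " + ", ".join(missing))
    return {name: env[name] for name in KEY_NAMES}


# Demonstration with made-up values instead of real secrets.
fake_env = {name: "example-value" for name in KEY_NAMES}
print(sorted(load_keys(fake_env)))
```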

Twarc
Twitter has its own internal tool, called Twurl (https://github.com/twitter/twurl) and written in Ruby, but the Twarc Python tool (https://github.com/DocNow/twarc) is more powerful and has been around longer.
Follow the instructions at the linked page (basically, run twarc configure after you’ve installed it) and put your consumer and access keys where indicated.
The GitHub page provides information on how to perform searches and export information to .json files, .html, and more.

Extracting your data
Once you’ve generated your data in Facebook, you can cut and paste it into a plain text file. Save this file with the extension .json.
If you are using twarc, you just need to add >FILENAME.json to your command and it will save the results in a .json file.
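As a starting point for working with the saved file, here is a small Python sketch that reads twarc output and counts tweets per account. It assumes the file holds one JSON object per line, which is how twarc writes its results, and that each tweet carries a `user` object with a `screen_name`. The sample tweets and the temporary file are invented so the example runs on its own; a file pasted from Facebook's Explorer is a single JSON document and would be read with one `json.load` call instead.

```python
import json
import os
import tempfile
from collections import Counter


def read_tweets(path):
    """Yield one tweet (a dict) per non-empty line of a line-oriented JSON file."""
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


def tweets_per_user(tweets):
    """Count how many tweets each screen name contributed."""
    return Counter(tweet["user"]["screen_name"] for tweet in tweets)


# Made-up sample data standing in for a real twarc export.
sample_lines = [
    json.dumps({"full_text": "first", "user": {"screen_name": "alice"}}),
    json.dumps({"full_text": "second", "user": {"screen_name": "bob"}}),
    json.dumps({"full_text": "third", "user": {"screen_name": "alice"}}),
]
with tempfile.NamedTemporaryFile("w", suffix=".json", delete=False, encoding="utf-8") as tmp:
    tmp.write("\n".join(sample_lines) + "\n")

counts = tweets_per_user(read_tweets(tmp.name))
print(counts.most_common())
os.remove(tmp.name)
```

Counting by account is one quick way to begin the sifting described under the caveats, since accounts with unusually high volumes are worth a closer look.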


Thank you!
Matthew Davis davism17@mcmaster.ca
John Fink jfink@mcmaster.ca