Update By: Brian Klug, Li Fan Presentation Overview: API we plan to use (Syntax and commands) Obtainable Data Types (Location, Text, Time, User, Reply)

Slides:



Advertisements
Similar presentations
Social Media Marketing Mark O'Leary -
Advertisements

Connecting Social Content Services using FOAF, RDF and REST Leigh Dodds, Engineering Manager, Ingenta Amsterdam, May 2005.
Overview of Twitter API Nathan Liu. Twitter API Essentials Twitter API is a Representational State Transfer(REST) style web services exposed over HTTP(S).
RED HAT PRODUCT FOCUS 3/17/14 – 3/28/14 INTRODUCTION Our Product Focus for the next two weeks is Red Hat. Red Hat is a maker and distributor of enterprise.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 8: Monitoring the Network Connecting Networks.
Software Frame Simulator (SFS) Technion CS Computer Communications Lab (236340) in cooperation with ECI telecom Uri Ferri & Ynon Cohen January 2007.
Web 2.0: Concepts and Applications 5 Connecting People.
Web 2.0: Concepts and Applications 5 Connecting People.
Copyright 2004 Monash University IMS5401 Web-based Systems Development Topic 2: Elements of the Web (g) Interactivity.
Using XML files as real corpora making an XML database with the dbXML program
Interprocess Communications
Session 6 Server-side programming - ASP. An ASP page is an HTML page interspersed with server-side code. The.ASP extension instead of.HTM denotes server-side.
Wireless LAN Topology Visualiser Project Supervisor: Dr Arkady Zaslavsky Project Team Members: Jignesh Rambhia Robert Mark Bram Tejas Magia.
21 21 Web Content Management Architectures Vagan Terziyan MIT Department, University of Jyvaskyla, AI Department, Kharkov National University of Radioelectronics.
| Basel Discovering Windows Azure Mobile Services and Media Services Ken Casada
Xpantrac connection with IDEAL Sloane Neidig, Samantha Johnson, David Cabrera, Erika Hoffman CS /6/2014.
Networking Nasrullah. Input stream Most clients will use input streams that read data from the file system (FileInputStream), the network (getInputStream()/getInputStream()),
CLARIN tools for workflows Overview. Objective of this document  Determine which are the responsibilities of the different components of CLARIN workflows.
JavaScript & jQuery the missing manual Chapter 11
Software Quality Assurance QA Engineering, Testing, Bug Tracking, Test Automation Software University Technical Trainers SoftUni Team.
Input for the Bayesian Phylogenetic Workflow All Input values could be loaded as text file or typing directly. Only for the multifasta file is advised.
Is Apache CouchDB for you?
CNIT 133 Interactive Web Pags – JavaScript and AJAX JavaScript Environment.
Twitter.  Twitter is a social networking and micro-blogging service that enables its users to send and read other user’s updates.
Adrian Crenshaw. Darknets  There are many definitions, but mine is “anonymizing private networks ”  Use of encryption.
1 In the good old days... Years ago… the WWW was made up of (mostly) static documents. –Each URL corresponded to a single file stored on some hard disk.
| nectar.org.au NECTAR TRAINING Module 10 Beyond the Dashboard.
A/WWW Enterprises15 July 1996 Implementing Queries with HTTP A. Warnock A/WWW Enterprises
CH1. Hardware: CPU: Ex: compute server (executes processor-intensive applications for clients), Other servers, such as file servers, do some computation.
ILDG Middleware Status Chip Watson ILDG-6 Workshop May 12, 2005.
Agenda Getting Started: Using Unix Unix Structure / Features Elements of the Unix Philosophy Unix Command Structure Command Line Editing Online Unix Command.
New perfSonar Dashboard Andy Lake, Tom Wlodek. What is the dashboard? I assume that everybody is familiar with the “old dashboard”:
Software Status Sonja Vrcic Socorro,
MAKANI ANDROID APPLICATION Prepared by: Asma’ Hamayel Alaa Shaheen.
Online Conference June 17 th and 18 th SharePoint Online Administration with PowerShell.
Twitter 101. What is Twitter? Twitter is a social networking and micro-blogging service that enables its users to send and read other user’s updates.
WWW: an Internet application Bill Chu. © Bei-Tseng Chu Aug 2000 WWW Web and HTTP WWW web is an interconnected information servers each server maintains.
Interprocess Communications
Social Media Primer. Social Media is Great For: Building awareness and attracting new business Fostering community Providing helpful content and information.
Web Server.
| nectar.org.au NECTAR TRAINING Module 10 Beyond the Dashboard.
ICM – API Server & Forms Gary Ratcliffe.
TAXII SC Call Agenda Administrivia Month Behind Discussion Month Ahead.
How to combine IRIS products Available APIs Examples of integrations Ole Andersen Senior Strategic Account Manager.
New Generation University Faculty of Computer Science Chapter Five: File Uploaded and Ad Rotate Lecturer: Mukhtar Mohamed Ali “Hakaale”
AJAX and REST. Slide 2 What is AJAX? It’s an acronym for Asynchronous JavaScript and XML Although requests need not be asynchronous It’s not really a.
IPS Infrastructure Technological Overview of Work Done.
BIF713 Introduction to Linux. Agenda Getting Started: Using Linux Unix and Linux - Structure / Features Elements of the Linux Philosophy Linux Command.
Discussion of OCP/SMTP profile and some Use cases Presented by Abbie Barbir
Server-side http General form of http response/request GET request method POST request method Responses Servlet support.
Distributed Computing, M. L. Liu 1 Interprocess Communications Mei-Ling L. Liu.
Dr. Adil Yousif University of Alneelian – Master of CS - IT Electronic Mail.
Stata tweets and other API libraries: a practical guide William Matsuoka Stata Conference Chicago, IL - July 28, 2016.
Gridpp37 – 31/08/2016 George Ryall David Meredith
Network Controllable MP3 Player
The Client-Server Model
WWU Hackathon May 6 & 7.
Content from Python Docs.
Node.js Express Web Applications
Data Virtualization Tutorial… OAuth Example using Google Sheets
COMP2322 Lab 2 HTTP Steven Lee Feb. 8, 2017.
Node.js Express Web Services
All about social networking
Software Quality Assurance
NACE Committee Workspace (NCW)Training Workspace Features Navigation
Challenges in Network Troubleshooting In big scale networks, when an issue like latency or packet drops occur its very hard sometimes to pinpoint.
Hydrographic Data as a Service
Network Controllable MP3 Player
Presentation transcript:

Update By: Brian Klug, Li Fan Presentation Overview: API we plan to use (Syntax and commands) Obtainable Data Types (Location, Text, Time, User, Reply) Infrastructure (Hardware, Storage Req’s, Design) Tentative Work Plan (Timeline and Schedule)

Update Enables near-real time access to a subset of public Twitter statuses. –Currently in alpha test –Access to further restricted resources is extremely limited and granted only after acceptance of an additional TOS document. We have applied for credentials which grant us access to these increased resources (namely a larger sampling, more statuses) – Features of streaming API –Continual connection that streams statuses over HTTP. Opened indefinitely and only requires basic authentication for the most basic level –Output data is in XML or JSON formats, both of which are easy to parse. –Can focus on certain tracking predicates that, when specific enough, return all occurrences in full Firehose stream EG "track=basketball,football,baseball,footy,soccer". Execute: curl -uAnyTwitterUser:Password 2 API: Streaming API

Update 3 Streaming API data Example data: Can you bring the script tomorrow? We can write in the APE if you're not busy.","favorited":false,"in_reply_to_screen_name":"FreedomProject","source":" TweetDeck ","created_at":"Fri Nov 20 06:37: ","in_reply_to_user_id": ,"in_reply_to_status_id": ,"geo":null,"user":{"favourites_count":0,"ve rified":false,"notifications":null,"profile_text_color":"34da43","time_zone":"Tijuana","profile_link_color":"e98907","descri ption":"I'm a Robot created in Mexican soil, therefore my name is Mexican Robot","profile_background_image_url":" c10de6ac70ef2f637f8f62f26.jpg","created_at":"Mon Dec 22 07:34: ","profile_sidebar_fill_color":"b03636","profile_background_tile":false,"location":"Surfin' tubular Innernet waves","following":null,"profile_sidebar_border_color":"050e61","protected":false,"profile_image_url":" om/profile_images/ /jessicaavvy_normal.png","statuses_count":946,"followers_count":59,"name":"Mexican Robot","friends_count":173,"screen_name":"MexicanRobot","id": ,"geo_enabled":false,"utc_offset": ,"profile_background_color":"000000","url":" Data Classes: Who the message is in response to, if anyone Client user agent Location tagged geo-aware data, if any Time of creation and time zone of poster Information about avatar, background, profile User metrics: Statuses posted, Followers, Friends User description: short user-defined string

Update Streaming API expected volume: 3-4 million entries/day Storage Consideration: –Average total JSON example output size: ~1400 characters –Messages are UTF-8, we’ll assume most are 1 byte –1400 msg/day * 1 byte * 3.5 million = 4.56 gigabytes/day –1 year ~ 1.6 terabytes Currently working on getting at least one server running Ubuntu Server in a VM to begin downloading data –May require additional public IP addresses depending on rate limits, additional servers depending on load Download first, parse later 4 Infrastructure

Update Work Plan –Continue investigating using RSS to download status updates from far in the past beyond the 15,000 we are allowed to go back using the streaming API –1-2 weeks: test our environment and make sure everything is working well Make sure our methodology for downloading from the stream is resistant to Twitter downtime as features are rolled in and out of the alpha test Await possible response from Twitter regarding access to additional restricted resources (even higher rate firehose) –2 weeks to explore how to parse the content into a DB, whether this can be realistically done real time in another process. –Additional time for data mining, research topics, e.t.c. 5 Tentative Timeline