Presentation is loading. Please wait.

Presentation is loading. Please wait.

Team web space Local access Web Access

Similar presentations


Presentation on theme: "Team web space Local access Web Access"— Presentation transcript:

1

2 Team web space Local access Web Access
\\up.ist.local\TeamWebsites\ist402sp17\section2\Team01 \\up.ist.local\TeamWebsites\ist402sp17\section2\Team02 . \\up.ist.local\TeamWebsites\ist402sp17\section2\Team10 Web Access

3 Visualization of Trending

4 Topic Trending in Conferences

5

6

7

8 Trends Can be used to analyze Newspaper stories Emails
Online forum discussions Product review Microblogs Category change .

9 Streamgraph Requirement Benefits Issues
Categories, frequencies (weights), time Categories and frequencies Can be hard to get and need advanced algorithms E.g., topics of news articles, conference topics, Benefits Clear trending patterns Comparison among different categories/topics Issues Dominant shapes may be misleading.

10 Python 3 Extracting Online Data

11 Importance of Online Data
Data related to our first programming assignment Online data is dynamic. Online data could be voluminous. Online data is usually "well" structured. Can you extract online data easily and automatically?

12 Example: Penn State Crime Logs
Each record has a well-defined HTML structure. <div class="views-field views-field-title"> <span class="views-label views-label-title">Incident #: </span> <span class="field-content">PSU </span> </div> <div class="views-field views-field-field-occurred"> <span class="views-label views-label-field-occurred">Occurred: </span> <span class="field-content"><span class="date-display-single">03/15/2017 <div class="date-display-range"> <span class="date-display-start" property="dc:date" datatype="xsd:dateTime" content=" T02:42:00-04:00">2:42 AM </span> to <span class="date-display-end" property="dc:date" datatype="xsd:dateTime" content=" T03:05:00-04:00">3:05 AM</span> </div></span></span> </div> <div class="views-field views-field-field-location"> <span class="views-label views-label-field-location">Location: </span> <span class="field-content">Cunningham Hall</span> </div>

13 Our Goal Extract all Records from a Page and Save them to a CSV File.

14 Basic Idea Extract Information based on HTML Tags
We need a package to parse HTML codes. Extract individual categories Build a dataframe based on all data from all categories Export the dataframe to a CSV file.

15 Exercise Follow the exercise instruction \\up.ist.local\Courses\Spring2017\IST402\InClassExe rcisesResources\Week10_Python3


Download ppt "Team web space Local access Web Access"

Similar presentations


Ads by Google