 Problem:  How to discover the latent structure in unstructured data (e.g. Wikipedia articles).  Objective:  Improve the ways people explore and analyze.

Slides:



Advertisements
Similar presentations
Student Research Center & eLibrary Created by: Tisha A. Tytar Oakwood High School Fall 2008.
Advertisements

Presentation at Society of The Query conference, Amsterdam November 13-14, 2009 (original title: Learning from Google: software design as a methodology.
This tutorial is designed to take you through the features and content of Oxford African American Studies Center. Please click "Start the Tour" below for.
Ask a Question What are we testing? Step 1. Step 2 Research your topic. Look at the past work of others to see if this problem has already been tested.
National Geographic Magazine Archive, What is the National Geographic Magazine Archive? Complete archive from of the world’s best-
Projections & Coordinate Systems James Payne Morongo Band of Mission Indians.
Joint Sentiment/Topic Model for Sentiment Analysis Chenghua Lin & Yulan He CIKM09.
"Don't Forget the Salt!" Aquarius Education and Public Outreach Annette deCharon University of Maine.
The West Virginia GeoExplorer Project is located at: You can try everything you see here, and more, on this.
1 SIMS 247: Information Visualization and Presentation Marti Hearst Oct 19, 2005.
Memoplex Browser: Searching and Browsing in Semantic Networks CPSC 533C - Project Update Yoel Lanir.
Search on Journal of Dairy Science ® An Overview April
The Geographer’s Basic Tools. A map is a representation of the earth features drawn on a flat service. Maps use symbols and colours to represent the features.
Managing references : Mendeley
From VS C# 2010 Programming, John Allwork 1 VS2010 C# Programming - DB intro 1 Topics – Database Relational - linked tables SQL ADO.NET objects Referencing.
Change-Link: A Digital Forensic Tool for Visualizing Changes to Directory Trees Timothy R. Leschke, M.S. Doctoral Student Alan T. Sherman,
2-1 Relations and Functions
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
Causality Project Team 1 David Conley Vijay Hattiangadi Byung Lee Jennifer Stoneking.
*Chapter One: What is Footnote?* Footnote allows people to find and share over 70 million historical documents Use the search engine to explore documents.
Colorado’s Historic Newspaper Collection Presented by Mary J. Johnson 10:55-11:10 A.M.
Representing Relations By Ajhane Foster And MarTricesa Carter.
Tutor: Prof. A. Taleb-Bendiab Contact: Telephone: +44 (0) CMPDLLM002 Research Methods Lecture 9: Quantitative.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
1 Sunbelt, 2/18/05 Interactive Visualizations to Explore Dynamic Network Data Jim Blythe USC Info Sciences Institute Cathleen McGrath Loyola Marymount.
Introduction to Remote Sensing. Outline What is remote sensing? The electromagnetic spectrum (EMS) The four resolutions Image Classification Incorporation.
1 T-Scroll: Visualizing Trends in a Time-Series of Documents for Interactive User Exploration Yoshiharu Ishikawa and Mikine Hasegawa Nagoya University,
1 EviCare (NLP) Aid clinical work in hospitals using NLP: “Summarize” health records Select and rank recommendations from clinical practice guidelines.
CaLP RCWG – 28 th January 2015 Start time : 3 – 5pm (Thailand) Presenters: Rebecca Vo – CaLP Asia Regional Focal Point Wadel S. Cabrera III – CaLP Asia.
© Copyright 2008 STI INNSBRUCK Media Meets Semantic Web – How the BBC Uses DBpedia and Linked Data to Make Connections.
© 2010 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Designing the User Interface: Strategies for Effective Human-Computer.
Online Databases for Academic Libraries EBSCO Publishing Academic Databases Business Databases Medical Databases Linking Services VISUAL SEARCH.
EBSCOhost 2.0 GOLD/GALILEO ANNUAL USERS GROUP CONFERENCE August 1, 2008.
*Note: If you would like to view the transcript of the audio, click Notes in the upper right section of the screen. Main Window The Training Interface.
Lesson 1: Exploring Access Learning Objectives After studying this lesson, you will be able to: Start Access and identify elements of the application.
Data Mining By Dave Maung.
Exploring GIS concepts. Introduction to ArcGIS I (for ArcView 8, ArcEditor 8, and ArcInfo 8) Copyright © 2000–2003 ESRI. All rights reserved. 2-2 Organizing.
Part II: Business environment analysis with ESRI Business Analyst Desktop Getting to Know ESRI Business Analyst Fred L. Miller, PhD Murray State University.
Numbers in Science. Why Do We Collect Data? We collect data to analyze test results, calculate averages, compare our data with other sets of data, and.
2015/12/121 Extracting Key Terms From Noisy and Multi-theme Documents Maria Grineva, Maxim Grinev and Dmitry Lizorkin Proceeding of the 18th International.
Final Project Current Event Debate Directions Report on an issue that has been debated in the last 10 years. Describe what caused the controversy. Be.
A Generalized Architecture for Bookmark and Replay Techniques Thesis Proposal By Napassaporn Likhitsajjakul.
TWC Illuminate Knowledge Elements in Geoscience Literature Xiaogang (Marshall) Ma, Jin Guang Zheng, Han Wang, Peter Fox Tetherless World Constellation.
Report management basics FLL July Report management basics Overview of all modules Chart Reviews Reports Review areas Word  Consent forms  Patient.
Using the New ECRI Institute Website Healthcare Risk Control ECRI INSTITUTE 2014 Corporate Presentation.
Generating Summaries from FOT Data ITS World Congress, Detroit 2014 Dr. Sami Koskinen, VTT
Gabriel Rodriguez. Google Earth is open source software. It is an interactive map with drawing, measuring, data layers, images, and Wikipedia articles.
ISI Web of Knowledge update: October What’s New? Conference Proceedings Citation Indexes now in Web of Science –Two editions – Science and Social.
Using GIS Technology for Emission Inventory and Air Quality Applications Prepared by: Tami H. Funk Lyle R. Chinkin Sonoma Technology, Inc. Petaluma, CA.
Data Visualization as a Tool for Communicating Ocean Science Rob Bochenek Information Architect Axiom Consulting & Design.
Data -Data is the raw materials from which information is generated. -Data are raw facts or observations typically about physical phenomena or business.
Article Detail. Authors Article Update & Change Notes Notes: New field in CTD to allow for the authors to add change/update notes to articles to inform.
Introduction to STELLA modeling Tutorials are at loads/tutorials/ModelBuilding.aspx.
Allison Nichols, Ed.D. Evaluation Specialist.  In this workshop we'll explore creating an online survey using Google Documents. You don't need to buy.
The Spotlight is on Type Wells. 2 WHAT IS IT? Users can analyze analogous well production, create a type profile for modelling new drills, and create.
How a topographical map is made.
Introduction to Wikipedia & Wikipedia assignment
For basic Internet searches for news articles or interviews with the person you are researching, try Bing &/or Google. News search will help you find where.
Data Warehousing and Data Mining
Extra credit #1 – swim lane
NON-FICTION UNIT 5th Grade
Dissemination with GIS
Extra credit #1 – swim lane
國立雲林科技大學 National Yunlin University of Science and Technology
[Organization Name]:[insert]Collaborative
Touring Capitol Park Sacramento, CA
From Unstructured Text to StructureD Data
Automating Student Yield Data Extraction
Wikipedia Is Good for You?!
Understanding the values of a good thematic map
Presentation transcript:

 Problem:  How to discover the latent structure in unstructured data (e.g. Wikipedia articles).  Objective:  Improve the ways people explore and analyze hundreds of thousands Wikipedia articles.  Methods (cont.) :  Extract topics that summarize 189,000 Wikipedia articles  Extract time and location from the articles  Visually present the results in a meaningful way.

Collect Data Extract Topic Summaries Extract Location and Date Visualize Results

Map View Views can include: Imagery Topographic Street layer Oceanic

Support View Tabs can show: Separate map Link to Wikipedia article List of searchable topics

Timeline Navigation slider shows user’s range Graph shows number of articles on topic at the time

 Havre et al. “ThemeRiver Visualizing Thematic Changes in large Document Collections”  Specialty: Large amounts of data over time.

 Incorporating annotations Birth of Christ (0 AD) Conquest of Dacia (106 AD)

 With annotations in the proper coordinates

 With annotations displayed

 Potential data applications include:  Military  Corporate  Government  Medical  Consumer  Try for yourself at: 