CSE5544 Final Project Interactive Visualization Tool(s) for IEEE Vis Publication Exploration and Analysis Team Name: Publication Miner Team Members:

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

eClassifier: Tool for Taxonomies
RSP Summer School14-16 September 2009 UK Institutional Repository Search: a collaborative project to showcase UK research output through advanced discovery.
Chapter 5: Introduction to Information Retrieval
Visual Analytics Research at WPI Dr. Matthew Ward and Dr. Elke Rundensteiner Computer Science Department.
1 Presented by Jean-Daniel Fekete. 2  Motivation  Mélange [Elmqvist 2008] Multiple Focus Regions.
Fishery View Project Team: Ying Zhang, Lan Wu Improvement of Time Series Line Chart Visualization of Fishery Data.
HDDVis: An Interactive Tool for High Dimensional Data Visualization by Mingyue Tan April 21st, 2004.
Project 6: Usability and Utility of a Network Visualization of Document Topics Initial Presentation Gill, Amandeep Singh Gulati, Vrishti Mesple, Eric Yang,
Tuple – InfoVis Publication Browser CS533 Project Presentation by Alex Gukov.
Memoplex Browser: Searching and Browsing in Semantic Networks CPSC 533C - Project Update Yoel Lanir.
Best Practices Using Enterprise Search Technology Aurelien Dubot Consultant – Media and Entertainment, Fast Search & Transfer (FAST) British Computer Society.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Blaz Fortuna, Marko Grobelnik, Dunja Mladenic Jozef Stefan Institute ONTOGEN SEMI-AUTOMATIC ONTOLOGY EDITOR.
Custom driven scientific information extraction from digital libraries using integrated text mining services Betim Çiço, Adrian Besimi, Visar Shehu 14th.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
© 2010 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Designing the User Interface: Strategies for Effective Human-Computer.
© 2010 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Designing the User Interface: Strategies for Effective Human-Computer.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
Media Arts and Technology Graduate Program UC Santa Barbara MAT 259 Visualizing Information Winter 2006George Legrady1 MAT 259 Visualizing Information.
CS3041 – Final week Today: Searching and Visualization Friday: Software tools –Study guide distributed (in class only) Monday: Social Imps –Study guide.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
David Herring NOAA Climate Program Office May 28, 2013 NOAA Climate.gov A brief overview and highlights of what’s new.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Building a Multi-Year Database of AAG Conference Abstracts André Skupin /Shujing Shu Dept. of Geography / Dept. of Computer Science University of New Orleans.
Information Retrieval in Practice
Designing for the Next Generation of Engineers
Browse Content by Subfield
Clustering of Web pages
CSE5544 Final Project Proposal
DataNet Collaboration
Advanced Scientific Visualization
CSE5544 Final Project Interactive Visualization Tool(s) for IEEE Vis Publication Exploration and Analysis Team Name: Publication Miner Team Members:
Implementing Knowledge Management in Organization
Natural Language Processing (NLP)
CSC420 Showing Complex Data.
Personalized Social Image Recommendation
Cartography and Labeling
A Path of Learning and Improvement
What to Look for Mathematics Grade 5
Professor John Canny Fall 2001 Nov 29, 2001
Visualization of Web Search Results in 3D
Explore. Discover. Focus.
The GEO DAB possible contributions
Multi-Dimensional Data Visualization
Principals’ Session Approaching Research: Module 10.3
Document Visualization at UMBC
Data Warehousing and Data Mining
Disseminating regional and urban statistics The new visualisation tool of Eurostat Teodora Brandmüller Unit E4 Regional statistics and geographical information.
TDM=Text Mining “automated processing of large amounts of structured digital textual content for purposes of information retrieval, extraction, interpretation.
حيـــم الر حمن الر الله بســـم.
WIS Strategy – WIS 2.0 Submitted by: Matteo Dell’Acqua(CBS) (Doc 5b)
CSE 635 Multimedia Information Retrieval
Introduction to Visual Analytics
Automating Profitable Growth™
Magnet & /facet Zheng Liang
CHAPTER 7: Information Visualization
27 June 2018 Odeya Cohen (BGU) & Jonas Hermelin (FOI)
Natural Language Processing (NLP)
Anatomy of a modern data-driven content product
Data mining Data mining is the process of analyzing data from different perspectives and summarizing it into useful information.
Andrew Hendrickson & Brian Embley
CS565: Intelligent Systems and Interfaces
Lab 2: Information Retrieval
Template for methodological application
Presented By: Grant Glass
Introduction to Search Engines
Applying Agile Lean to Global Software Development
Natural Language Processing (NLP)
Presentation transcript:

CSE5544 Final Project Interactive Visualization Tool(s) for IEEE Vis Publication Exploration and Analysis Team Name: Publication Miner Team Members: Xiaonan Ji & Tong Zhao May 1st 2017

Motivation Dataset: Visualization Publication Data Collection - 2,752 IEEE Visualization (IEEE VIS) publications from 1990-2015. Problem & Significance Potential audiences: academics and students working in the Visualization domain. Provide a rapid overview of the IEEE Vis research community and facilitate the cognition of its development over the past 25 years. Facilitate exploration, analysis, and knowledge discovery regarding: Specific research topics and publications Relationships among multiple research topics and publications Leverage impactful publications, and advise the audiences in developing research scopes to address existing gaps or challenges.

What did we do Two visualizations addressing the VisPubData from different perspectives. Map projection of publication content similarity Interactive exploration & analysis Citation/Reference network Interactive exploration & analysis

Visualization 1 Map projection of publication content similarity Interactive exploration & analysis

Objective & Prototype Innovation Gap? Filled Gap Potential future work Mature area Unexplored Area New area Important Publication & its highlighted citations/references Clustering ->Topic? Year 1 Year 2 Year 3 Generate interactive map projections (2D scatter plot) by placing 2,752 publications based on their text features derived from title, abstract, and keywords. Questions to answer How different research topics are developed over the 25 years? How a research topic is explored/fulfilled by relevant publications? What are the most impactful milestone publications? What are the citations and references? What patterns can be identified? What kinds of new potentials can be identified?

Workflow of Implementation Dataset (.csv) (Effective) Text Feature Development with NLP Title Dimensionality Reduction Tokenization, POS Tagging & Chunking Extraction of Noun Phrases (NPs) Stemming (Porter Stemmer) 1-gram Count Abstract Python 1 & 2-gram Tf-idf Truncated SVD Keywords t-SNE Examine publication affinity and clustering with their positions in the 2D map. Adjust resolution with zoom-in and zoom-out. Identify important publications with their sizes. Search publications of interests via DOI or text matching. Click and select a publication for its references/citations. Mouseover a publication to see instant information i.e. title and keywords. Examine the evolution with year selection/filtering. 2D Map Projection D3 User Interaction Sizing Coloring #Citations Type Year

Highlights To facilitate the cognition, exploration, and discovery: NLP for effective text feature development and dimensionality reduction Map projection accommodating 2,752 publication dots. Publication distribution and clustering (research topic) in the 2D space are based on their similarity/relationship in the text feature space. Three modes can be selected: title, abstract, and keywords. Adjustable resolution with zoom-in and zoom-out features to examine clusters and publications on a customizable level. Highlight of important publications (milestones) that have many citations. Instant information display of a mouseover-ed publication. Locate publications of interests by searching with a DOI or user keyword(s). Suggest related publications (references and citations) of a user selected publication. Selection/filter of publication years to examine the evolution through the past 25 years.

Analysis & Findings We were able to identify many dominant research topics, including volume rendering, flow visualization, vector/tensor visualization, multi-dimensional visualization, graph visualization, user interaction and interface, visual data mining, etc. While InfoVis, SciVis, and VAST have different focuses, there exist overlaps among them, for instance, user interaction and interface, multi-dimensional visualization… Some research topics have met different patterns of evolution through the past years: “Volume rendering“ - stable growth during the past years. "Visualization of text document" - "slack" development: initial attention in 1995, but many following studies were not published until 2000s or after 2010. “Graph visualization" - began to draw larger attentions around 2000, the issue of "clutter graph" in 2004, and many related solutions like "parallel coordinate“ and "focus and context” were applied to this scope after then. Many milestone publications that were published in early years (1990s) continue to impact many recent studies, as reflected by the continuous increment of citations. Some research trends and potential opportunities can be identified, as an example, the cognitive exploration of an dataset is drawing increasing attends in the recent years.

Visualization 2 Citation/Reference network Interactive exploration & analysis

Thank you! 