Exemplar-based Visualization of Large Document Corpus Yanhua Chen, Lijun Wang, Ming Dong, and Jing Hua {chenyanh, ljwang, mdong, Department.

Slides:



Advertisements
Similar presentations
What are the S, T, and E in STEM? How are they related?
Advertisements

Machine Learning & Data Mining CS/CNS/EE 155 Lecture 14: Embeddings 1Lecture 14: Embeddings.
New Geometric Methods of Mixture Models for Interactive Visualization PIs: Jia Li, Xiaolong (Luke) Zhang, Bruce Lindsay Department of Statistics College.
Search and Retrieval: More on Term Weighting and Document Ranking Prof. Marti Hearst SIMS 202, Lecture 22.
IMGD The Game Development Process: Project 5 – Level Design Due: Friday, October 9 th Status report: Monday, October 5 th.
Level Design Project 5 Due date: Friday, October 6th.
Image Annotation using XML and MPEG-7 Manjeet Rege Department of Computer Science Wayne State University
Tuesday Session 2 – Intro to ArcMap Starting Arc Map – Empty Map – Map Template – Project Data View – Display – Source – Selection Layout View – Draft.
Incorporating User Provided Constraints into Document Clustering Yanhua Chen, Manjeet Rege, Ming Dong, Jing Hua, Farshad Fotouhi Department of Computer.
IMGD The Game Development Process: Project 5 – Level Design Due: Friday, October 10 th (in class) Status report: Monday, October 6 th.
Professional Website Portfolios Principles of Visual Design LCC 2720 Brian Schrank.
Rye City School District  Using Google Docs allows you to create documents, presentations, spreadsheets, forms and drawings to share, collaborate.
8/13/2015 | 1 Library Introduction Arts, Culture and Media.
Visual Analytics for Interactive Exploration of Large-Scale Documents via Nonnegative Matrix Factorization Jaegul Choo*, Barry L. Drake †, and Haesun Park*
Graphing Data for Science. Graphing Data A graph is a visual representation of data that allows us to quickly see trends and relationships between (or.
Webpage Understanding: an Integrated Approach
Utilising software to enhance your research Eamonn Hynes 5 th November, 2012.
Datamining MEDLINE for Topics and Trends in Dental and Craniofacial Research William C. Bartling, D.D.S. NIDCR/NLM Fellow in Dental Informatics Center.
© 2010 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Designing the User Interface: Strategies for Effective Human-Computer.
Transfer Learning with Applications to Text Classification Jing Peng Computer Science Department.
Document Collections cs5984: Information Visualization Chris North.
Microsoft Dynamics NAV 2009 and Architecture Overview Name Title Microsoft Corporation.
Semantic Wordfication of Document Collections Presenter: Yingyu Wu.
Lab 4 ZigBee & with PICDEM Z Boards 55:088 Spring 2006.
1 CSC 594 Topics in AI – Text Mining and Analytics Fall 2015/16 6. Dimensionality Reduction.
CS3041 – Final week Today: Searching and Visualization Friday: Software tools –Study guide distributed (in class only) Monday: Social Imps –Study guide.
1 KMeD: A Knowledge-Based Multimedia Medical Database System Wesley W. Chu Computer Science Department University of California, Los Angeles
Machine Learning and Data Mining Clustering (adapted from) Prof. Alexander Ihler TexPoint fonts used in EMF. Read the TexPoint manual before you delete.
Web Search and Text Mining Lecture 5. Outline Review of VSM More on LSI through SVD Term relatedness Probabilistic LSI.
KNN & Naïve Bayes Hongning Wang Today’s lecture Instance-based classifiers – k nearest neighbors – Non-parametric learning algorithm Model-based.
Friday, September 4 th, 2009 The Systems Group at ETH Zurich XML and Databases Exercise Session 5 courtesy of Ghislain Fourny/ETH © Department of Computer.
Combining Text and Image Queries at ImageCLEF2005: A Corpus-Based Relevance-Feedback Approach Yih-Cheng Chang Department of Computer Science and Information.
Cluster Analysis Data Mining Experiment Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
Personalizing Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
Algorithms and Tools Relevant for the Mapping of Science and HPS Lead: Kevin Boyack Participants: Tony Beavers Yunwei Chen Jean-Gabriel Ganascia Jaimie.
Topical Analysis and Visualization of (Network) Data Using Sci2 Ted Polley Research & Editorial Assistant Cyberinfrastructure for Network Science Center.
Canadian Bioinformatics Workshops
KNN & Naïve Bayes Hongning Wang
OU Neurology THE TITLE OF MY TALK John Doe, M.D. Professor Department of Neurology The University of Oklahoma Health Sciences Center.
WAYNE STATE UNIVERSITY – DETROIT, MICHIGAN
On Dataless Hierarchical Text Classification
Concept Map: Clustering Visualizations of Categorical Domains
WAYNE STATE UNIVERSITY – DETROIT, MICHIGAN
Example 13 Wendy Balmer Indiana U
Prepared by: Mahmoud Rafeek Al-Farra
Physics-based simulation for visual computing applications
DiXiT Camp 1 – Presentation ESR2: Document-centric Editions (KCL)
Project Title This is a sample slide layout
cs5984: Information Visualization Chris North
CHAPTER 7: Information Visualization
WAYNE STATE UNIVERSITY – DETROIT, MICHIGAN
WAYNE STATE UNIVERSITY – DETROIT, MICHIGAN
WAYNE STATE UNIVERSITY – DETROIT, MICHIGAN
WAYNE STATE UNIVERSITY – DETROIT, MICHIGAN
Title Layout Subtitle.
WAYNE STATE UNIVERSITY – DETROIT, MICHIGAN
WAYNE STATE UNIVERSITY – DETROIT, MICHIGAN
WAYNE STATE UNIVERSITY – DETROIT, MICHIGAN
WAYNE STATE UNIVERSITY – DETROIT, MICHIGAN
WAYNE STATE UNIVERSITY – DETROIT, MICHIGAN
Precision and Recall Reminder:
Project Title This is a sample poster layout -
WAYNE STATE UNIVERSITY – DETROIT, MICHIGAN
Strength of relation High Low Number of data Relationship Data
THE TITLE OF MY TALK John Doe, M.D. Professor Department of Neurology
Suggested Layout ** Designed to be printed on A3 paper in an assortment of colours. This is directly linked to the Computer Science Specification.
Presentation transcript:

Exemplar-based Visualization of Large Document Corpus Yanhua Chen, Lijun Wang, Ming Dong, and Jing Hua {chenyanh, ljwang, mdong, Department of Computer Science Wayne State University, Detroit, MI

Overview Text Mining and Visualization Current Visualization Systems Exemplar-based Visualization (EV) Experiments and Results EV Demo

Text Mining: Clustering Definition Given: A source of textual documents Similarity measure e.g., how many words are common in these documents Clustering System Similarity measure Documents source Doc Find: Several clusters of documents that are relevant to each other

Current Visualization Systems Text Visualization: select the representation of selected features of complex multi-dimensional data to display in a logical layout (2-D or 3-D) and understand the relationship between documents IN-SPIREInfosky

Exemplar-based Visualization (EV) Data Low-rank Approximation Exemplar-based Clustering Visualization by Parameter Embedding

Experiments and Results Visualization of 20,000 Medical Articles

Exemplar-based Visualization Demo

Reminder Title: Exemplar-based Visualization of Large Scale Document Corpus Session: Text Visualization Time: 10:30am-12:10pm Friday, 16 October