Exemplar-based Visualization of Large Document Corpus Yanhua Chen, Lijun Wang, Ming Dong, and Jing Hua {chenyanh, ljwang, mdong, Department of Computer Science Wayne State University, Detroit, MI
Overview Text Mining and Visualization Current Visualization Systems Exemplar-based Visualization (EV) Experiments and Results EV Demo
Text Mining: Clustering Definition Given: A source of textual documents Similarity measure e.g., how many words are common in these documents Clustering System Similarity measure Documents source Doc Find: Several clusters of documents that are relevant to each other
Current Visualization Systems Text Visualization: select the representation of selected features of complex multi-dimensional data to display in a logical layout (2-D or 3-D) and understand the relationship between documents IN-SPIREInfosky
Exemplar-based Visualization (EV) Data Low-rank Approximation Exemplar-based Clustering Visualization by Parameter Embedding
Experiments and Results Visualization of 20,000 Medical Articles
Exemplar-based Visualization Demo
Reminder Title: Exemplar-based Visualization of Large Scale Document Corpus Session: Text Visualization Time: 10:30am-12:10pm Friday, 16 October