Plan for today Introduction Graph Matching Method Theme Recognition Comparison Conclusion.

Slides:



Advertisements
Similar presentations
Machine Learning Approaches to the Analysis of Large Corpora : A Survey Xunlei Rose Hu and Eric Atwell University of Leeds.
Advertisements

Date: 2014/05/06 Author: Michael Schuhmacher, Simon Paolo Ponzetto Source: WSDM’14 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Knowledge-based Graph Document.
O(N 1.5 ) divide-and-conquer technique for Minimum Spanning Tree problem Step 1: Divide the graph into  N sub-graph by clustering. Step 2: Solve each.
A UTOMATICALLY A CQUIRING A S EMANTIC N ETWORK OF R ELATED C ONCEPTS Date: 2011/11/14 Source: Sean Szumlanski et. al (CIKM’10) Advisor: Jia-ling, Koh Speaker:
Comparing Twitter Summarization Algorithms for Multiple Post Summaries David Inouye and Jugal K. Kalita SocialCom May 10 Hyewon Lim.
A New Suffix Tree Similarity Measure for Document Clustering Hung Chim, Xiaotie Deng City University of Hong Kong WWW 2007 Session: Similarity Search April.
Unsupervised Feature Selection for Multi-Cluster Data Deng Cai et al, KDD 2010 Presenter: Yunchao Gong Dept. Computer Science, UNC Chapel Hill.
Automatic Classification of Accounting Literature Nineteenth Annual Strategic and Emerging Technologies Workshop Vasundhara Chakraborty, Victoria Chiu,
Data Mining and Machine Learning Lab Document Clustering via Matrix Representation Xufei Wang, Jiliang Tang and Huan Liu Arizona State University.
Rooks, Parts of the paragraph Objective: Enable students to write a complete outline of paragraph and a complete paragraph with the correct grammar.
IR & Metadata. Metadata Didn’t we already talk about this? We discussed what metadata is and its types –Data about data –Descriptive metadata is external.
Semantic text features from small world graphs Jure Leskovec, IJS + CMU John Shawe-Taylor, Southampton.
Rooks, Review  Objective : Students are able to explain cause and effect analysis paragraphs and comparison and contrast paragraph. Students.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Lecture 17 CSE 331 Oct 8, HW 4 due today Q1 and Q2 in one pile Q3 in another pile I will not take any HW after 1:15pm.
Memoplex Browser: Searching and Browsing in Semantic Networks CPSC 533C - Project Update Yoel Lanir.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Essay Outline Poetry Unit.
Click mouse or spacebar to continue …
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Define html document byusing Example : Title of the document The content of the document......
Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification on Reviews Peter D. Turney Institute for Information Technology National.
CS654: Digital Image Analysis Lecture 3: Data Structure for Image Analysis.
1 Text Summarization: News and Beyond Kathleen McKeown Department of Computer Science Columbia University.
Web Document Clustering: A Feasibility Demonstration Oren Zamir and Oren Etzioni, SIGIR, 1998.
Ioana Sora, Gabriel Glodean, Mihai Gligor Department of Computers Politehnica University of Timisoara Software Architecture Reconstruction: An Approach.
Co-clustering Documents and Words Using Bipartite Spectral Graph Partitioning Jinghe Zhang 10/28/2014 CS 6501 Information Retrieval.
Web Mining: Phrase-based Document Indexing and Document Clustering Khaled Hammouda, Ph.D. Candidate Mohamed Kamel, Supervisor, PI PAMI Research Group University.
Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining
Collective Vision: Using Extremely Large Photograph Collections Mark Lenz CameraNet Seminar University of Wisconsin – Madison February 2, 2010 Acknowledgments:
University of Malta CSA3080: Lecture 4 © Chris Staff 1 of 14 CSA3080: Adaptive Hypertext Systems I Dr. Christopher Staff Department.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
Edge and Boundary interpretation Consistent line drawing labeling via backtracking Presented by Guy Shtub.
Semantic Wordfication of Document Collections Presenter: Yingyu Wu.
Event-Centric Summary Generation Lucy Vanderwende, Michele Banko and Arul Menezes One Microsoft Way, WA, USA DUC 2004.
The Grammar Business © 2001 Glenrothes College The Grammar Business Part Two 4. Logic rules: paragraphs, links and headings.
Clustering Sentence-Level Text Using a Novel Fuzzy Relational Clustering Algorithm.
S calable K nowledge C omposition Ontology Interoperation January 19, 1999 Jan Jannink, Prasenjit Mitra, Srinivasan Pichai, Danladi Verheijen, Gio Wiederhold.
Imaged Document Text Retrieval without OCR IEEE Trans. on PAMI vol.24, no.6 June, 2002 報告人:周遵儒.
Topics to be covered: Paragraph construction Types of paragraphs General to specific Problem/solution Paragraph issues * using definitions * internal.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Extending the Growing Hierarchal SOM for Clustering Documents.
TWC Illuminate Knowledge Elements in Geoscience Literature Xiaogang (Marshall) Ma, Jin Guang Zheng, Han Wang, Peter Fox Tetherless World Constellation.
Writing a paragraph.
PhD Dissertation Defense Scaling Up Machine Learning Algorithms to Handle Big Data BY KHALIFEH ALJADDA ADVISOR: PROFESSOR JOHN A. MILLER DEC-2014 Computer.
Link Distribution on Wikipedia [0407]KwangHee Park.
Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)
Improving the Classification of Unknown Documents by Concept Graph Morteza Mohagheghi Reza Soltanpour
Enhanced hypertext categorization using hyperlinks Soumen Chakrabarti (IBM Almaden) Byron Dom (IBM Almaden) Piotr Indyk (Stanford)
Ganesh J, Soumyajit Ganguly, Manish Gupta, Vasudeva Varma, Vikram Pudi
Essay writing. Comparative essays: Comparative essays establish a level of similarity (there has to be one) and then examines how two texts (articles,
Compare and Contrast Essay Example
Question Answering Passage Retrieval Using Dependency Relations (SIGIR 2005) (National University of Singapore) Hang Cui, Renxu Sun, Keya Li, Min-Yen Kan,
A Survey on Automatic Text Summarization Dipanjan Das André F. T. Martins Tolga Çekiç
BASIC COREEXPANDED CORE Points 1) Has acceptable thesis (Addresses comparison of the issues or themes specified) 1Expands beyond basic core of 1-7. The.
Link Distribution in Wikipedia [0324] KwangHee Park.
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Visualization in Process Mining
Document Clustering Based on Non-negative Matrix Factorization
Semantic Processing with Context Analysis
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Mining the Data Charu C. Aggarwal, ChengXiang Zhai
Summarizing Entities: A Survey Report
Systems of Inequalities
ECE 3000: Engineering Seminar
LAB REPORT (50 pts) I- Title (1)
The Writing Process Four Stages.
Vocabulary activity: Copy the following terms into your notes
Muscle Size vs. Muscle Strength
Research Presentation Directions:
CoXML: A Cooperative XML Query Answering System
Presentation transcript:

Plan for today Introduction Graph Matching Method Theme Recognition Comparison Conclusion

Introduction Fact: growth of information sources Problem: impossible to read everything Assumption: documents are structured Solution: automated summary!

Graph Matching Mani & Bloedorn, 1997, MITRE company - Word is a node -Adjacency links -For each phrase: -Find different -Find common -FSD algorithm: -C- and D-score -Phrase links Articles Graphs Words Summary? Phrases -Name links -Same links -Spreading act. -Select best ones

Theme Recognition McKeown et al., 1999, Columbia Univ. -break into paragraphs -find similarities -preprocess phrases -match phrase trees -make grammar trees -break into phrases -construct sentences -cluster similar para’s Articles Themes Phrases Summary Sentences

Comparison 5 issues –Content Representation –Information Fusion –Semantics Preservation –Scalability –Domain Independence

Content Representation Graph Matching –keeps doc’s apart –scale: word Theme Recognition –all doc’s on one pile –scale: paragraph

Information Fusion Graph MatchingTheme Recognition

Semantics Preservation Graph MatchingTheme Recognition

Scalability Graph MatchingTheme Recognition

Domain Independence Graph MatchingTheme Recognition

Conclusion