Presentation is loading. Please wait.

Presentation is loading. Please wait.

GRAPH BASED MULTI-DOCUMENT SUMMARIZATION Canan BATUR 504101545.

Similar presentations


Presentation on theme: "GRAPH BASED MULTI-DOCUMENT SUMMARIZATION Canan BATUR 504101545."— Presentation transcript:

1 GRAPH BASED MULTI-DOCUMENT SUMMARIZATION Canan BATUR 504101545

2 GRAPH BASED MULTI-DOCUMENT SUMMARIZATION What is Summarization ? Why important ?

3 GRAPH BASED MULTI-DOCUMENT SUMMARIZATION What is the main differences in Multi-document and single- document summarization ? ● Degree of redundacy ● Temporal dimensions ● Compression Ratio ● Co-reference Problem

4 GRAPH BASED MULTI-DOCUMENT SUMMARIZATION Multi Document Summarization Methods ● Extractive Summarization ● Abstractive Summarization ● Summary Should be Readable,interrelated with other sentences and function of cohesion is studied ● How we Analyze or extract important information ?

5 GRAPH BASED MULTI-DOCUMENT SUMMARIZATION Which Algorithm we should Use ? ● Graph Based Ranking Algoritm ● This algorithm is a way deciding on the important vertex within a graph. ● We can use this algorithm in Natural language processing.

6 GRAPH BASED MULTI-DOCUMENT SUMMARIZATION

7

8 ● HITS (Hyperlinked Induced Topic Search) ● Is an İteartive Algorithm that was designed for ranking Web Pages. ●

9 GRAPH BASED MULTI-DOCUMENT SUMMARIZATION ● Positional Power Function ● Determine Score Of vertex ● Positional Weakness Function ●

10 GRAPH BASED MULTI-DOCUMENT SUMMARIZATION ● Page Rank ● Designed For Web Link Analysis

11 GRAPH BASED MULTI-DOCUMENT SUMMARIZATION

12

13

14

15

16

17

18 Rouge is set of metrics evaluate the summarization success.

19

20 GRAPH BASED MULTI-DOCUMENT SUMMARIZATION ● Conclusion ● ● A text unit (vertex) recommends other related text units,and the strength of the recommendation is recursively computed based on importance unit making recommendation.

21 References S. Brin and L. Page. 1998. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1–7). DUC. 2002. Document understanding conference 2002. http://wwwnlpir. nist.gov/projects/duc/. P.J. Herings, G. van der Laan, and D. Talman. 2001. Measuring the power of nodes in digraphs. Technical report, Tinbergen Institute. J.M. Kleinberg. 1999. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604–632. C.Y. Lin and E.H. Hovy. 2003a. Automatic evaluation of summaries using n-gram co-occurrence statistics. In Proceedings of Human Language Technology Conference (HLT-NAACL 2003), Edmonton, Canada, May. C.Y. Lin and E.H. Hovy. 2003b. The potential and limitations of sentence extraction for summarization. In Proceedings of the HLT/NAACL Workshop on Automatic Summarization, Edmonton, Canada, May. R. Mihalcea and P. Tarau. 2004. TextRank – bringing order into texts. R. Mihalcea, P. Tarau, and E. Figa. 2004. PageRank on semantic networks, with application to word sense disambiguation. In Proceedings of the 20st International Conference on Computational Linguistics (COLING 2004), Geneva, Switzerland, August. G. Salton, A. Singhal, M. Mitra, and C. Buckley. 1997. Automatic text structuring and summarization. Information Processing and Management, 2(32). S. Teufel and M. Moens. 1997. Sentence extraction as a classification task. In ACL/EACL workshop on ”Intelligent and scalable Text summarization”, pages 58–65, Madrid, Spain.

22 THANK YOU...


Download ppt "GRAPH BASED MULTI-DOCUMENT SUMMARIZATION Canan BATUR 504101545."

Similar presentations


Ads by Google