Presentation is loading. Please wait.

Presentation is loading. Please wait.

Hierarchical Segmentation: Finding Changes in a Text Signal Malcolm Slaney and Dulce Ponceleon IBM Almaden Research Center.

Similar presentations


Presentation on theme: "Hierarchical Segmentation: Finding Changes in a Text Signal Malcolm Slaney and Dulce Ponceleon IBM Almaden Research Center."— Presentation transcript:

1 Hierarchical Segmentation: Finding Changes in a Text Signal Malcolm Slaney and Dulce Ponceleon IBM Almaden Research Center

2 Problem Statement Problem How do we browse video? Goal Create a table-of-contents Solution Look for topic changes in text

3 TOC Example Chapter 1 Chapter 2

4 Overview of This Talk Goal and approach Latent semantic indexing (LSI) Scale space Combination Results LSI Scale Space Filter Segment

5 Approach Sentences -> Semantic Space Filter at multiple scales Look for large jumps Three subjects (loops) shown Loop 1: Polychromaticity Artifacts Loop 2: Emission Tomography Loop 3: Ultrasound Tomography

6 Courtesy of Jianbo Shi (CMU) Building on Previous Work LSI and clustering Text tiling Change point analysis Segmentation Scale space

7 Latent Semantic Indexing Collect histogram of word frequencies Use SVD to capture frequent combinations Orthogonal decomposition Represent in low-dimensional space Words Docs 10D

8 LSI Within a Document Split into chunks Fixed size Sentences Compute histograms Perform SVD Look at results Sources “ Principles of Computerized Tomographic Imaging ” PBS News Hour

9 LSI – 2D Projection Chapter 4 of Principles of Computerized Tomographic Imaging

10 LSI – Self-similarity Measure similarity Cosine of angle between “ documents ” Plot all pairs of chunks/sentences Look for block diagonal Chapter 4 of Principles of Computerized Tomographic Imaging

11 Scale-space Filtering What size are the features? Look at different scales! Continuous scale Used for Object Recognition Feature Detection

12 Scale-space Movie Green line marks best high-level segmentation 10d semantic space Scale varies from 1 to 400 sentences

13 Scale-space Segmentation Low pass filter signal Form image of scale vs. time Look for changes Track peaks of vector derivative across scale

14 Scale-space Example Derivative as function of scale and sentence

15 LSI and Scale Space Putting it all together Split document/transcript Perform LSI analysis Look at change in angle Perform scale-space segmentation Show tree

16 Scale-Space Image Peaks in scale- space derivative Peaks traced to their origin

17 Results – CT Comparison Scale-Space Book Headings

18 Results – News Comparison Scale-Space Ground Truth

19 Results – Autocorrelation Block sentences Measure correlation Positive Peak Anti- correlation

20 Discussion Issues Evaluation (and ground truth) Lafferty ’ s measure Temporal properties Histogram/SVD chunking size Autocorrelation

21 Computational Effort Histogram: O(N) SVD: O(N 3 ) Scale space: O(N 2 ) N < 1000 Number of sentences in a video or document is not large

22 LSI Document Lookup Histogram documents Entropy term weighting Compute SVD Use first 10-100 vectors to model space Encode query as histogram Look for documents in similar direction

23 LSI Example Collection of book titles Differential equations vs. algorithms and applications


Download ppt "Hierarchical Segmentation: Finding Changes in a Text Signal Malcolm Slaney and Dulce Ponceleon IBM Almaden Research Center."

Similar presentations


Ads by Google