Representing Linguistic Data. Maha Shouman.


1 Representing Linguistic Data. Maha Shouman

2 TextArc. Data target: raw text, medium-sized. Traditional techniques: structured word lists (indices, concordances); automatic summary generation. Exclude original linearity!

3 Index: http://www.i75online.com/FLAIndexPage1.html. Concordance: http://www.opensourceshakespeare.com
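The two traditional word-list techniques named above, the index and the concordance, can be illustrated with a minimal sketch. The tokenizer, the function names (`tokenize`, `build_index`, `concordance`) and the sample sentence are assumptions made for illustration; they are not taken from the slides or from the linked sites.

```python
import re
from collections import defaultdict

def tokenize(text):
    """Lowercase word tokens; a deliberately simple, assumed tokenizer."""
    return re.findall(r"[a-z']+", text.lower())

def build_index(tokens):
    """Index: map each word to the token positions where it occurs."""
    index = defaultdict(list)
    for pos, word in enumerate(tokens):
        index[word].append(pos)
    return dict(index)

def concordance(tokens, index, keyword, width=3):
    """Concordance: (left context, keyword, right context) per occurrence."""
    rows = []
    for pos in index.get(keyword, []):
        left = " ".join(tokens[max(0, pos - width):pos])
        right = " ".join(tokens[pos + 1:pos + 1 + width])
        rows.append((left, keyword, right))
    return rows

# Hypothetical sample text, chosen only to show repeated words.
text = "To be, or not to be, that is the question."
tokens = tokenize(text)
index = build_index(tokens)
rows = concordance(tokens, index, "be", width=2)

for left, word, right in rows:
    print(f"{left:>10} [{word}] {right}")
```

Both structures regroup the words by type, so the reading order of the text is no longer visible, which is the loss of linearity the slide points at.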

4

5

6

7

8

9

10 ThemeRiver. Data target: large text collections; temporal patterns; thematic changes. Traditional techniques: histogram. Other visualizations focus on documents
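As a rough sketch of the data behind a river-style view, the code below counts how often each theme word occurs per time bin and stacks those counts around a centered baseline, so each theme becomes a band whose width follows its count. This is an assumption-laden simplification: the function names, the toy documents and the whitespace word matching are hypothetical, and the smoothing and layout of the actual ThemeRiver system are not reproduced.

```python
from collections import Counter

def theme_counts(docs, themes):
    """docs: iterable of (time_bin, text). Returns {time_bin: Counter}."""
    counts = {}
    for time_bin, text in docs:
        words = text.lower().split()
        bin_counts = counts.setdefault(time_bin, Counter())
        for theme in themes:
            bin_counts[theme] += words.count(theme)
    return counts

def river_layers(counts, themes):
    """Stack themes around a centered baseline for each time bin.

    Returns {time_bin: [(theme, lower, upper), ...]} where
    upper - lower equals the theme's count in that bin.
    """
    layers = {}
    for time_bin in sorted(counts):
        total = sum(counts[time_bin][t] for t in themes)
        lower = -total / 2  # center the whole stack on zero
        stack = []
        for theme in themes:
            upper = lower + counts[time_bin][theme]
            stack.append((theme, lower, upper))
            lower = upper
        layers[time_bin] = stack
    return layers

# Hypothetical toy collection: (time bin, document text).
docs = [
    (1, "oil prices and oil supply"),
    (1, "trade talks"),
    (2, "trade deal and trade dispute and oil"),
]
themes = ["oil", "trade"]
counts = theme_counts(docs, themes)
layers = river_layers(counts, themes)
```

A histogram would show the same counts as separate bars per bin; the stacked bands make the change of each theme across adjacent bins easier to follow.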

11

12

13

14 3D ThemeRiver? www.cs.sunysb.edu/~vislab/papers/3DThemeriver.pdf

15 The Word Tree. Visualization + information retrieval. Graphical Key Word In Context (KWIC): format for concordance. KWIC + suffix tree
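The combination of KWIC and a suffix tree can be sketched as a small trie: every occurrence of a chosen root word contributes the words that follow it, and shared continuations merge into one branch with a count. This is a simplified stand-in under stated assumptions, not the Word Tree implementation: a plain nested-dict trie replaces a true suffix tree, contexts are cut at a fixed `depth`, sentence boundaries are ignored, and the names `build_word_tree` and `render` and the sample text are hypothetical.

```python
import re

def build_word_tree(text, root_word, depth=3):
    """Trie of the words following each occurrence of root_word.

    Each node is {"count": int, "children": {word: node}}.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    root = {"count": 0, "children": {}}
    for i, tok in enumerate(tokens):
        if tok != root_word:
            continue
        root["count"] += 1
        node = root
        # Walk the right-hand context, merging shared prefixes.
        for word in tokens[i + 1:i + 1 + depth]:
            node = node["children"].setdefault(
                word, {"count": 0, "children": {}}
            )
            node["count"] += 1
    return root

def render(node, label, indent=0):
    """Indented text lines; most frequent branches first, ties by name."""
    lines = [f"{'  ' * indent}{label} ({node['count']})"]
    ordered = sorted(
        node["children"].items(), key=lambda kv: (-kv[1]["count"], kv[0])
    )
    for word, child in ordered:
        lines.extend(render(child, word, indent + 1))
    return lines

# Hypothetical sample text.
text = "The cat sat on the mat. The cat sat down. The cat ran."
tree = build_word_tree(text, "cat", depth=2)
print("\n".join(render(tree, "cat")))
```

In a graphical version the count would typically drive the font size of each branch; here it is only printed next to the word.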

16 The Word Tree

17

18

19

20

21 Click. Shift-Click

