R EPRESENTING L INGUISTIC D ATA Maha Shouman
T EXT A RC Data target: Raw text Medium-sized Traditional techniques: Structured word lists (indices, concordances) Automatic summary generation Exclude original linearity!
Index Concordance
T HEME R IVER Data target: Large text collections Temporal patterns Thematic changes Traditional techniques: Histogram Other visualizations focus on documents
3D T HEME R IVER ? 3DThemeriver.pdf
T HE W ORD T REE Visualization + information retrieval Graphical Key Word In Context (KWIC) Format for concordance KWIC + suffix tree
T HE W ORD T REE
Click Shift- Click