R EPRESENTING L INGUISTIC D ATA Maha Shouman. T EXT A RC Data target: Raw text Medium-sized Traditional techniques: Structured word lists (indices, concordances)

Slides:



Advertisements
Similar presentations
Recuperação de Informação B Cap. 10: User Interfaces and Visualization 10.1,10.2,10.3 November 17, 1999.
Advertisements

GSSR Research Methodology and Methods of Social Inquiry January 10, 2012 Research Using Available Data.
Copyright , SPSS Inc. 1 Practical solutions for dealing with missing data Rob Woods Senior Consultant.
News Headlines Activity 16.
XML DOCUMENTS AND DATABASES
UT-Space Manager. Define Rooms The Define Rooms task is used to manage your room data. 1.On the Process Navigator, click on the Space Inventory & Performance.
UT-CAD Manager. Define College and Fund Highlight Patterns The Define College and Fund Highlight Patterns task is used to assign a highlight pattern to.
Procedure for Developing a Multimedia Presentation 6.02 Apply procedures to develop multimedia presentations used in business.
Toward Automatic Music Audio Summary Generation from Signal Analysis Seminar „Communications Engineering“ 11. December 2007 Patricia Signé.
Image Information Retrieval Shaw-Ming Yang IST 497E 12/05/02.
Interfaces for Retrieval Results. Information Retrieval Activities Selecting a collection –Talked about last class –Lists, overviews, wizards, automatic.
Modern Information Retrieval
Original Tree:
Searching with Lucene Chapter 2. For discussion Information retrieval What is Lucene? Code for indexer using Lucene Pagerank algorithm.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Web Information Retrieval and Extraction Chia-Hui Chang, Associate Professor National Central University, Taiwan Sep. 16, 2005.
Applying Software Model Checking to Automatic Text Summarization SSSEV2011 Irina Shoshmina, Nasrin Mostafazadeh, Omid Bakhshandeh, Alexey Belyaev, and.
Collection Understanding Through Streaming Collage Michelle Chang John Leggett Center for the Study of Digital Libraries Texas A&M University.
WMES3103 : INFORMATION RETRIEVAL INDEXING AND SEARCHING.
Day 1-3. Variable Selection and GIS Processing 1.Discuss V mapping goals, targeted system (what is vulnerable?), framework 2.Choose data layers (criteria:
Esri International User Conference | San Diego, CA Technical Workshops | Managing and Editing Annotation Natalie Vines Samantha Keehan July 14, 2011.
Procedures 6.02 Apply procedures to develop multimedia presentations used in business.
Term and Document Clustering Manual thesaurus generation Automatic thesaurus generation Term clustering techniques: –Cliques,connected components,stars,strings.
Lakeland Click arrow to advance show. Click on the “A” under “Listed By Name.” (“A” for Academic Search Database)
Copyright © Cengage Learning. All rights reserved. 2 Organizing Data.
Donatella Castelli CNR-ISTI
WordStat & Yoshikoder T.M. & M.S.. WordStat About WordStat Must be run as part of SimStat Designed to process text such as open ended responses, journal.
Manatees of Florida. Standard: MAFS.912.S-ID.1.1: Represent data with plots on the real number line (dot plots, histograms, and box plots). MAFS.912.S-ID.1.3:
Chapter 11 Arrays Continued
Standard Grade Presentations & Multimedia. Presentation & Multimedia Software Allows the user to set up exciting and attractive documents which helps.
Understand business uses of presentation software and methods of distribution.
TEKS: E1.Fig19B E1.5A,B,C,D.  Understanding connections between literary elements facilitates the reader’s ability to make meaning of text.  What techniques.
I NTRODUCTION TO W ORD G 1. Office Button 2. Quick Access toolbar 3. Title bar 5. Tab 7. Commands 6. Group 8. Dialog Box Launcher 4.Ribbon 9. Ruler.
1 CS430: Information Discovery Lecture 18 Usability 3.
Clearly Visual Basic: Programming with Visual Basic 2008 Chapter 24 The String Section.
WIRED Week 3 Syllabus Update (next week) Readings Overview - Quick Review of Last Week’s IR Models (if time) - Evaluating IR Systems - Understanding Queries.
Copyright © 2008 Pearson Prentice Hall. All rights reserved Copyright © 2008 Prentice-Hall. All rights reserved. Committed to Shaping the Next.
1 Applications of video-content analysis and retrieval IEEE Multimedia Magazine 2002 JUL-SEP Reporter: 林浩棟.
MOVIE RETRIEVAL SYSTEM INFORMATION VISUALIZATION & PROPOSING NEW INTERFACE IAT 814 Adrian Bisek.
Performance of Compressed Inverted Indexes. Reasons for Compression  Compression reduces the size of the index  Compression can increase the performance.
HTML Concepts and Techniques Fourth Edition Project 1 Introduction to HTML.
Concept Mapping: A Graphical System for Understanding the Relationship between Concepts. ERIC Digest.
How OLE Works Carlotta Eaton Exploring Microsoft Visual Basic 5.0 To insert your company logo on this slide From the Insert Menu Select “Picture” Locate.
Unit 2: Systems Day 1: Solving Systems with tables and graphing.
Corpus Linguistics MOHAMMAD ALIPOUR ISLAMIC AZAD UNIVERSITY, AHVAZ BRANCH.
Procedure for Developing a Multimedia Presentation Apply procedures to develop multimedia presentations used in business.
Reading in Business Administration Center for Academic and Technology Support.
Chapter 23 The String Section (String Manipulation) Clearly Visual Basic: Programming with Visual Basic nd Edition.
Visualising the Old Bailey Proceedings as a Digital Panopticon Dataset RICHARD WARD RESEARCH ASSOCIATE, DIGITAL PANOPTICON PROJECT 14 APRIL 2014
Data -Data is the raw materials from which information is generated. -Data are raw facts or observations typically about physical phenomena or business.
Fundamentals of Nud*ist 6 Overview for Nursing Faculty May 2003 by June Kaminski, MSN.
BTS430 Systems Analysis and Design using UML Design Class Diagrams (ref=chapter 16 of Applying UML and Patterns)
Copyright 2002, Paradigm Publishing Inc. CHAPTER 25 BACKNEXTEND 25-1 LINKS TO OBJECTIVES Compiling a Table of Contents Compiling a Table of Contents Assigning.
WHIM- Spring ‘10 By:-Enza Desai. What is HCIR? Study of IR techniques that brings human intelligence into search process. Coined by Gary Marchionini.
Understanding Core Database Concepts Lesson 1. Objectives.
AET 545 Week 2 Individual Needs Analysis Select your workplace setting or a setting with which you are familiar for this project. Collect data about the.
Unit 2: Lesson 11 & 12 Making Data Visualizations
Solving Linear Inequalities in One Unknown
Unit 2: Lesson 11 & 12 Making Data Visualizations
ETI 4448 Applied Project Management
YOUR text YOUR text YOUR text YOUR text
Click Summary Value Button to Show Source of Integral or Time
CLICK TO START.
CLICK TO START.
Understanding Core Database Concepts
Histogram Summary Process Steps
Call Now : Click : -
Call Now : Click : -
Call Now : Click : -
Histogram Summary Process Steps
Presentation transcript:

R EPRESENTING L INGUISTIC D ATA Maha Shouman

T EXT A RC Data target: Raw text Medium-sized Traditional techniques: Structured word lists (indices, concordances) Automatic summary generation Exclude original linearity!

Index Concordance

T HEME R IVER Data target: Large text collections Temporal patterns Thematic changes Traditional techniques: Histogram Other visualizations focus on documents

3D T HEME R IVER ? 3DThemeriver.pdf

T HE W ORD T REE Visualization + information retrieval Graphical Key Word In Context (KWIC) Format for concordance KWIC + suffix tree

T HE W ORD T REE

Click Shift- Click