Part Two: Using Xaira to explore corpora Richard Xiao


2 Outline of the talk
- Concordance
- Wordlist
- Keywords (No)
- Output formats
- Manipulating results
- Collocation/colligation
- Distribution analysis
- Live demonstration
- Tips for keeping away from bugs
- Multilingual dimension
- Xaira FAQs

3 Concordance
- Word query: search for a word
- Phrase/Quick query: search for a word or phrase
- Addkey query: POS or lemma search
- Pattern query: regular expression search
- XML query: search for XML markup
- CQL/XQL query: search using the XML-based Corpus Query Language
- Query builder: a powerful combination of all query types
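To make the idea of a pattern (regular expression) query concrete, here is a minimal Python sketch of a keyword-in-context search over a plain string. It does not use Xaira or its index; the function name `kwic`, the `width` parameter, and the sample sentence are illustrative assumptions only.

```python
import re

def kwic(text, pattern, width=30):
    """Return (left context, match, right context) for every regex match.

    Hypothetical helper for illustration; this is not Xaira's query engine.
    """
    hits = []
    for m in re.finditer(pattern, text):
        left = text[max(0, m.start() - width):m.start()]
        right = text[m.end():m.end() + width]
        hits.append((left, m.group(0), right))
    return hits

# Made-up sample text, not taken from any corpus.
sample = "The cat sat on the mat. A cat is not a dog, but cats and dogs can be friends."

# Print one aligned line per hit, with the node word in brackets.
for left, node, right in kwic(sample, r"\bcats?\b", width=20):
    print(f"{left:>20} [{node}] {right}")
```

The pattern `\bcats?\b` matches both "cat" and "cats" as whole words, which is roughly what a pattern query does when it is run against the word forms in a lexicon.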

4 Wordlist
- In Client >> Word query (up to 100,000 lexicon entries): sorting alphabetically, by frequency, or by the number of forms
- In Xaira Indexer Tools >> Tools >> Indexer >> Options >> Create frequency table
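As a rough illustration of what a wordlist is, the following Python sketch builds a frequency list from a short string and sorts it alphabetically and by frequency. The tokeniser (lowercase letters and apostrophes only) and the sample text are simplifying assumptions; Xaira builds its lexicon from the indexed corpus and may tokenise differently.

```python
import re
from collections import Counter

def wordlist(text):
    """Count word forms in text using a deliberately naive tokeniser."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(tokens)

# Made-up sample text for illustration.
freq = wordlist("The cat sat on the mat. The dog sat too.")

# Two of the sort orders mentioned on the slide.
alphabetical = sorted(freq.items())
by_frequency = sorted(freq.items(), key=lambda kv: (-kv[1], kv[0]))

for word, count in by_frequency:
    print(f"{word}\t{count}")
```

Sorting by the number of forms would need lemma information, which this sketch does not model.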

5 Keyword?
- Sadly, no – use WordSmith instead
- WordSmith version 4.0 fully supports Unicode

6 Output formats
- Page mode vs. Line mode (KWIC)
- Plain text vs. XML text
- Scope of context
- Alignment (left, right, top, bottom)
- Reference (on the status bar)

7 Manipulating results
- Edit query (to save time for related queries)
- Bibliographical data
- Sort KWIC concordances
- Select/block select/copy concordances
- Right click on a concordance
- Thin/edit concordances
- Random sampling
- Save queries and export them in XML
- Print results
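Random sampling of a result set can be pictured as follows. This is a minimal Python sketch, assuming the concordance lines are already held in a list; `thin_random` and its `seed` parameter are hypothetical names, and nothing here reflects how Xaira implements thinning internally.

```python
import random

def thin_random(concordances, k, seed=0):
    """Keep k randomly chosen lines, preserving their original order.

    Hypothetical helper; a fixed seed makes the sample reproducible.
    """
    rng = random.Random(seed)
    keep = sorted(rng.sample(range(len(concordances)), k))
    return [concordances[i] for i in keep]

# Made-up concordance lines for illustration.
lines = [f"hit {i}" for i in range(100)]
sample_lines = thin_random(lines, 10, seed=42)
print(sample_lines)
```

Fixing the seed is a design choice for reproducibility: the same query thinned twice gives the same subset, which is useful when results need to be cited.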

8 Collocation/colligation
- Statistical measure (MI or Z)
- Window span
- Minimum frequency
- Minimum MI/Z score
- Top N collocates
- Computing collocation statistics for individual words
- Applying selected lemmata
- Colligation (Addkey tags)
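The two measures named above can be sketched in Python using one common textbook formulation: the expected co-occurrence count is E = f(node) × f(collocate) × span / N, MI is log2(O / E), and the z-score is approximated as (O − E) / √E. The slides do not state which exact formulas Xaira uses, so treat these definitions, the function names, and the example figures as assumptions.

```python
import math

def expected(node_freq, coll_freq, corpus_size, span):
    """Expected co-occurrences if node and collocate were independent."""
    return node_freq * coll_freq * span / corpus_size

def mutual_information(observed, node_freq, coll_freq, corpus_size, span):
    """MI = log2(observed / expected)."""
    return math.log2(observed / expected(node_freq, coll_freq, corpus_size, span))

def z_score(observed, node_freq, coll_freq, corpus_size, span):
    """Simplified z-score: (observed - expected) / sqrt(expected)."""
    e = expected(node_freq, coll_freq, corpus_size, span)
    return (observed - e) / math.sqrt(e)

# Hypothetical figures: node occurs 100 times, collocate 200 times,
# corpus of 1,000,000 words, window of 4 words either side (span = 8),
# and 16 observed co-occurrences within the window.
args = dict(observed=16, node_freq=100, coll_freq=200, corpus_size=1_000_000, span=8)
mi = mutual_information(**args)
z = z_score(**args)
print(f"MI = {mi:.2f}, Z = {z:.2f}")
```

With these made-up figures the expected count is 0.16, so 16 observed co-occurrences is 100 times more than chance. The minimum frequency and minimum score settings listed on the slide correspond to thresholds applied to the observed count and to the MI or Z value.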

9 Distribution analysis
- Defining partition (subcorpora)
  - (Texts >> Column control to select XML tags)
  - Texts >> Define partition (3 ways)
  - Based on selected class, values in a column, or solutions to a query
  - Texts >> Open partition
- Tabulation (text class, words, hits, %, etc.)
- Normalised frequencies for subcorpora
- Sorting tabulated data
- Graphic presentation (pie/bar chart)
- Save distribution data in various forms
- Copy pie/bar chart
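Normalised frequencies make hit counts comparable across subcorpora of different sizes. A minimal Python sketch follows, assuming a per-million-words base and made-up partition sizes; the base Xaira uses for its own tabulation is not stated in the slides.

```python
def per_million(hits, words):
    """Normalise a raw hit count to a frequency per million words."""
    return hits * 1_000_000 / words

# Hypothetical partitions: name -> (hits, size in words).
partitions = {
    "spoken": (120, 400_000),
    "written": (300, 2_000_000),
}

normalised = {name: per_million(h, w) for name, (h, w) in partitions.items()}

# Sort the tabulated data by normalised frequency, highest first.
for name, value in sorted(normalised.items(), key=lambda kv: -kv[1]):
    print(f"{name}\t{value:.1f} per million words")
```

In this invented example the written partition has more raw hits (300 against 120), but the spoken partition has the higher normalised frequency, which is why the comparison should be made on normalised rather than raw counts.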

10 Additional features of Xaira
- Annotating concordances (making notes)
- Copying query text or notes
- User-defined stylesheet
- Colour book (e.g. different colours for different POS categories)
- Remote access over a network
- Platform-independent

11 Xaira live demonstration Here we go… …slides to follow

12 Tips for keeping away from bugs
- In Line mode, a maximum of 1,524 concordances are displayed
  - See the rest in Page mode
- In Query builder, joining query nodes in the horizontal direction (OR) and then in the vertical direction (AND) may produce unreliable counts when the Link type is specified as One-way or Two-way
  - Only define the Link type as Next or Not next
- If thousands of hits are downloaded and dozens of them are deleted by reverse selection in thinning, the system may crash
- If concordances have been sorted or edited, a saved query may not open again
  - Save the edited concordances as an XML list using Query – Listing in the menu or the corresponding toolbar button

13 Truly multilingual - Chinese

14 Truly multilingual - Bengali

15 Truly multilingual - Hindi

16 Truly multilingual - Punjabi

17 Truly multilingual - Urdu

18 Xaira FAQs
- Is Xaira free and where can I get it?
  - Yes, it is absolutely free. You can get a copy (binary for Windows, and source code for compilation on Unix/Linux/Mac systems) at the SourceForge website. The latest release is
- Where can I get more documentation?
  - In addition to the built-in help file, more documentation is available at the Xaira site:
- Where can I get technical help?
  - You can sign up for the Xaira Preview List to get help:
- For a critical review, see pdf