Helen Dry & Anthony Aristar, LINGUIST List. LSA Symposium: The Open Language Archives Community, 4 January 2002. http://linguistlist.org

OLAC, EMELD, & Us

OLAC Launch, LSA-02
Who is Us?
The community of academic linguists
- who produce data & documentation on languages
- who use language data & documentation in their research
Includes most subscribers to The LINGUIST List

The LINGUIST List
- 15,200 subscribers
- 105 different countries
- 4 European mirror sites: Tübingen | Stockholm | Edinburgh | Moscow
- Current project: EMELD...

What is E-MELD?
Electronic Metastructure for Endangered Languages Data
- 5-year collaborative project, begun Sept
- Participants:
  - The LINGUIST List (Eastern Michigan University, Wayne State University, University of Arizona)
  - The Linguistic Data Consortium (University of Pennsylvania)
  - The Endangered Languages Fund (Yale University, Haskins Laboratories)
- Funded by NSF

E-MELD Objectives:
To aid in …
- … the preservation of Endangered Languages (EL) data and documentation
- … the development of infrastructure for linguistic archives

The Problem with EL archives:
- Lack of interoperability, due to many different procedures and data formats
- Lack of permanence, due to use of proprietary tools & standards
- Inadequate input from linguists into the standards-setting enterprise

Result: Endangered Languages plus Endangered data

EMELD Components
- Catalog of language resources on the Internet
- Promotion of community consensus about best practice in:
  - Language identification
  - Resource description
  - Markup or annotation
- Showroom of Best Practice

Showroom of Best Practice
- Information on standards & software
- Query Room, where questions may be addressed to native speakers
- Texts and lexicons from 10 ELs marked up according to best practice

Languages
- Mocovi (Guaicuruan), 7,000 speakers [EMU]
- Biao Min (Mienic), 21,000 speakers [WSU]
- Ega (Kwa), 300 speakers [LDC]
- Cambap (Mambiloid), 30 speakers [LDC]
- Lakota (Macro-Siouan) [ELF]
- Tofa (Turkic) [ELF]
- Two from: Alamblak, Dadibi, Mapos Buang, Takaulu Kalagan, Tuwali Ifugao [SIL]
- Two from Post-Docs, as yet to be determined

OLAC & EMELD:
[Diagram labels: OLAC, Common Goals, EMELD]
Needed: Collaboration!

Components (OLAC-related)
1. Catalog of resources: OLAC Service Provider
2. Promotion of community consensus about best practice in:
   1. Resource description: OLAC metadata
   2. Markup: propose as OLAC best practice
   3. Language identification: Ethnologue/LINGUIST language codes proposed as OLAC best practice

LINGUIST = Gateway to information on best practice
LDC = Repository of Standards & Software
SIL = Vocabulary Server for Languages

LINGUIST = Gateway to Language Resources
Archive 1 | Archive 2 | Archive 3
LINGUIST = OLAC Service Provider
Data Provider 1 | Data Provider 2 | Data Provider 3
Key = Metadata

What you need to know to … Understand Metadata
Is it really important? Is it really as simple as it sounds?
Yes. Why??
a) Standardization is power (for Computers)
b) Standardization is hard (for People)

Metadata
- Data about data, e.g., cataloguing information
- Facilitates resource description, including summarization
- Enables search and retrieval

How LINGUIST will use Metadata
- Harvest metadata from OLAC archives
- Collect metadata from individual linguists
- Provide a searchable database of information (metadata) on:
  - Language data & documentation
  - Software & tools
  - Standards & formats
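
The workflow on this slide (harvest from archives, collect from individuals, search the merged set) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the record titles, the archive names, and the helpers build_catalog and search are hypothetical, and real harvesting would fetch records over the network rather than from in-memory lists. Subject.language is the element OLAC adds for the language a resource is about.

```python
# Hypothetical metadata, standing in for what a harvester would retrieve.
harvested = {
    "Archive 1": [{"Title": "Lexicon A", "Subject.language": "Ega"}],
    "Archive 2": [{"Title": "Texts B", "Subject.language": "Tofa"}],
}
# Hypothetical records contributed directly by individual linguists.
submitted = [{"Title": "Field notes C", "Subject.language": "Ega"}]


def build_catalog(harvested, submitted):
    """Merge harvested and submitted records, noting where each came from."""
    catalog = []
    for archive, records in harvested.items():
        for rec in records:
            catalog.append({**rec, "provider": archive})
    for rec in submitted:
        catalog.append({**rec, "provider": "individual"})
    return catalog


def search(catalog, element, value):
    """Return records whose given element matches value, ignoring case."""
    return [r for r in catalog if r.get(element, "").lower() == value.lower()]


catalog = build_catalog(harvested, submitted)
for hit in search(catalog, "Subject.language", "ega"):
    print(hit["provider"], "-", hit["Title"])
```

Tagging each record with its provider keeps the source archive visible in search results, which matters when the same gateway serves both harvested and individually submitted metadata.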

An Example
Derbyshire, Desmond C.
Topic continuity and OVS order in Hixkaryana
In Joel Sherzer and Greg Urban (eds.), Native South American discourse, Berlin: Mouton.
Word order

OLAC Metadata...
built on Dublin Core set of 15 elements:
Contributor, Coverage, Creator, Date, Description, Format, Identifier, Language, Publisher, Relation, Rights, Source, Subject, Title, Type

Added for Language Resources:
- Subject.language: A language the resource is about. E.g., a Grammar of Russian written in English has Subject.language = Russian
- Type.linguistic: The nature of the content from a linguistic point of view. E.g., transcription, annotation, description, lexicon
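
As a rough illustration of these elements, the Derbyshire example could be represented as a simple record. This is a minimal sketch in Python, not OLAC's actual record format: the mapping of the citation onto Creator, Title, Source, and Subject is an assumption, as are the values given for Subject.language and Type.linguistic, and the helper unknown_elements is hypothetical.

```python
# The 15 Dublin Core elements listed on the slide.
DUBLIN_CORE = {
    "Contributor", "Coverage", "Creator", "Date", "Description",
    "Format", "Identifier", "Language", "Publisher", "Relation",
    "Rights", "Source", "Subject", "Title", "Type",
}

# The two elements the slide adds for language resources.
OLAC_ADDED = {"Subject.language", "Type.linguistic"}

# Hypothetical record: the field assignments are assumed, not taken from the slides.
record = {
    "Creator": "Derbyshire, Desmond C.",
    "Title": "Topic continuity and OVS order in Hixkaryana",
    "Source": "In Joel Sherzer and Greg Urban (eds.), "
              "Native South American discourse, Berlin: Mouton.",
    "Subject": "Word order",
    "Subject.language": "Hixkaryana",  # assumed from the title
    "Type.linguistic": "description",  # assumed; one of the slide's example values
}


def unknown_elements(rec):
    """Return element names in rec that are neither Dublin Core nor OLAC additions."""
    return sorted(set(rec) - DUBLIN_CORE - OLAC_ADDED)


print(unknown_elements(record))  # empty list: every element name is recognized
```

A check like this is one way a gateway might reject records that use element names outside the agreed set before adding them to a searchable catalog.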

Important for LL Searching
Derbyshire, Desmond C.
Topic continuity and OVS order in Hixkaryana
In Joel Sherzer and Greg Urban (eds.), Native South American discourse, Berlin: Mouton.
Word order

What's been done so far:
- OLAC harvester on the LINGUIST site:
  - prototype:
- Language identification:
  - Code list for ancient languages, constructed languages, and language families to complement the Ethnologue code list
  - Everything on LINGUIST site (not just harvested metadata) categorized according to these codes: see Directory of Linguists

What needs to be added? ...to LINGUIST Gateway
- Advice about software, tools, formats
- User reviews of archives, software
- Look-up for:
  - Controlled vocabularies
  - OLAC best practice

What needs to be done? ...on Language Codes
- Mechanism ensuring community input into system
- Establishment of working group using OLAC process
- Promotion of code use among OLAC data providers

What needs to be done? ...on Markup
- Finish knowledge base for markup (U. of Arizona)
- Input needed from linguists:
  - sample annotation schemas
  - feedback on proposed KB content
  - contact Terry Langendoen

Outcome?
- Data Access
- Data Permanence
- Improved Accuracy of language representation