Aug 2-5, 2002 EMELD Workshop 2002 1 Overview & Update Helen Aristar Dry The LINGUIST List & Eastern Michigan University EMELD Workshop on The Digitization.

Slides:



Advertisements
Similar presentations
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Advertisements

Dublin Core for Digital Video: Overview of the ViDe Application Profile.
OLAC Metadata Steven Bird University of Melbourne / University of Pennsylvania OLAC Workshop 10 December 2002.
Accessing Distributed Resources Information: An OLAC perspective Steven Bird Gary Simons Chu-Ren Huang Melbourne SIL Academia Sinica ENABLER/ELSNET Workshop.
The Seven Pillars of Open Language Archiving: A Vision Statement Gary Simons and Steven Bird Workshop on Web-based Language Documentation and Description.
Outreach Jeff Good UC Berkeley. OLAC's Needs Maximal involvement from the whole community –The more data providers involved the more useful the services.
White Paper on Establishing an Infrastructure for Open Language Archiving Steven Bird and Gary Simons.
The Open Language Archives Community: Building a worldwide library of digital language resources Gary Simons, SIL International LSA Tutorial on Archiving.
Jan 7, 2005 Linguistic Society of America 2005 Annual Meeting, Oakland, CA The E-MELD Project: Helen Aristar Dry The LINGUIST List Eastern Michigan University.
An Overview of OLAC: The Open Language Archives Community Gary Simons and Steven Bird Workshop on The Digitization of Language Data: The Need for Standards.
The OLAC Metadata Set Gary Simons Workshop on The Digitization of Language Data: The Need for Standards June 2001.
Getting Involved in OLAC Steven Bird University of Pennsylvania LREC Symposium: The Open Language Archives Community 29 May 2002.
Getting Involved in OLAC Steven Bird University of Pennsylvania LSA Symposium: The Open Language Archives Community 4 January 2002.
Helen Dry & Anthony Aristar LINGUIST List: LREC Symposium: The Open Language Archives Community 29 May 2002http://linguistlist.org.
The Seven Pillars of Open Language Archiving: Introducing the OLAC Vision Gary Simons SIL International LREC Symposium: The Open Language Archives Community.
Helen Dry & Anthony Aristar LINGUIST List: LSA Symposium: The Open Language Archives Community 4 January 2002http://linguistlist.org.
The Seven Pillars of Open Language Archiving: Introducing the OLAC Vision Gary Simons SIL International LSA Symposium: The Open Language Archives Community.
1 Demystifying metadata Ann Chapman UKOLN University of Bath UKOLN is funded by Resource: The Council for Museums, Archives and Libraries, the Joint Information.
LIFTing LEGO with RELISH: Lexicon Interchange FormaT in Use Helen Aristar-Dry Institute for Language Information and Technology Eastern Michigan U.
EuroCRIS Best Practices & Solutions Members Helping Members Move Forward.
The LEGO Project Brent Miller, The LINGUIST List.
1 Uppsala University Library Eva Müller Peter Hansson Stefan Andersson Uwe Klosa Electronic Publishing Centre Krister Östlund Waller project.
Reusable!? Or why DDI 3.0 contains a recycling bin.
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
Metadata Standards Anita Coleman, Asst. Prof. School of Information Resources & Library Science, University of Arizona, Tucson.
What Linguists Want (we think) Helen Aristar Dry & Anthony Aristar LINGUIST List & E-MELD.
The Rosetta Project Digital Language Archive Laura Buszard-Welcher The Long Now Foundation / University of California, Berkeley.
Digital Encoding What’s behind E-text Resources?.
Guest Lecture LIS 656, Spring 2011 Kathryn Lybarger.
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
Z39.50, XML & RDF Applications ZIG Tutorial January 2000 Poul Henrik Jørgensen, Danish Bibliographic Centre,
July 11, 2003E-MELD 2003 E-MELD “School” of Best Practice Helen Aristar-Dry & Gayathri Sriram The LINGUIST List Eastern Michigan University.
Sharing and Browsing Linguistic Data EMELD Arizona: Terry Langendoen Scott Farrar.
Resource Discovery (metadata and searching) Working Group Report.
Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,
Highlights of Main Activities in China Hou Huiqun INIS LO for China Director of CINIE 1.
E-Meld Workshop on Digitization of lexical Information 3-5 August 2002, EMU, Ypsilanti Working Group on Lexicon Macrostructures Chairman’s Report Dafydd.
1 © Netskills Quality Internet Training, University of Newcastle Metadata Explained © Netskills, Quality Internet Training.
June 20, 2006E-MELD 2006, MSU1 Toward Implementation of Best Practice: Anthony Aristar, Wayne State University Other E-MELD Outcomes.
Organizing Internet Resources OCLC’s Internet Cataloging Project -- funded by the Department of Education -- from October 1, 1994 to March 31, 1996.
AILLA:The Archive of the Indigenous Languages of Latin America Heidi Johnson / The University of Texas at Austin.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Metadata Helen Aristar Dry Eastern Michigan University LINGUIST List.
A Common Ontology for Linguistic Concepts Scott Farrar University of Arizona.
Nov 21, 2005University of Texas at Austin The E-MELD Project Helen Aristar Dry & Anthony Aristar The LINGUIST List Eastern Michigan U & Wayne State U.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
ALA Institutional Repository Update ALA Archives at the University of Illinois Urbana-Champaign Chris Prom Cara Bertram Denise Rayman.
Search Interoperability, OAI, and Metadata Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond Grainger Engineering Library April.
1 Understanding Cataloging with DLESE Metadata Karon Kelly Katy Ginger Holly Devaul
A centre of expertise in digital information managementwww.ukoln.ac.uk DCMI Affiliates: Implications for Institutions Rosemary Russell UKOLN University.
Digital Library The networked collections of digital text, documents, images, sounds, scientific data, and software that are the core of today’s Internet.
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA The School of Best Practice How Standards can Matter Anthony Aristar, Wayne State University.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Discovering libraries’ gold through collection-level descriptions ELAG 2014, Bath Valentine Charles Data specialist.
The Open Archives Initiative and the Sheet Music Consortium Jon Dunn, Jenn Riley IU Digital Library Program October 10, 2003.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
Metadata-based Discovery: Experience in Crystallography UKOLN is supported by: Monica Duke UKOLN, University of Bath, UK A centre of.
July 1-3, 2005 E-MELD 2005 Ontologies in Linguistic Annotation 1 The GOLD Effort So Far Terry Langendoen Brian Fitzsimons Emily Kidder Department of Linguistics.
Santi Thompson - Metadata Coordinator Annie Wu - Head, Metadata and Bibliographic Services 2013 TCDL Conference Austin, TX.
Markup of Educational Content
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Heidi Johnson The University of Texas at Austin
Cataloging the Internet
Overview Ideas Other Stuff
Presentation transcript:

Aug 2-5, 2002 EMELD Workshop Overview & Update Helen Aristar Dry The LINGUIST List & Eastern Michigan University EMELD Workshop on The Digitization of Lexical Data Aug. 2-5, 2002

Aug 2-5, 2002 EMELD Workshop What Is E-Meld? “Electronic Metastructure for Endangered Languages Data”  5 year collaborative project, begun Sept  Participants:  The LINGUIST List (Eastern Michigan U., Wayne State U., U. of Arizona)  The Linguistic Data Consortium (University of Pennsylvania)  The Endangered Languages Fund (Yale University, Haskins Laboratories)  Funded by NSF

Aug 2-5, 2002 EMELD Workshop The LINGUIST List 16,500 subscribers 106 different countries 4 European mirror sites: Tübingen | Stockholm Edinburgh | Moscow

Aug 2-5, 2002 EMELD Workshop  …the preservation of Endangered Languages data and documentation  …the development of infrastructure for linguistic archives To aid in … Objectives

Aug 2-5, 2002 EMELD Workshop Components  Metadata server facilitating access to language resources  Promulgation of best practice in:  Language identification  Resource description  Markup or annotation  Involvement of linguistic community in deciding best practice  Query Room, where questions can be addressed to native speakers  Demonstration project: texts and lexicons from 10 EL’s marked up according to best practice

Aug 2-5, 2002 EMELD Workshop Languages Mocovi (Guaicuruan) 7000 speakers [Grondona] Biao Min (Mienic) 21,000 speakers [Solnit] Ega (Kwa) 300 speakers [Gibbon, Connell Cambap (Mambiloid) 30 speakers [Connell] Lakota (Macro-Siouan) [Whalen] Tofa (Turkic) [Harrison]  Two from: Alamblak, Dadibi, Mapos Buang, Takaulu Kalagan, Tuwali Ifugao - [SIL]  Two from Post-Docs as yet to be determined.

Aug 2-5, 2002 EMELD Workshop Outreach  Workshops  2001 – Santa Barbara, CA:  focus: metadata, markup, language codes  2002 – Ann Arbor/Ypsilanti, MI  focus: lexicon markup & metadata  2003, 2004: workshops  2005, 2006: “digital institutes”

Aug 2-5, 2002 EMELD Workshop Project Emphasis: Breadth  Widest access to information  Web-based tools  Open standards  Simple interfaces

Aug 2-5, 2002 EMELD Workshop Progress  Metadata Collection:  Search facility  Metadata editor  Language Identification  Query Room  Markup Ontology (U. of Arizona) ORE Ethnologue + LL CodesEthnologue + LL Codes: used throughout LL site OLAC Service Provider (ELF & Rosetta)

Aug 2-5, 2002 EMELD Workshop Markup  Focus: morphosyntactic markup  Objective: a system which allows:  Field workers to submit data in different markups  Searcher to retrieve all relevant data despite varying markups  No “gold standard” in linguistic markup  Instead: ontology to serve as “interlanguage” for translation among markups

Aug 2-5, 2002 EMELD Workshop Markup  Tool to translate common markup formats (RDF, Shoebox, Word) into XML  Tool to help linguist identify aspects of markup with concepts in the ontology  More on this today from Langendoen, Lewis, and Farrar

Aug 2-5, 2002 EMELD Workshop Data Input Tool  Web-based Web-based  Potentially portable  Creates database input– to be output as xml  Can be customized to fit individual language  More on this tomorrow from Martha Ratliff & Zhenwei Chen

Aug 2-5, 2002 EMELD Workshop Affiliation w/OLAC  Resource identification  OLAC Service Provider  OLAC = Open Language Archives Community  Part of Open Archives Initiative  Multi-disciplinary initiative to promote multi-archive searching via http protocols

Aug 2-5, 2002 EMELD Workshop OLAC Metadata Set  Contributor  Coverage  Creator  Date  Description  Format  Identifier  Language  Publisher  Relation  Rights  Source  Subject  Title  Type Based on Dublin Core Set of 15 Elements With 2 refinements Subject.language Type.linguistic Type.linguistic: Draft of controlled vocabulary

Aug 2-5, 2002 EMELD Workshop Data Provider 2: Individual Data Provider 3 (Archive) OLAC Service Provider http: GET or POST Data Provider (Archive) Metadata LINGUIST List Data Provider 2: Individual

Aug 2-5, 2002 EMELD Workshop On LINGUIST  OLAC Search:  18 archives, 30,000+ records  Metadata Editor (ORE):  Form-based editor  Creates OLAC metadata in xml  Makes it available to OLAC search engine  Language Lookup: