Eureka! User friendly access to the MPI linguistic data archive Max Planck Institute for Psycholinguistics Alexander Koenig Jacquelijn Ringersma Claus.

Presentation transcript:

Eureka! User friendly access to the MPI linguistic data archive
Max Planck Institute for Psycholinguistics
Alexander Koenig, Jacquelijn Ringersma, Claus Zinn
Hypermedia workshop, 1 October 2009

The MPI linguistic archive
Misconception about archiving: "Your stuff is buried here and gone forever"

The MPI linguistic archive
– only 10 years since we started
– 300,000 audio, video & text resources
– 100,000 metadata descriptions
– 16.5 TB of data
– ...and still growing daily

The MPI linguistic archive
– MPI researchers' data
– DoBeS projects
– Corpora from other research institutions
  – Corpus Nederlandse Gebarentaal
  – European Science Foundation's Second Language Acquisition by Adult Immigrants
  – Dutch Bilingual Database

Diverse Sources - One Structure
– all data described using the IMDI metadata format
– central organisational concept: the session resource bundle
  – obligatory metadata
  – a set of annotations & transcriptions
  – a set of media data (video, audio, photo)
[Diagram: a session (S) bundling eye-tracking data, audio, video, photos, annotations, documents and field notes]
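The session resource bundle can be pictured as a small record type. The sketch below is only an illustration of the idea on this slide: the class name, field names and file names are assumptions made for the example and are not actual IMDI element names.

```python
from dataclasses import dataclass, field


@dataclass
class SessionBundle:
    """Hypothetical model of one session resource bundle (not the IMDI schema)."""
    name: str
    metadata: dict                                    # obligatory metadata description
    annotations: list = field(default_factory=list)   # annotation and transcription files
    media: list = field(default_factory=list)         # video, audio and photo files

    def __post_init__(self):
        # The slide calls the metadata obligatory, so an empty description is rejected.
        if not self.metadata:
            raise ValueError("a session bundle needs a metadata description")


# Illustrative values only; none of these files exist in the archive.
bundle = SessionBundle(
    name="example_session",
    metadata={"genre": "conversation", "country": "Brazil"},
    annotations=["example_session_annotations.xml"],
    media=["example_session.mpg", "example_session.wav"],
)
print(bundle.name, len(bundle.annotations), len(bundle.media))
```

Annotations and media are optional in this sketch, while the metadata is enforced at construction time, which mirrors the "obligatory metadata" point above.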

Diverse Sources - One Structure
– every resource and metadata description has a URID
– all data in principle available online
– complex access management system regulates who can access what

Accessing the Data
Access methods: TROVA, IMDI Browser, Web Portals, Metadata Search, Google Earth Overlay, ViCoS, Facetted Browsing, LEXUS

Accessing the Data
Access methods: TROVA, IMDI Browser, Web Portals, Metadata Search, Google Earth Overlay, ViCoS, Facetted Browsing, LEXUS
User groups: The General Public, Speech Communities, Researchers from other Domains, Linguistic Researchers / Depositors

Accessing the Data
The Traditional Method – The IMDI Browser

The Traditional Method – The IMDI Browser
– data is structured by researchers
– can be ordered by geographic location, by thematic subject, or by a combination of these

The Traditional Method – The IMDI Browser
– metadata is directly visible from the IMDI browser

The Traditional Method – The IMDI Browser
– resources can be viewed using ANNEX

Accessing the Data
More than just Browsing – Metadata and Content Search

More than just Browsing – Metadata and Content Search
– metadata is always open and can be searched

More than just Browsing – Metadata and Content Search
– if you have access to the resources, you can also search the annotations
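The two search levels can be sketched as one function that always looks at metadata but only looks at annotation content when the user has access to that session. This is a toy illustration: the data layout, the `permitted` set and the function name are assumptions, not the archive's actual search or access management API.

```python
def search(sessions, term, permitted):
    """Return (session name, hit level) pairs for a search term.

    Metadata is searched for every session; annotations are searched only
    for sessions listed in `permitted` (a hypothetical access list).
    """
    term = term.lower()
    hits = []
    for session in sessions:
        if any(term in str(value).lower() for value in session["metadata"].values()):
            hits.append((session["name"], "metadata"))
        elif session["name"] in permitted and any(
            term in line.lower() for line in session["annotations"]
        ):
            hits.append((session["name"], "annotation"))
    return hits


# Illustrative data only.
sessions = [
    {"name": "s1", "metadata": {"genre": "conversation"}, "annotations": ["the river is high"]},
    {"name": "s2", "metadata": {"genre": "narrative"}, "annotations": ["we crossed the river"]},
    {"name": "s3", "metadata": {"genre": "river story"}, "annotations": []},
]

# This user may only read the annotations of s1.
results = search(sessions, "river", permitted={"s1"})
print(results)
```

Session s2 also mentions the term in its annotations, but it does not show up because the user has no access to that resource.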

Accessing the Data
The Narrow-down Method – Facetted Browsing

The Narrow-down Method – Facetted Browsing
– facets are taken from IMDI metadata
– users can browse through data
  – user selects a facet (e.g. genre: conversation)
  – only matching sessions are shown
  – user selects another facet (e.g. country: Brazil)
  – an even smaller subset is shown
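The narrowing-down steps above can be sketched as repeated filtering of a session list. The session records and the function name are made up for the example; only the facet values (genre: conversation, country: Brazil) come from the slide.

```python
# Illustrative sessions; the facet names stand in for IMDI metadata fields.
sessions = [
    {"name": "s1", "genre": "conversation", "country": "Brazil"},
    {"name": "s2", "genre": "conversation", "country": "Peru"},
    {"name": "s3", "genre": "narrative", "country": "Brazil"},
]


def narrow_down(items, facet, value):
    """Keep only the sessions whose metadata matches the chosen facet value."""
    return [item for item in items if item.get(facet) == value]


# First selection: genre, then a second selection on the remaining sessions.
step1 = narrow_down(sessions, "genre", "conversation")
step2 = narrow_down(step1, "country", "Brazil")

print([s["name"] for s in step1])
print([s["name"] for s in step2])
```

Each selection works on the result of the previous one, so the visible subset can only stay the same size or shrink.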

The Narrow-down Method – Facetted Browsing

Accessing the Data
Easy as Geography – Google Earth Overlays

Easy as Geography – Google Earth Overlays
– geographic navigation seems like an obvious approach for novice users
– Google Earth is a popular, freely available tool
– KML format is widely used and easily convertible

Easy as Geography – Google Earth Overlays
– place marks for
  – linguistic archives
  – language sites
  – entry point for sets of resource bundles

Easy as Geography – Google Earth Overlays
– place marks can be enriched with introductory texts, photos and direct links to the MPI archive
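A place mark of this kind is a small piece of KML. The sketch below builds one with Python's standard library; the site name, coordinates and link are placeholders invented for the example, and the real overlays may be produced quite differently.

```python
import xml.etree.ElementTree as ET

KML_NS = "http://www.opengis.net/kml/2.2"


def make_placemark(name, description, lon, lat):
    """Build a KML Placemark; KML coordinates are written as longitude,latitude."""
    placemark = ET.Element("Placemark")
    ET.SubElement(placemark, "name").text = name
    ET.SubElement(placemark, "description").text = description
    point = ET.SubElement(placemark, "Point")
    ET.SubElement(point, "coordinates").text = f"{lon},{lat}"
    return placemark


kml = ET.Element("kml", xmlns=KML_NS)
document = ET.SubElement(kml, "Document")
document.append(
    make_placemark(
        name="Example language site",  # placeholder name
        description="Introductory text. See https://archive.example.org/site/1",  # placeholder link
        lon=5.86,
        lat=51.82,
    )
)

kml_text = ET.tostring(kml, encoding="unicode")
print(kml_text)
```

The description element is where an introductory text, a photo reference or a direct link into the archive would go.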

Accessing the Data
Back to the Communities – Community Web Portals

Back to the Communities – Community Web Portals
– attractive graphical design
– tailor-made for a specific language community
– open source CMS as back end
– communication with the MPI archive through REST interface
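How a portal back end might address the archive over REST can be sketched as plain URL construction. Everything specific here is an assumption made for illustration: the base URL, the `sessions` resource and the parameter names are invented and do not describe the archive's real interface.

```python
from urllib.parse import urlencode

# Hypothetical base URL; the real REST endpoint is not given in the slides.
BASE_URL = "https://archive.example.org/rest"


def build_request_url(resource, **params):
    """Build a GET URL for a resource, with query parameters in a stable order."""
    query = urlencode(sorted(params.items()))
    if query:
        return f"{BASE_URL}/{resource}?{query}"
    return f"{BASE_URL}/{resource}"


url = build_request_url("sessions", language="example-language", format="json")
print(url)
```

A CMS plug-in would fetch such a URL and render the response in the community's own page design; the fetch itself is left out so the sketch runs without network access.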

Back to the Communities – Community Web Portals

Accessing the Data
Word for word navigation - LEXUS

Word for word navigation - LEXUS
– web-based tool for creating & editing lexica
– complies with the LMF ISO standard but is still very flexible
– lexical entries can be enriched with multimedia (video, audio, photos)
– multimedia can simply be linked from the archive
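The idea of enriching a lexical entry by linking archived media, rather than copying it, can be sketched as follows. The class is a deliberately simplified stand-in: it is neither the LEXUS data model nor an LMF-conformant structure, and the archive references are placeholder strings.

```python
from dataclasses import dataclass, field


@dataclass
class LexicalEntry:
    """Simplified, hypothetical lexical entry (not the LEXUS or LMF model)."""
    lemma: str
    gloss: str
    media_links: list = field(default_factory=list)  # references to archived resources

    def link_media(self, archive_reference):
        # Store a reference to the archived file instead of a copy; skip duplicates.
        if archive_reference not in self.media_links:
            self.media_links.append(archive_reference)


entry = LexicalEntry(lemma="example-word", gloss="example gloss")
entry.link_media("archive:audio/example-word.wav")  # placeholder reference
entry.link_media("archive:photo/example-word.jpg")  # placeholder reference
entry.link_media("archive:audio/example-word.wav")  # ignored, already linked
print(entry.media_links)
```

Keeping only references means the media stays in one place in the archive, where its access rules continue to apply.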

Word for word navigation - LEXUS

Accessing the Data
Browsing Concepts - ViCoS

Browsing Concepts - ViCoS
– supplement to LEXUS
– also web-based
– users can link concepts
– a number of “universal” relation types are provided
– users can define their own culture-specific relations
– idea is to bring indigenous people onboard
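Linking concepts with a mix of provided and user-defined relation types can be sketched as a small labelled graph. The relation names and concepts below are invented for the example, and the class is not the ViCoS implementation.

```python
# Hypothetical stand-ins for the "universal" relation types mentioned above.
UNIVERSAL_RELATIONS = {"is_a", "part_of"}


class ConceptSpace:
    """Toy concept graph with extensible relation types."""

    def __init__(self):
        self.relation_types = set(UNIVERSAL_RELATIONS)
        self.links = []  # (source, relation, target) triples

    def define_relation(self, name):
        # Users can add their own culture-specific relation types.
        self.relation_types.add(name)

    def link(self, source, relation, target):
        if relation not in self.relation_types:
            raise ValueError(f"unknown relation type: {relation}")
        self.links.append((source, relation, target))

    def related(self, concept):
        """Return (relation, target) pairs that start at the given concept."""
        return [(rel, tgt) for src, rel, tgt in self.links if src == concept]


space = ConceptSpace()
space.link("canoe", "is_a", "boat")
space.define_relation("used_in_ceremony")  # user-defined relation
space.link("canoe", "used_in_ceremony", "harvest feast")
print(space.related("canoe"))
```

A relation has to be defined before it can be used, so the set of relation types itself documents which culture-specific links a community has introduced.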

Browsing Concepts - ViCoS

Conclusion
– diverse data in the MPI archive
– different user groups want access
– the (open) IMDI standard and REST interfaces make different ways of access possible
– MPI tries to create alternative access methods customized for specific groups

Archiving instance, Max Planck Institute for Psycholinguistics
– Archive managers: 3
– Archive developers: 2
– System manager: 1
– Archiving software development: 4
– Enrichment software development: 4
– Archive for language data: 40 Terabyte of data archived objects
