Uralic multimedia corpora: ISO/TEI corpus data in the project INEL

Slides:

Advertisements

Similar presentations

Preservation by Migration to XML Dirk Roorda. work on a preservation strategy positioning of the XML preservation strategy implementing the strategy in.

Advertisements

ECMA Open XML File Formats and the Evolution of Open File Formats Mark Lange Senior Policy Counsel Microsoft EMEA.

IAC (ACCESS INTERFACE CORPUS) DEVELOPED BY BARCELONA MEDIA & UNIVERSITAT POMPEU FABRA TONI BADIA (BARCELONA MEDIA - UNIVERSITAT POMPEU FABRA) JUDITH DOMINGO.

Mitglied der Leibniz-Gemeinschaft Querying Spoken Language Corpora Thomas Schmidt IDS Mannheim.

Coursework.  5 groups of 4-5 students  2 project options  Full project specifications on 3 rd March  Final deadline 10 th May 2011  Code storage.

Concepts & Techniques for Accessible, Closed Captioned Web-Based Video 10th Annual Accessing Higher Ground: Accessible Media, Web and Technology Conference.

Let’s Get GUI! Understanding the Windows ® Graphical User Interface © 2006 by Ted Altenberg

SBahn Database Management Tool Georgi Cholakov, Sam Joachim University of Plovdiv “Paisii Hilendarski”, e-Commerce Laboratory in cooperation with Humboldt.

1 PROJECT Web-based Database Applications Lecture 1: Basic Internet Concepts & Databases - the History.

Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.

Database „Multilingualism“ – Perspectives for collaborative corpus construction and collaborative commentary Thomas Schmidt Sonderforschungsbereich 538.

1 A Manager’s Guide to Converting XML to Structured FrameMaker Doug Martin.

Object Linking and Embedding A tool which allows different software application packages to share data.

Software and Multimedia

Mindmap Converter. Caveats ADL will evolve CIMI RM will evolve Mindmap requirements will evolve.

CLARIN tools for workflows Overview. Objective of this document  Determine which are the responsibilities of the different components of CLARIN workflows.

Authors: RIEFOLO Anthony FANCHETTE Edouard Tutors : BETBEDER Marie-Laure REFFAY Christophe MULCE project Interface to display XML objects 1.

Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,

Customized cloud platform for computing on your terms !

About Me 4 th Internship UT Austin Computer Science.

Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

1 e-Research for Linguists Dorothee Beermann & Pavel Mihaylov NTNU, Trondheim, Norway and Ontotext, Sophia, Bulgaria.

Presented by Team D Compare Windows 2000, XP, and.NET By John Leonard, Brian North, Jeffrey Reynolds, Todd Saylor.

Eureka! User friendly access to the MPI linguistic data archive Max Planck Institute for Psycholinguistics Alexander Koenig Jacquelijn Ringersma Claus.

PCWG Analysis Tool Peter Stuart September 15, 2015.

WordFreak A Language Independent, Extensible Annotation Tool.

Confidential, I.R.I.S. © 2005, All rights reserved I.R.I.S. new OCR Software suite: A full range for document conversion, for private and corporate users.

2XML Marko Tadić Department of linguistics, Faculty of philosophy, University of Zagreb ( Tübingen,

Current Situation and CI Requirements OOI CyberInfrastructure Science User Requirements Workshop: San Diego January 23-24, 2008.

The New Internet and the Classroom Cool Tools For Teachers.

Tao Huang, Shrideep Pallickara, Geoffrey Fox Community Grids Lab Indiana University, Bloomington {taohuang, spallick,

Using Google Docs. Objectives Google Docs overview Create G-mail accounts – DO NOT use personal accounts Google Doc Interface Spreadsheet/Form overview.

Transcripts are stored in a relational database Transcripts are divided up to their smallest constituent (words), while the context is preserved, in a.

CHAPTER 15 WPF Windows Presentation Foundation Dr. John Abraham Professor, UTPA.

AnCoraPipe: A tool for multilevel annotation Manu Bertran, Bàrbara Soriano, Oriol Borrega, Marta Recasens Universitat de Barcelona CBA 2008.

Agility with Services – The eBay Way

Technical Communication A Practical Approach Chapter 14: Web Pages and Writing for the Web William Sanborn Pfeiffer Kaye Adkins.

ELECTRAAdvantages ELECTRA Advantages Intuitive workflow Electra workflow consistently follows standard Civil engineering design process which intuitively.

GL15 Grey Literature Bratislava 2-3 december 2013 Industrial Philology: problems and techniques of data and archives preservation for future generations.

Data Organization Quality Assurance and Transformations.

Grades 6-8 iSquad Get Going with Zamzar. Focusing Questions How can I convert files to different formats to make it possible to use with my software?

GDML “Geometry Description Markup Language” by Daniele Francesco Kruse University of Rome “Tor Vergata” European Organization for Nuclear Research.

Unity Application Generator How Can I… Import control modules (Instrument list) from PID Into the UAG.

By Anne Nattembo Using Social Media to Increase Access to Sexual Reproductive Health Information among Young People in Uganda.

1 Dr. Cord Pagenstecher Testimonies on Nazi Forced Labor and the Holocaust Building Digital Environments for Research and Education Dr. Cord Pagenstecher.

BRAT: a web based tool for manual annotation Hans Paulussen ITEC, KU Leuven KULAK.

A SCRIPT FOR ARCHIVING DIGITAL RESEARCH DATA IMPROVING ACCURACY AND EFFICIENCY IN THE DATAVERSE NETWORK ABSTRACT SUMMARY Rachel Carriere, Thu-Mai Christian,

Instructional Design Center Creating PDF Files Using Microsoft Word.

Name/Title of Your App Prepared by: …… For the 5 th National ICT Innovation Competition.

Import Live Mail Contacts to Outlook Get Live Mail Contacts converter solution to recover live.

How to Apply PDF in Flipbook on Website. Description If you are finding solution for applying PDF in flipbook mode on website, and adding multimedia items.

Topic Map & SMIL Prototypes KUL-ESAT-DOCARCH

CONCLUSION REFERENCES METHODOLOGY PRELIMINARY FINDINGS BACKGROUND

DATA INTEGRATION FOR LANGUAGE DOCUMENTATION

Hardware and Software Hardware refers to the physical devices of the computer system e.g. monitor, keyboard, printer, RAM etc. Software is a set of programs,

Ford Foundation International Fellowship Program Records

Power Hour April 2011 DITA and ePublisher

DivaServices-Spotlight

CFS Community Day Core Flight System Command and Data Dictionary Utility December 4, 2017 NASA JSC/Kevin McCluney December 4, 2017.

Software and Multimedia

Software and Multimedia

دانشگاه شهیدرجایی تهران

تعهدات مشتری در کنوانسیون بیع بین المللی

Power Hour October 2013 Extending Styles Adding properties and options

Using GOLD to Tracking L2 Development

Corpora of social media in minority Uralic languages

Written By: Daniel Ontiveros

AI Discovery Template IBM Cloud Architecture Center

MULTIMEDIA SYSTEMS Dr S.ARUNA/III BCA-C/MULTIMEDIA SYSTEMS/IMAGES,COLOR,IMAGE FILE FORMATS.

Presentation transcript:

Uralic multimedia corpora: ISO/TEI corpus data in the project INEL Timofey Arkhangelskiy Universität Hamburg / Alexander von Humboldt Foundation timarkh@gmail.com Anne Ferger Universität Hamburg anne.ferger@uni-hamburg.de Hanna Hedeland hanna.hedeland@uni-hamburg.de

INEL Long-term documentation project at Hamburg, currently corpora of Selkup, Kamas and Dolgan are being prepared Spoken corpora (+ archival transcriptions) All annotated data stored and edited in EXMARaLDA (time-aligned XML format + GUI) Our goal is (a) long-term preservation of the data; (b) providing easy access to corpora through an online user interface

EXMARaLDA > ISO/TEI > tsakorpus We transform EXMARaLDA data to the XML based on the ISO/TEI standard (good for long- term preservation) We use the Tsakorpus corpus platform for online access ISO/TEI files are converted to Tsakorpus JSON The pipeline is applicable to other spoken corpora hosted at Hamburg Center for Language Corpora

Disclaimers (from all of us) INEL-internal data handling (tools, glossing strategies, choice of EXMARaLDA etc.) is outside the scope of our presentation (from me personally) I am only responsible for the ISO/TEI > Tsakorpus conversion and do not participate in INEL

Thank you for your attention!