e-Research for Linguists. Dorothee Beermann & Pavel Mihaylov, NTNU, Trondheim, Norway and Ontotext, Sofia, Bulgaria.

Presentation transcript:

1 e-Research for Linguists. Dorothee Beermann & Pavel Mihaylov, NTNU, Trondheim, Norway and Ontotext, Sofia, Bulgaria

2 Product description: online service
For: Language Studies in the Humanities, Language Science and Teaching
Users: Linguists, Language Teachers, Anthropologists
Create, store, retrieve, share Interlinear Glossed Text
* Interlinear Glosser
* Repository of Interlinear Glossed Text (IGT)
* Collaborative Editing

3 Schematic representation of TypeCraft architecture and functions
[Figure: architecture diagram. Components: TC-database, TC java-server, TCwiki, Apache. Functions: manage users, manage data access, data creation/retrieval, system administration, archiving, XML export.]
Based on:

4 One important user group: African Linguists (University of Ghana, Legon)
NO CORPORA → create language resources
LITTLE BOOKS AVAILABLE → make them accessible to others
EDUCATIONAL POLICY → draw attention to my language
NO PUBLICATION CHANNELS → make my work available
- Medadi Erisa Ssentanda

5 Two years for a master in Linguistics! “Recently linguistic data has come under scrutiny. Researchers from different linguistic fields have questioned its validity, and the integrity of theories that “are built” on this data.” Interlinear Glossed Text - the root of all linguistic research -

6 TypeCraft Storage and Data Model
TC uses a PostgreSQL database for data storage. The mapping between Java objects and database tables is managed by Hibernate, so TC is not bound to any specific SQL database.
TypeCraft data can be divided into two types:
Common data: POS tags, gloss tags, global tags, ISO languages. These are shared between all annotated tokens and all users.
Individual data: texts, phrases, words and morphemes, together with their annotation. This data is specific to each user.
Individual data items reference common data items.
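The split between common and individual data can be illustrated with a minimal sketch in plain Java. This is not the TypeCraft source: all class, field and method names below (`IgtModelSketch`, `GlossTag`, `glossLine`, and so on) are hypothetical, and the Hibernate mapping is left out so the example runs without dependencies. It only shows how per-user items (text, phrase, word, morpheme) could reference shared tag objects rather than copy them.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of the common/individual data split described above.
// Names are illustrative and are not taken from the TypeCraft code base.
// In a Hibernate setup each class would be mapped to a table; that mapping
// is omitted here to keep the example self-contained.
public class IgtModelSketch {

    // Common data: one shared instance per tag, referenced by many items.
    static final class GlossTag {
        final String label;
        GlossTag(String label) { this.label = label; }
    }

    static final class PosTag {
        final String label;
        PosTag(String label) { this.label = label; }
    }

    // Individual data: belongs to one user's text.
    static final class Morpheme {
        final String form;
        final String meaning;                 // free-text gloss, may be null
        final List<GlossTag> tags = new ArrayList<>();
        Morpheme(String form, String meaning) { this.form = form; this.meaning = meaning; }
    }

    static final class Word {
        final String form;
        final PosTag pos;                     // reference to common data
        final List<Morpheme> morphemes = new ArrayList<>();
        Word(String form, PosTag pos) { this.form = form; this.pos = pos; }
    }

    static final class Phrase {
        final String original;
        final List<Word> words = new ArrayList<>();
        Phrase(String original) { this.original = original; }
    }

    static final class Text {
        final String title;
        final String isoLanguage;             // ISO language code (common data)
        final String owner;                   // the user this text belongs to
        final List<Phrase> phrases = new ArrayList<>();
        Text(String title, String isoLanguage, String owner) {
            this.title = title; this.isoLanguage = isoLanguage; this.owner = owner;
        }
    }

    // Builds the gloss line of a phrase: morphemes joined by "-", words by " ".
    static String glossLine(Phrase phrase) {
        List<String> words = new ArrayList<>();
        for (Word w : phrase.words) {
            List<String> parts = new ArrayList<>();
            for (Morpheme m : w.morphemes) {
                List<String> labels = new ArrayList<>();
                if (m.meaning != null) labels.add(m.meaning);
                for (GlossTag t : m.tags) labels.add(t.label);
                parts.add(String.join(".", labels));
            }
            words.add(String.join("-", parts));
        }
        return String.join(" ", words);
    }

    // Small English example so that no language data has to be invented.
    static Phrase examplePhrase() {
        GlossTag pl = new GlossTag("PL");
        GlossTag past = new GlossTag("PAST");
        PosTag noun = new PosTag("N");
        PosTag verb = new PosTag("V");

        Word dogs = new Word("dogs", noun);
        dogs.morphemes.add(new Morpheme("dog", "dog"));
        Morpheme s = new Morpheme("s", null);
        s.tags.add(pl);
        dogs.morphemes.add(s);

        Word walked = new Word("walked", verb);
        walked.morphemes.add(new Morpheme("walk", "walk"));
        Morpheme ed = new Morpheme("ed", null);
        ed.tags.add(past);
        walked.morphemes.add(ed);

        Phrase phrase = new Phrase("dogs walked");
        phrase.words.add(dogs);
        phrase.words.add(walked);
        return phrase;
    }

    public static void main(String[] args) {
        Text text = new Text("Demo", "eng", "demo-user");
        text.phrases.add(examplePhrase());
        System.out.println(glossLine(text.phrases.get(0)));
    }
}
```

Because tags are held by reference, renaming or correcting a shared tag would affect every annotation that points to it, which is one plausible reason for keeping common data separate from user data.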

7 Interlinear Glossed Text Brokerage

8

9 There are different ways of data sharing!
Sharing can be done by:
Archiving in one of the specialised institutional centers, such as
Some funders might require researchers to deposit their data in an archive managed by the funding institution. Advantages of centralised data centers are better control over standards and data sharing policy, and perhaps better data quality.
Alternative: Self-archiving as part of a shared research infrastructure
+ openness, transparency, flexibility, real-time data sharing
= safe-keeping, long-term preservation, data accessibility
- danger of reduced data quality