Information-Analytical System “Manuscript”: technologies and tools of creation of electronic collections of ancient and medieval documents Victor BARANOV.

Presentation transcript:

Information-Analytical System “Manuscript”: technologies and tools of creation of electronic collections of ancient and medieval documents. Victor BARANOV, Linguistics Department, Izhevsk State Technical University; Laboratory of Computer-Aided Philological Research, Udmurtia State University

Dagstuhl, December 2006, Digital Historical Corpora. Slide 2: Title page of the portal of IAS “Manuscript”

Slide 3: Model of hierarchies and subnets of manuscript and text units

Slide 4: Net of linguistic relationships. Diagram: the example sentence «се быша дроузи мои» decomposed into nested units. Unit types: Text; Predicate part; Syntactic group; Word-combination; Word-form. Relationship types: Co-ordination; Dependence. Legend: Relationship; Means of relationship; End of the “single” relationship; End of the “multiple” relationship

Slide 5: Model of the Manuscript system

Slide 6: Editor OldEd: main panels

Slide 7: Editor OldEd: text input and editing

Slide 8: Editor OldEd: fragmentation of the manuscript texts into units and relationships with the dictionary units. Labels: Dictionary of fragments; Properties of fragments; Fragments

Slide 9: Editor OldEd: visualization of unit relationships. Labels: Symbol; Geometric hierarchy: line, page; Linguistic hierarchy: word-form, normalized forms; Dictionary: lemma; Dictionary: word-forms of texts; Properties and values of the lemma
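The slide names two hierarchies that the same symbol belongs to at once (geometric: line, page; linguistic: word-form, normalized form). A minimal sketch of that idea follows. It is not the OldEd data model: the `Unit` class, the `attach` and `ancestors` helpers, and the hierarchy names "geometric" and "linguistic" are assumptions made for illustration only.

```python
from dataclasses import dataclass, field


@dataclass
class Unit:
    """A manuscript or text unit (hypothetical model, not the OldEd schema)."""
    kind: str    # e.g. "symbol", "line", "page", "word-form"
    value: str
    # One parent per named hierarchy, so a unit can sit in several trees.
    parents: dict[str, "Unit"] = field(default_factory=dict)


def attach(child: Unit, parent: Unit, hierarchy: str) -> None:
    """Place `child` under `parent` within the named hierarchy."""
    child.parents[hierarchy] = parent


def ancestors(unit: Unit, hierarchy: str) -> list[str]:
    """Walk upward through one hierarchy and return the kinds encountered."""
    chain = []
    while hierarchy in unit.parents:
        unit = unit.parents[hierarchy]
        chain.append(unit.kind)
    return chain


# Illustrative data: one symbol of the word-form "дроузи".
page = Unit("page", "1")
line = Unit("line", "1")
word_form = Unit("word-form", "дроузи")
normalized = Unit("normalized form", "дроузи")
symbol = Unit("symbol", "д")

attach(symbol, line, "geometric")
attach(line, page, "geometric")
attach(symbol, word_form, "linguistic")
attach(word_form, normalized, "linguistic")

geometric_path = ancestors(symbol, "geometric")
linguistic_path = ancestors(symbol, "linguistic")
```

Keeping one parent link per hierarchy name lets the two trees share their leaves without forcing either tree to nest inside the other.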

Slide 10: Editor OldEd: page layout

Slide 11: Result of creating the layout on the site. Label: Marginalia

Slide 12: Automated lemmatization and establishing relationships between words and lemmas

Slide 13: Electronic edition: search page. Labels: Search criteria; Collections & Manuscripts; Search result

Slide 14: Search result: word index and concordance
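For readers unfamiliar with the two result types named on this slide, the sketch below builds a word index (word-form to positions) and a keyword-in-context concordance from a tokenized text. It illustrates the general technique only. The function names, the whitespace tokenization, and the `width` parameter are assumptions, and nothing here reflects how the Manuscript system itself computes its results.

```python
from collections import defaultdict


def word_index(tokens):
    """Map each word-form to the list of positions where it occurs."""
    index = defaultdict(list)
    for pos, tok in enumerate(tokens):
        index[tok].append(pos)
    return dict(index)


def concordance(tokens, word, width=2):
    """Return (left context, word, right context) for every occurrence."""
    lines = []
    for pos in word_index(tokens).get(word, []):
        left = " ".join(tokens[max(0, pos - width):pos])
        right = " ".join(tokens[pos + 1:pos + 1 + width])
        lines.append((left, word, right))
    return lines


# The example sentence from the slide on linguistic relationships,
# split on whitespace (an assumption; manuscripts may lack word spacing).
tokens = "се быша дроузи мои".split()
index = word_index(tokens)
kwic = concordance(tokens, "дроузи", width=1)
```

A word that does not occur simply yields an empty concordance, since the lookup falls back to an empty position list.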

Slide 15: Module of retrievals: selection of the text

Slide 16: Module of retrievals: selection of the unit

Slide 17: Module of retrievals: setting the unit properties and values

Slide 18: Module of retrievals: saving the query

Slide 19: Module of retrievals: specifying the composition of the query result

Slide 20: Comparative index of the word forms

Slide 21: Comparative index of the fragments

Slide 22: Grammar dictionaries. Diagram labels: Grammar dictionary of the modern Russian language; Grammar dictionary of the Old Russian language; Grammar dictionary of the Old Slavonic language; Grammar dictionary pseudo-elements; Texts 1 to 6 and Text N

Slide 23: Grammar dictionaries: retrieval form

Slide 24: Grammar dictionaries: reducing the Old Russian word-forms to the lemma

Slide 25: Grammar dictionaries: obtaining the paradigm of a lemma

Slide 26: Electronic editions

Slide 27: Electronic edition: reverse index of word-forms and context
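A reverse (a tergo) index lists word-forms ordered by their endings rather than their beginnings, which groups forms with the same inflection together. The sketch below shows the general technique under stated assumptions: `reverse_index` is a hypothetical helper, ordering is by raw Unicode code point rather than a proper Old Cyrillic collation, and reversal is per code point, so forms carrying combining diacritics would need normalization first.

```python
def reverse_index(word_forms):
    """Sort unique word-forms by their spelling read right to left."""
    return sorted(set(word_forms), key=lambda w: w[::-1])


# Word-forms of the example sentence used earlier in the slides.
forms = ["се", "быша", "дроузи", "мои"]
a_tergo = reverse_index(forms)
```

With this input the two forms ending in "и" end up adjacent, which is the property that makes such an index useful for studying inflection.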

Slide 28: Acknowledgment. The work on the creation of IRS Manuscript is being carried out with support from the Russian Foundation for Basic Research (Grant # в). The work on the creation of the automated morphological analyzer is being carried out with the support of the Russian Foundation for the Humanities (Grant # в).

Slide 29: Contacts. Victor Baranov, Laboratory of Computer-Aided Philological Research, Udmurtia State University; Linguistics Department, Izhevsk State Technical University, Izhevsk, Russia