The LANCHART data format and search engine
Data formats in the LANCHART Project Recording (digitalization)wav-file Transcription: Transcriberwav-file & trs-file Analytical coding: Praatwav-file & TextGrid Searching & counting: The LANCHART search engineMySQL database transcription automatic conversion automatic import
Praat TextGrid
events (AMF) ortografi (AMF) The Praat TextGrid hej med dig host tier- group ortografi (XJM) events (XJM) hejsa tiergroup participant tier tier name
grammatik (AMF) ortografi (AMF) What a basic search engine does sådan noget man kan når det er ens farmors G AS DS SB RH R AN
grammatik (AMF) ortografi (AMF) kunne du ligge og dø hvor ingen opdagede det The job for the LANCHART search engine G AS DS SA RJ L FAO OB matchoverlapping match Ggr ordstil (AMF) genre Common tier
The LANCHART search engine A WebService JSP / Servlets + front-end JavaScript A Database Engine, MySQL Search engine: Highly normalized to eliminate redundancy Updated every night from Korpus
Support for multiple transcription & analysis formats Conversions are done using a XML-based `super’- format, so that new formats can be added by creating conversion programmes
Support for multiple transcription & analysis formats ’Superformat’ is XML-based allowing for XSL Transformations for conversion Programmed in Java for portability Super format CLAN/.Chat Praat/.TextGridTranscriber/.trs Other formats