Presentation is loading. Please wait.

Presentation is loading. Please wait.

Speech data in Swedish national archives and government agencies Jens Edlund, KTH Royal Institute of Technology Dept. of Speech, Music and Hearing.

Similar presentations


Presentation on theme: "Speech data in Swedish national archives and government agencies Jens Edlund, KTH Royal Institute of Technology Dept. of Speech, Music and Hearing."— Presentation transcript:

1 Speech data in Swedish national archives and government agencies Jens Edlund, KTH Royal Institute of Technology Dept. of Speech, Music and Hearing

2 About me Full time researcher Mainly human face-to-face interaction (humanities) so in everyday life, a CLARIN user But also Speech technology (technology/computer science) And for the purposes of this talk, a CLARIN representative

3 About KTH Speech, Music and Hearing A CLARIN K-centre “a centre of expertise providing an information service offering technical advice on speech analysis” A research institution One of the oldest speech labs in the world (founded by Gunnar Fant in 1951)

4 Structure of this talk Background Speech in general Speech in Swedish archives and agencies CLARIN obstacles (with a speech focus) Lack of collaborations with “users” Lack of suitable analyses and methods

5 Speech vs writing Speech is often, but not always perceived as a special case of writing Speech is consistently treated like a special case of writing But Speech predates writing Speech is the most commonly occurring form of language Writing is a special case of speech? In practice, there are many similarities, but the differences are huge SpeechWriting

6 Some salient differences Speech is transient It exists only in the present This is true, in a sense, even if recorded Speech is largely interactive and emergent It is created, edited, and undrstodd dynamically This is true for read speech as well

7 Speech in Swedish archives and agencies Rough inventory performed in 2015 Direct interviews with ~25 data holders, indirect with another ~25 Key finding There is a lot of materials around Nobody uses these materials – at all Virtually none of it can be made fully public (easily) Obstacles IP rights is just the beginning (other legislation) Lack of responsible people Lack of descriptions Sheer size

8 How to find users To start with, don’t call them “users”? Start with existing methodologies and the needs that come out of this Autumn 2015 workshop in Stockholm Discussions in “triplets”: a researcher, a data holder, and a language tech representative Resultet in several pretty mature project ideas A large project application combining three research tracks oral history, language change, human behaviour in interaction (under review)

9 How to find users (2) Interdisciplinary themes Similar to the CLARIN + efforts Initial theme for SWE-CLARN: food Enormous interest from a wide range of researchers We only just started…

10 How to analyse archive speech “Turn it into text” Automatic transcriptions Manual correction Annotation Text storage And dig the audio back down again…

11 But… Current methods are not designed for this type of speech Text is not speech Different studies call for different analyses There is a very real danger in standardizing too soon

12 Analysis as an iterative process Automatic transcription Produces (erroneous) machine transcriptions Manual correction Produces (correct) manual transcription So we have the sound, a negative example, and a positive example This is very good for training Take home message: don’t throw away materials that can be used to improve the automatic methods

13 Data for speech analysis development For example Parametrized speech Statistics Word (and n-gram) frequencies Pronunciation variation … Huge amounts are needed Expensive to get, even with data access Suggestion: build in data taps in various processes

14 Tapping training data from processes Example 0: Automatic transcription Example 1: Transliteration Example 2: Digitization of text Example 3: Digitization of speech

15 Thank you! Questions (or feel free to ask me in the breaks)


Download ppt "Speech data in Swedish national archives and government agencies Jens Edlund, KTH Royal Institute of Technology Dept. of Speech, Music and Hearing."

Similar presentations


Ads by Google