Visualizing Natural Language Resources Kristina Kocijan University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information and Communication Sciences Zagreb, Croatia
Sooo, what is this presentation about? Is it about beautiful pictures?
“Beauty is in the eye of the beholder.” 3rd century BC, Greek saying. Baudelaire’s beauty: data is beautiful if it is the result of reason and calculation. Thoreau’s beauty: data is beautiful by its very plainness.
Sooo, what is this presentation about? About beautiful pictures!
Sooo, that’s it – only beautiful pictures? New ways of presenting data?
“The hope is that, in not too many years, human brains and computing machines will be coupled together very tightly and that the resulting partnership will think as no human brain has ever thought and process data in a way not approached by the information-handling machines we know today.” J.C.R. Licklider, in ‘Man-Computer Symbiosis’, March 1960.
Reading the same data In different forms
Reading the same data Slowly, slowly, very slowly. Faster, alas lacking info.
[Table: nouns by type (Common, Collective, Proper) and gender (Fem, Mas, Neut, No gender), with Total per type and Total nouns]
Reading the same data Speedy, and empowering
Presenting the same data
Statistics for the nouns in a dictionary
Nouns            Common    Proper
Fem              %         3.61 %
Mas              %         4.64 %
Neut             %         0.12 %
No gender        0 %       4.17 %
Total per type   %         12.54 %
Total nouns
Statistics for the nouns in a corpus
Nouns            Common    Proper
Fem              %         5.05 %
Mas              %         5.07 %
Neut             %         0.10 %
No gender        0 %       57.80 %
Total per type   %         68.03 %
Total nouns
Distribution of top 10 paradigms
In DIC: ALAT, ASTRONOM, BLAGOST, BRATIĆ, CRTANJE, DAVOR, FABIANA, GUSJENICA, LEPTIR, MEDO
In Corp: ALAT, BAT, BESKRAJ, BLAGO, BLAGOST, BRATIĆ, CRTANJE, GUSJENICA, MEDO, PROLAZNIK
Genitive+sg endings In DIC In Corpus
Genitive+sg endings In Corpus
Genitive+sg endings - weighted In Corpus
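The slides do not say how the weighting is computed. One plausible reading, and it is only an assumption, is that the unweighted view counts each distinct genitive singular form once, while the weighted view counts every occurrence of that form in the corpus. A minimal Python sketch of that difference follows; the `forms` list, its frequencies, and the `ending_counts` helper are hypothetical and are not the data or method behind the slides.

```python
from collections import Counter

# Hypothetical sample: (genitive singular form, ending, corpus frequency).
# The frequencies are invented for illustration only.
forms = [
    ("kuće", "e", 12),
    ("grada", "a", 30),
    ("sela", "a", 5),
    ("stvari", "i", 3),
]


def ending_counts(forms, weighted=False):
    """Count genitive singular endings.

    Unweighted: each distinct form contributes 1 to its ending.
    Weighted: each form contributes its corpus frequency.
    """
    counts = Counter()
    for _form, ending, freq in forms:
        counts[ending] += freq if weighted else 1
    return counts


if __name__ == "__main__":
    print("unweighted:", dict(ending_counts(forms)))
    print("weighted:  ", dict(ending_counts(forms, weighted=True)))
```

Under this reading, an ending carried by a few very frequent nouns can dominate the weighted chart even if it looks modest in the unweighted one.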
Visual Story As told by Data
“Often the most effective way to describe, explore and summarize a set of numbers – even a very large set – is to look at pictures of those numbers.” Edward R. Tufte in ‘The Visual Display of Quantitative Information’, 2001.
Story behind the NLR data Instrumental Genitive Vocative Dative Accusative Locative
Story behind the NLR data
Thank you! Questions?