Digital Humanities at the University of Pisa
Alessandro Lenci
CoLing Lab – Laboratorio di Linguistica Computazionale, Università di Pisa
Aix-Marseille Université, 26 November 2015
Main research areas (not an exhaustive list…)
Natural Language Processing (NLP)
  » lexicons, term extraction, annotated corpora, NLP tools, etc.
  » NLP and cognitive science
  » NLP applied to DH: legal text processing, computational dialectology, historical text processing, etc.
Digital philology
  » Greek and Latin processing
  » digital text and manuscript encoding and visualization
  » digital epigraphy
Big data analysis, social network analysis, etc.
3D visualization and reconstruction applied to historical and archeological research
Development of online databases for the humanities
  » literature, linguistics, history, archeology, etc.
Academic and research institutions involved in Digital Humanities in Pisa
Università di Pisa
  Department of Philology, Literature and Linguistics
    » Computational Linguistics Lab (colinglab.fileli.unipi.it/)
    » Phonetics Lab ( ml)
    » Digital Culture Lab (labcd.humnet.unipi.it/)
  Department of Informatics
    » Media Lab (medialab.di.unipi.it/wiki/Projects)
  Department of Civilization
    » sections of Philosophy, History of Arts, History, Archeology
Academic and research institutions involved in Digital Humanities in Pisa
CNR
  Istituto di Linguistica Computazionale (ILC-CNR)
    » CLARIN coordinator for Italy
  Istituto di Scienze e Tecnologie dell’Informazione (ISTI-CNR)
    » Visual Computing Lab (vcg.isti.cnr.it/)
    » Human Interfaces in Information Systems Laboratory (giove.isti.cnr.it/)
    » Knowledge Discovery and Data Mining Lab (www-kdd.isti.cnr.it/)
  Istituto di Informatica e Telematica (IIT-CNR)
Scuola Superiore Sant’Anna
  Perceptual Robotics Lab (
CoLing Lab
War Memories (Memorie di Guerra)
An ongoing project to carry out a computational analysis and semantic indexing of Italian texts about WWI
  » University of Pisa, CoLing Lab
  » ILC-CNR, Pisa
  » history consultant: Prof. Nicola Labanca (University of Siena)
Texts are annotated automatically with state-of-the-art NLP tools to extract various kinds of information:
  » simple and multi-word terms
  » named entities
  » events and their participants
  » georeferenced locations
[Diagram labels: HLT, Natural Language Processing, ontologies, terms, thesauri, KOS]
Informatica Umanistica (Digital Humanities)
Started in 2002, Informatica Umanistica includes a three-year bachelor's program (Laurea) and a two-year master's program (Laurea magistrale), hosted by the Dept. of Philology, Literature and Linguistics in collaboration with the Dept. of Informatics.
Informatica Umanistica bachelor program study plan
First year (60 credits)
  informatics
    » Foundations of programming languages (12 credits)
    » Web design and programming (12 credits)
  humanities
    » Cultural geography (6 credits)
    » Italian linguistics (9 credits)
    » Writing laboratory (6 credits)
    » English language (9 credits)
    » General linguistics (6 credits)
Informatica Umanistica bachelor program study plan
Second year (60 credits)
  informatics
    » Algorithmics (6 credits)
    » Databases and Web Laboratory (12 credits)
  humanities
    » Introduction to historical studies (6 credits)
    » Italian contemporary literature (6 credits)
    » Computational linguistics (12 credits)
    » Italian literature (12 credits)
    » History of arts (6 credits)
Informatica Umanistica bachelor program study plan
Third year (60 credits)
  informatics
    » Telematics (6 credits)
    » One course (6 credits) among: Digital libraries, Multimedia production, E-learning technologies, Graphic design
  humanities
    » Text encoding (6 credits)
    » Latin language or literature (6 credits)
    » English, French, Spanish, or German literature (6 credits)
  Free choice courses (18 credits)
  Internship (6 credits)
  Final thesis (6 credits)
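The per-year totals in the study plan can be cross-checked with a few lines of Python. This is only a sketch: the credit values are transcribed from the three slides above, the dictionary keys are shortened course labels, and `STUDY_PLAN` and `year_totals` are illustrative names, not part of any official tool.

```python
# Credits per course, transcribed from the three study-plan slides.
# Keys are shortened labels chosen for this sketch.
STUDY_PLAN = {
    "First year": {
        "Foundations of programming languages": 12,
        "Web design and programming": 12,
        "Cultural geography": 6,
        "Italian linguistics": 9,
        "Writing laboratory": 6,
        "English language": 9,
        "General linguistics": 6,
    },
    "Second year": {
        "Algorithmics": 6,
        "Databases and Web Laboratory": 12,
        "Introduction to historical studies": 6,
        "Italian contemporary literature": 6,
        "Computational linguistics": 12,
        "Italian literature": 12,
        "History of arts": 6,
    },
    "Third year": {
        "Telematics": 6,
        "One informatics elective": 6,
        "Text encoding": 6,
        "Latin language or literature": 6,
        "Foreign literature": 6,
        "Free choice courses": 18,
        "Internship": 6,
        "Final thesis": 6,
    },
}


def year_totals(plan):
    """Return the sum of course credits for each year of the plan."""
    return {year: sum(courses.values()) for year, courses in plan.items()}


totals = year_totals(STUDY_PLAN)
for year, total in totals.items():
    print(f"{year}: {total} credits")
print(f"Whole program: {sum(totals.values())} credits")
```

Each year adds up to the 60 credits stated in its slide heading, for 180 credits over the three years.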
Informatica Umanistica bachelor program
Professional profiles after the bachelor
  » content management and Web development
  » language technology
  » electronic publishing houses
  » e-learning
  » journalism and communication
Employment rates after the bachelor (data collected in 2013)
Chart panels for each group: employment rate 12 months after the degree; job satisfaction; job coherence with the degree
Informatica Umanistica
  » employed 62.5%, unemployed 37.5%
  » high 20%, medium 40%, low 40%
  » high 20%, medium 60%, low 20%
Humanities
  » employed 44.4%, unemployed 55.6%
  » medium 25%, low 25%, null 50%
  » high 25%, medium 50%, low 25%
Informatica Umanistica master program
Programming and data analysis (15 credits)
Italian linguistics II (12 credits) (except for curriculum D)
12 free choice credits
6 credits for the Seminar of Digital Culture
21 credits for the final thesis
54 specific credits for each curriculum:
  A. Electronic Publishing
  B. Graphics, Interactivity and Virtual Environments
  C. Knowledge Management
  D. Language Technology
Informatica Umanistica: Electronic Publishing
18 credits among: Social network analysis, Electronic publishing, Collaborative work platforms, Interface design and usability, Information retrieval, Digital philology
12 credits among: Cartography, Legal aspects of informatics, Digital history, History of printing and publishing
24 credits among: Communication, Editorial writing, Online journalism, Internet marketing, Italian contemporary literature, Methods of physics for the humanities, Sociology of cultural processes, Technologies for Web marketing, Theory of literature, Visual analytics
Informatica Umanistica: Graphics, Interactivity and Virtual Environments
18 credits among: Digital audio, Electronic publishing, 3D graphics for cultural heritage, Collaborative work platforms, Interface design and usability, Interface programming
12 credits among: Cartography, Legal aspects of informatics, Digital history, Theory of TV and multimedia arts
24 credits among: Virtual environments, Communication, Online journalism, Seminar on cinema, Geographical information systems, Sociology of cultural processes, Technologies for Web marketing, Technologies for e-learning, Visual analytics
Informatica Umanistica: Knowledge Management
18 credits among: Machine learning, Databases for decision support, Data mining, Data-driven decision methods, Artificial intelligence, Social network analysis, Information retrieval, Collaborative work platforms
12 credits among: Cartography, Legal aspects of informatics, Digital history
24 credits among: Communication, Knowledge management, Internet marketing, Logic, Geographical information systems, Information technologies for literature production, Methods of physics for the humanities, Technologies for Web marketing, Technologies for e-learning, Visual analytics
Informatica Umanistica: Language Technology
18 credits among: Machine learning, Data mining, Artificial intelligence, Social network analysis, Information retrieval, Natural language processing
Italian linguistics (12 credits) is replaced with: Computational linguistics II and General linguistics II
12 credits among: Cartography, Legal aspects of informatics, Digital history
24 credits among: Applied linguistics, Phonetics and phonology, Philosophy of languages, Logic, Neurolinguistics, Methods of physics for the humanities
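The credit structure of the master program can be checked the same way. The sketch below uses only figures stated on the slides; the names `COMMON_CREDITS`, `CURRICULUM_SPECIFIC` and `ELECTIVE_GROUPS` are illustrative. For curriculum D it assumes that the two replacement courses together carry the same 12 credits as Italian linguistics II, which the slide implies but does not state outright.

```python
# Credits shared by all curricula, from the master program overview slide.
COMMON_CREDITS = {
    "Programming and data analysis": 15,
    # Assumption for curriculum D: Computational linguistics II and
    # General linguistics II together replace these 12 credits.
    "Italian linguistics II": 12,
    "Free choice": 12,
    "Seminar of Digital Culture": 6,
    "Final thesis": 21,
}

# Specific credits stated for each curriculum (A to D).
CURRICULUM_SPECIFIC = 54

# Each curriculum slide splits its specific credits into three elective groups.
ELECTIVE_GROUPS = (18, 12, 24)

# The three elective groups should account for all curriculum-specific credits.
groups_match = sum(ELECTIVE_GROUPS) == CURRICULUM_SPECIFIC

master_total = sum(COMMON_CREDITS.values()) + CURRICULUM_SPECIFIC

print(f"Elective groups add up to {sum(ELECTIVE_GROUPS)} credits (match: {groups_match})")
print(f"Master program total: {master_total} credits")
```

Under these figures the three elective groups account for the 54 curriculum-specific credits, and the whole two-year program comes to 120 credits.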
Enrolled students
[Chart: enrolled students, bachelor and master]
Master students with a bachelor degree at Uni. Pisa: 73.1%, 64.9%, 56.9%, 44.7%
Thanks!