Download presentation
Presentation is loading. Please wait.
Published byGeorgia Johnson Modified over 9 years ago
1
SemanticMining: Major Events July - December 2005
3
Workshop on Foundations of Clinical Terminologies and Classifications Location: Timişoara, Romania, April 8, 2006 Collocation: EFMI Special Topic Conference Endorsement: EFMI, GMDS, SemanticMining Local Organizer: George Mihalas Scientific Chairs: Stefan Schulz, Jeremy Rogers. SPC to be defined Invited Speakers: Olivier Bodenreider, N.N. Submission Deadline: Jan 15 Bursaries by Semantic Mining
4
WP 20: Research Activity Multilingual Medical Dictionary
5
WP20 Partners contributing to activities in 2005 Jul-Dec UKLFRFreiburg University Hospital Medical Informatics JENAUniversity of Jena Computational Linguistics LIU IMTLinköping Univ. Medical Informatics LIU IDALinköping Univ. Computer Science UGOTGöteborg University SUSahlgrenska Hospital Göteborg DIMGeneva Univ. Hospital Med. Informatics OUOpen University INSERMParis, Med. Informatics
6
Revised Subtasks of WP20 1.Development and Implementation of Lexical Acquisition Methodologies 2.Population of Medical Subword Lexicon 3.Specification and Acquisition of Multilingual Lexical Resources 4.Specification and Acquisition of Multilingual Corpora
7
WP 20 achievements Milestone 4 reached: interchange format for a multi-lingual medical dictionary, multi-lingual medical dictionary in at least five languages with more than 20,000 entries WP20 workshop in September (Paris) Specification of Common Link Format Ongoing enhancement of the MorphoSaurus Lexicon Enhancement of the MorphoEditWeb lexicon editor Use of standardized IR performance measurements in order to assess the progress in the lexicon Research on subword cognate aquisition. Ongoing feasibility studies and prototypical implementations in collaboration with German partners (industry, government) Implementation of corpus exchange guideline
8
WP 20 achievements (continued) Use of NLG techniques for the automated generation of Procedure description out of a formalized procedure ontology Automatic annotation of medical corpus with morphosyntactic and semantic information. Restructuring of LEXIN lexicon (20,000 entries) into a medically-oriented resource Compound decomposition, named entity recognition, and acronym decomposition for Swedish Adaptation of ITools for French terminology extraction Continuation of preparation of parallel French-English medical corpus Semi-automatic enrichment of French subword lexicon by exploiting existing subword lexicons in other languages
9
WP 20 Cross-WP activities WP 21 and WP 26. Use of Morphosaurus indexing for content retrieval in EHR archetype descriptions. WP21: Joint proposal on biomedical processes within biomedical ontologies WP22 translation of SNOMED CT into Swedish WP22 use of SNOMED Microglossary for a bilingual English-Russian lexicon WP24 Elaboration of a white paper on token and sentence segmentation WP23 (planned): Acquisition of Multilingual Terms from Lab Medicine
10
WP20 Strategic Planning Using the common interchange and link formats, multilingual linking/merging of lexicons plus semantic classes from subword lexicons Corpora pool as new deliverable for 2006 Extension of the parallel corpora alignment for new language pairs Ongoing population and cleansing of subword lexicons, extension of the corpus-based validation of subword lexicons to new language pairs Cross-validation of lexicons (e.g. removing duplicates) IR studies with new language pairs Experimental addition of Italian and Russian to the subword lexicon using automated Workshop at LREC (accepted) Tutorial at MIE
11
WP 20 Mobility Rahil Qamarfrom UOM to UKLFR Vincent Claveaufrom INSERM to UKLFR Mikael Nyströmfrom LIU to UKLFR Louise Délégerfrom INSERM to LIU Caspar Hasenclever from UKLFR to JENA Michael Popratfrom JENA to UKLFR Joachim Wermterfrom JENA to UKLFR Harald Kirschfrom EBI to UKLFR
12
WP 20 Joint Publications R.H. Baud, M. Nyström, L. Borin, R. Evans, S. Schulz, P. Zweigenbaum. Interchanging Lexical Information for a Multilingual Dictionary. AMIA 2005 annual symposium. Accepted for publication. K. Markó, S. Schulz, U. Hahn: Automatic Lexicon Acquisition for a Medical Cross-Language Information Retrieval System. Proceedings of the XIX International Congress of the European Federation for Medical Informatics (MIE '05), Geneva, Switzerland. 2005: 829-834. K. Markó, S. Schulz, O. Medelyan, U. Hahn: Bootstrapping Dictionaries for Cross-Language Information Retrieval. Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '05 ), Salvador, Brazil. 2005: 528-535. K. Markó, S. Schulz, U. Hahn: MorphoSaurus - Design and Evaluation of an Interlingua-based, Cross-language Document Retrieval Engine for the Medical Domain. Methods of Information in Medicine. 4/2005(44): 537-545 K. Markó, S. Schulz, U. Hahn: Unsupervised Multilingual Word Sense Disambiguation via an Interlingua. Proceedings of the 20th National Conference on Artificial Intelligence (AAAI '05), Pittsburgh, Pennsylvania. 2005: 1075-1080AAAI '05 M. Poprat, U. Hahn. Enough is Enough – Estimating Upper Bounds of the Size of Training Corpora for Unsupervised PP Attachment Disambiguation. Proceedings of Fifth International Conference on Recent Advances in Natural Language Processing (RANLP-2005) K. Markó, S. Schulz and U. Hahn. Multilingual Lexical Acquisition by Bootstrapping Cognate Seed Lexicons. Proceedings of Fifth International Conference on Recent Advances in Natural Language Processing (RANLP-2005) U. Hahn, P. Daumke, S. Schulz, K. Markó: : Cross-Language Mining for Acronyms and their Completions from the Web. Proceedings of the 8th International Conference on Discovery Science (DS '05), Singapore. 2005. S. Schulz, K. Markó, R. L. de Andrade, E. Pacheco, P. Nohama, U. Hahn, M. Romacker: The Morphosaurus Medical Subword Lexicon. Lexicographic and Semantic Aspects. Proceedings of the 3th Workshop em Tecnologia da Informação e da Linguagem Humana (TIL '05), São Leopoldo, Brasil. 2005TIL '05 Mikael Nyström, Magnus Merkel, Lars Ahrenberg, Michael Petterstedt, Håkan Petersson & Hans Åhlfeldt. Generering av ett medicinskt engelskt-svenskt lexikon med hjälp av interaktiv ordlänkning. Accepted for Svenska Läkaresällskapets riksstämma (Annual Meeting of Swedish Society of Medicine). K. Marko, P. Daumke, S. Schulz, U. Hahn. Automatische Generierung einer sprachübergreifenden Akronymdatenbank. 50. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie (gmds), Freiburg 11. - 15. September 2005 (Annual Meeting of the German Society of Medical Informatics, Biometry and Epidemiology) M. Poprat, K. Markó, U. Hahn. Automatische Klassifikation medizinischer Dokumente nach Sprache und Zielgruppe für Text-Retrieval-Systeme. 50. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie (gmds), Freiburg 11. - 15. September 2005 (Annual Meeting of the German Society of Medical Informatics, Biometry and Epidemiology) Website: http://www.morphosaurus.net, updated in August 2005
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.