On our way to to Information Overload ?. Or to prevent it by Appropriate use of Technology ?

Slides:



Advertisements
Similar presentations
CD40 ligand and tumor necro sis factor alpha, the cells acquire a mature phenotype of dendritic cells that is characterized by up-regulation of human leukocy.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Searching for Medicines Information New Zealand College of Pharmacists.
Mining Brain-Related Transcription Factor- Disease Relationships for Novel Linkages Unified Medical Language System (UMLS) Specialised Databases Atlas.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
The Thomson Reuters CITATION CONNECTION Digital Library st March – 3 rd April 2014, Jasná David Horký Country Manager – Central and Eastern Europe.
UpdataCapital The Context The Application Latest technical developments.
1 Knowledge Management for Disease Coding (KMDC): Background & Introduction Timothy Hays, Ph.D. Project Manager, Knowledge Management for Disease Coding.
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
JYC: CSM17 BioinformaticsCSM17 Week 10: Summary, Conclusions, The Future.....? Bioinformatics is –the study of living systems –with respect to representation,
Automating Discovery from Biomedical Texts Marti Hearst & Barbara Rosario UC Berkeley Agyinc Visit August 16, 2000.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
IST Computational Biology1 Information Retrieval Biological Databases 2 Pedro Fernandes Instituto Gulbenkian de Ciência, Oeiras PT.
The Semantic Web: New-style data-integration (and how it works for life-scientists too!) Frank van Harmelen AI Department Vrije Universiteit Amsterdam.
Social Pharmacy and Pharmacoepidemiology Lister Hill National Center for Biomedical Communications Text-based Discovery in Biomedicine The Architecture.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
CSE 730 Information Retrieval of Biomedical Data The use of medical lexicon in biomedical IR.
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Presentation by Yuri de Lugt. Presentation structure Definitions of knowledge management Forms of knowledge Knowledge infrastructure Collexis background.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
Linking Diseases and Genes through Informatics Knowledge Bases and Ontologies Joyce A. Mitchell, Ph.D. National Library of Medicine University of Missouri.
Bioinformatics and medicine: Are we meeting the challenge?
Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA Experiences in visualizing and navigating biomedical.
Next generation semantic support technology Barend Mons CD40 ligand and tumor necro sis factor alpha, the cells acquire a mature phenotype of dendritic.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Flexible Text Mining using Interactive Information Extraction David Milward
BIOSEMANTICS Semantic Support Technology For on-line Knowledge Tracking and Discovery Second Order Semantic Enrichment CD40 ligand and tumor necro sis.
Next generation semantic support technology CD40 ligand and tumor necro sis factor alpha, the cells acquire a mature phenotype of dendritic cells that.
Vision Talk “If it is not in Wikipedia, it probably does not exist.” Jimmy Wales used to say that about Google when Google still existed.
Biosemantics group Martijn Schuemie. Overview  The biosemantics group  Ontology assembly  Concept tagging  Homonym disambiguation  Concept profile.
Medline on OvidSP. Medline Facts Extensive MeSH thesaurus structure with many synonyms used in mapping and multidatabase searching with Embase Thesaurus.
Péter Schönhofen – Ad Hoc Hungarian → English – CLEF Workshop 20 Sep 2007 Performing Cross-Language Retrieval with Wikipedia Participation report for Ad.
Orphanet Europe State of the Art of Database and Services Polish activity Orphanet Europe State of the Art of Database and Services Polish.
Relevance Detection Approach to Gene Annotation Aid to automatic annotation of databases Annotation flow –Extraction of molecular function of a gene from.
Cell Signaling Ontology Takako Takai-Igarashi and Toshihisa Takagi Human Genome Center, Institute of Medical Science, University of Tokyo.
MINING FOR MEANING: Data mining & Knowledge extraction Laboratory of Parasitic Diseases, NIAID.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
Overview of Bioinformatics 1 Module Denis Manley..
Gene Clustering by Latent Semantic Indexing of MEDLINE Abstracts Ramin Homayouni, Kevin Heinrich, Lai Wei, and Michael W. Berry University of Tennessee.
HUMAN GENOME PROJECT International effort of 13 years (1990 – 2003) Identified all the approximate 20,000 – 25,000 genes in human DNA Determined the sequences.
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
From the Advanced Search page of the Cochrane Library, we have clicked on the Cochrane Reviews: By Topic hyperlink. This has displayed the Topics for Cochrane.
GEMET GEneral Multilingual Environmental Thesaurus leading the way to federated terminologies Stefan Jensen, Head of information services group with input.
Distribution of information in biomedical abstracts and full- text publications M. J. Schuemie et al. Dept. of Medical Informatics, Erasmus University.
Topic Maps introduction Peter-Paul Kruijsen CTO, Morpheus software ISOC seminar, april 5 th 2005.
Overview of Bioinformatics Module Denis Manley.. Contact Details Lecturer Name: Denis Manley Room number: KE-1-013a
A literature network of human genes for high-throughput analysis of gene expression Speaker : Shih-Te, YangShih-Te, Yang Advisor : Ueng-Cheng, YangUeng-Cheng,
1 Semantic Relations for Interpreting DNA Microarray Data and for Novel Hypotheses Generation Dimitar Hristovski, 1 PhD, Andrej Kastrin, 2 Borut Peterlin,
The financial support of the European Commission is gratefully acknowledged. Material in this presentation reflects only the author’s views and the Commission.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation Bioinformatics, July 2003 P.W.Load,
PubMed …featuring more than 20 million citations for biomedical literature from MEDLINE, life science journals, and online books.
Automatically Identifying Candidate Treatments from Existing Medical Literature Catherine Blake Information & Computer Science University.
From the initial page of the Cochrane Library, we have clicked on the Cochrane Reviews: By Topic hyperlink. This has displayed the Topics for Cochrane.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Semantic (web) activity at Elsevier Marc Krellenstein VP, Search and Discovery Elsevier October 27, 2004
The National Library of Medicine and its databases Lívia Vasas, PhD
Innovative Novartis Knowledge Center
TDM in the Life Sciences Application to Drug Repositioning *
The National Library of Medicine and its databases
Platform for Healthcare
Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD
Lívia Vasas, PhD 2018 The National Library of Medicine and its databases Mozilla Firefox/Google Chrome Lívia Vasas, PhD.
The National Library of Medicine and its databases
Lívia Vasas, PhD 2018 The National Library of Medicine and its databases Mozilla Firefox/Google Chrome Lívia Vasas, PhD.
9 Future Challenges for Bioinformatics
Introduction to Information Retrieval
The National Library of Medicine and its databases
Presentation transcript:

On our way to to Information Overload ?

Or to prevent it by Appropriate use of Technology ?

C C C C C C consolidated knowledge Collexis Fingerprints (CFP’s)

English French Spanish People medical researchers around the world Activities in elect. text like projects, publications Medline abstracts... Disease:#12674 Multilingual Thesaurus Indexer Matches keywords, translates them to identical numbers and ranks them by their relevance Maladie:#12674 Enfermedad:#12674 Malaria:#24530 Hospital:#19994 Paludisme:#24530 Paludismo:#24530 Hôpital :#19994 Hospital:# The Common Language Each activity is represented as a set of keyword numbers ranked by their relevance #4256 : 1.0 #3627 : 0.8 #19994 : 0.5 #28746 : 0.3 #32874 : 0.1 #14325 : 1.0 #3627 : 0.8 #19994 : 0.5 #28746 : 0.3 #32874 : 0.1 #85643 : 1.0 #3627 : 0.8 #19994 : 0.5 #28746 : 0.3 #32874 : 0.1 #17345 : 1.0 #3627 : 0.8 #19994 : 0.5 #28746 : 0.3 #32874 : 0.1 #1c8456 : 0.1 #00356 : 0.1 „Collexion“ of activities You: #17345:1.0 #3627 :0.8 #19994:0.5 #28746:0.3 #32874:0.1 Your activity as text Submit and indexed to keyword numbers Find similar activities and the people behind Cross-language networking

The Early evolution of Fingerprint Manipulation contents fingerprints add people fingerprints add organization fingerprint Jobs CV’s, Skills Articles, books s, Word RFP’s

BIOSEMANTICS “Cellese”: the language that cells use to communicate internally and externally. The Molecular Language and its biological MEANING The Group –Jan Kors PhD. –Erik van Mulligen PhD –Bob Schijvenaars PhD –Marc Weeber PhD –Christiaan v.d. Eyck MsC –Rob Jelier PhD –Barend Mons PhD –Johan van der Lei PhD

A consortium to combine State-of-the-art Information and Knowledge Mining Technologies To support : Thesaurus and ontology enrichment Disambiguation of concepts Semantic meta-analysis of massive information To enable : Information-based discovery Evidence based policy making

Thesaurus and Ontology Enrichment New concepts Synonyms Homonyms Genes, Proteins Pictures

Validation 3 Free text Unexplained Text (XML) Potential concepts Thesauri: Mesh HUGO SwissProt SAGE Others FUA 4 1 Fingerprint s (known concepts) partners E-BioSci EMBO Elsevier NLP 2 TNO LUMC HUGONC Genebio AMC EUR UVA SERENDIP

Too much to read: major trends foreseen: From Reading to Consulting From Reading to Meta-analysis From Text to Knowledge Representations

C C C C C C Semantic types Co-occurrence data The first step: to the Conceptual Semantic Network

Calcium deposition Pleocytosis Basal Ganglia Encephalopathy Cerebrospinal Fluid Tomography, X-Ray Computed Parents Family Aicardi Goutieres syndrome Ferrocalcinotic deposition Spastic quadraplegia Fahr disease Microcephaly AGS1 x G-protein coupled receptors G-substrate Lipoid dermatoarthritis Receptors Complement Factor B RNA, Complementary Xenopus oocyte AGS1 SwissProt: Activator of G-protein signaling 1 (AGS1) * AICARDI-GOUTIERES SYNDROME 1; (AGS1) : OMIM Aicardi Goutieres syndrome 1 Heterogeneity Linkage (Genetics) Clinical diagnosis Family 2 AGS1 ** Lod Score Genetic Heterogeneity analysis Toxoplasmosis Calcium deposition 3 Encephalopathy 4 Cadmium Genus: Human cytomegalovir... Cerebrospinal fluid abnorm. 5.. Interferon-alpha Chromosomes Viral Child Head Tricuspid Valve Stenosis

Fingerprinting disambiguation ACS META-ANALYSIS

Applications Cross-language, jargon and cross-system matching (implemented): Information-based discovery (Research) Community building (Experts,Policy Making) Trendwatching and Indicators (Policy Making)

Seed-Term based Conceptual Semantic Networks

? Clustering of genes on-the-fly

Predicting new knowledge ?

III= Distribution over distance categories of concept-pairs without co-occurrence in the learning set. IV= Distance categories of concept pairs related to the probability that there is no explicit relationship or co-occurrence in Medline (zero ratio). A ratio of 0 means that an automatic Query in Medline with the concept pair with “AND” in between does lead to 0 hits in Medline.

New Drug discovery ?

Semantic Filtering

Name: A Institute contact details IBC text Acronym Organisation contact details DOI metadata text Title metadata DOI

The knowlet: content, people, organisations Publications Molecular Databases Image databases Patents Events Calls

Knowledge Maps, Nature Biotechnology Map

Knowledge Maps: Medline Bioterrorism Map 1997

Knowledge Maps: Medline Bioterrorism Map 2001

Private Research DC Public E-BioSci Pharma etc. ORIEL SERENDIP FP6 etc. I-Research Ministies WHO, FAO etc. SHARED BIREME/VHL EDCTP Oxford intiative etc.