1 “Penuria nominum” – shortage of words Knowledge beyond the capacity of language? by György Surján ESKI Hungary Commentary to Judith Blake Beyond Data.

Slides:



Advertisements
Similar presentations
IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
Advertisements

A new approach to Artificial Intelligence.  There are HUGE differences between brain architecture and computer architecture  The difficulty to emulate.
Visit us at: Purpose - Do well now and in future Research findings suggest that core subject knowledge, non- routine problem solving,
Structural Reliability Analysis – Basics
Bioinformatics and the Engineering Library ASEE 2008 Amy Stout.
Bioinformatics For MNW 2 nd Year Jaap Heringa FEW/FALW Integrative Bioinformatics Institute VU (IBIVU) Tel ,
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
SCB : 1 Department of Computer Science Simulation and Complexity SCB : Simulating Complex Biosystems Susan Stepney Department of Computer Science Leo Caves.
Four Dark Corners of Requirements Engineering
School of Computing and Mathematics, University of Huddersfield Knowledge Engineering: Issues for the Planning Community Lee McCluskey Department of Computing.
Bacterial Physiology (Micr430)
The reform of A level qualifications in the sciences Dennis Opposs SCORE seminar on grading of practical work in A level sciences, 17 October 2014, London.
1 Unity of Invention: Biotech Examples TC1600 Special Program Examiner Julie Burke (571)
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Overview of Bioinformatics A/P Shoba Ranganathan Justin Choo National University of Singapore A Tutorial on Bioinformatics.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
Bioinformatics Sean Langford, Larry Hale. What is it?  Bioinformatics is a scientific field involving many disciplines that focuses on the development.
SOFTWARE ENGINEERING BIT-8 APRIL, 16,2008 Introduction to UML.
10th Intl. Protégé Conference - July 15-18, Budapest, Hungary Do ontologies work? Barriers and Possibilities to Implement Ontologies in (Health)
Lesson Overview Lesson Overview Studying the Human Genome Lesson Overview 14.3 Studying the Human Genome.
Intellectual Property, Patents & Technology Transfer Sagar Manoli Shashidhar, Philippe Abdel-Sayed Responsible Conduct in Biomedical Research EPFL,
U.S. Patent and Trademark Office Technology Center 1600 Michael P. Woodward Unity of Invention: Biotech Examples.
Open Biomedical Ontologies. Open Biomedical Ontologies (OBO) An umbrella project for grouping different ontologies in biological/medical field –a repository.
INTRODUCTION TO SCIENCE Chapter 1 Physical Science.
AP Biology Discussion Notes Wednesday 01/28/2015.
The Development of Numbers The Development of Numbers next Taking the Fear out of Math © Math As A Second Language All Rights Reserved Hieroglyphics Tally.
From Bench to Bedside: Applications to Drug Discovery and Development Eric Neumann W3C HCLSIG co-chair Teranode Corporation HCLSIG F2F Cambridge MA.
Section 1 The Nature of Science.  Science is a process  1 st investigate/research a problem  2 nd experiment  3 rd observe the outcome  4 th test.
The Gene Ontology: a real-life ontology, progress and future. Jane Lomax EMBL-EBI.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
An Introduction to Software Engineering. Communication Systems.
Genomics and Arabidopsis. What is ‘genomics’? Study of an organism’s entire genome –All the DNA encoded in the organism –Nucleus, mitochondria, chloroplasts.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 14 Slide 1 Object-oriented Design.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
Genomes To Life Biology for 21 st Century A Joint Initiative of the Office of Advanced Scientific Computing Research and Office of Biological and Environmental.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
Earth Science Chapter 1.
To Boldly GO… Amelia Ireland GO Curator EBI, Hinxton, UK.
EB3233 Bioinformatics Introduction to Bioinformatics.
2008/01/30Lecture 11 Game Theory. 2008/01/30Lecture 12 What is Game Theory? Game theory is a field of Mathematics, analyzing strategically inter-dependent.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
An overview of Bioinformatics. Cell and Central Dogma.
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
Unit 2 The Molecule of Life Genes and Heredity. What is a gene?
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
Teaching Bioinformatics Nevena Ackovska Ana Madevska - Bogdanova.
Artificial Intelligence: Research and Collaborative Possibilities a presentation by: Dr. Ernest L. McDuffie, Assistant Professor Department of Computer.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
Evolution and the Foundations of Biology
Starter What do you know about DNA and gene expression?
AP Biology Discussion Notes Wednesday 2/10/2015. Goals for Today Be able to describe how DNA & RNA molecules differ from each other. Be able to name and.
High throughput biology data management and data intensive computing drivers George Michaels.
Chapter 13 Section 13.3 The Human Genome. Genomes contain all the information needed for an organism to grow and survive The Human Genome Project (HGP)
From the double helix to the genome
BME435 BIOINFORMATICS.
Human Genome Project By: Scott Kutschke.
Independent Study of Ontologies
Hood College Master of Science in Bioinformatics (Proposed)
Department Of Computer Science Engineering
Genomes and Their Evolution
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Nancy Baker SILS Bioinformatics Seminar January 21, 2004
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
The Future of Genetic Research
LESSON 1 INTNRODUCTION HYE-JOO KWON, Ph.D /
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

1 “Penuria nominum” – shortage of words Knowledge beyond the capacity of language? by György Surján ESKI Hungary Commentary to Judith Blake Beyond Data Integration: Data Management for Knowledge Discovery Ontology and Biomedical Informatics Rome 29 April – 2 May 2005

2 Overview: (A commentary of an outsider) 1.Modern science is analytical 2.Problem of identity 3.Capacity of language is limited 4.Genomics and proteomics deals with extremely large databases. 5.Ontologies are bound to reality by language tags

3 1. Modern science is analytical

4 Mouse and human genes agree in 90% Mouse and human body is built from rather similar building blocks. The difference is not in the building elements, but in the different way of integration of the elements. The difference is not only phenotypic, but functional: Humans are able to create such a sculpture demonstrating the beauty of human body

5 Q1. Is analytical approach sufficient to explain differences of living organisms? By changing of 10 % of its genes would a mouse be able to create sculptures like Michelangello?

6 2. The problem of identity Importance of identity in ontology: Entities having different identity criteria can not belong to the same class.

7 = We have the strong feeling of our self-identity all over of our whole life, despite of all changes that happen to us

8 = Identity is independent from similarity and recognition

9 Identity of genes or proteins Entities may gain or loose parts without loosing their identity Genes loosing some nucleotides are still identical? Q2. What are the identity criteria for genes and proteins?

10 Elementary particles have no identity Humans and developed animals obviously have Q3. At which level of organisation identity emerges? (Do biological macromolecules have identity?)

11 Shepard in the 19th century ~3-400 words Anatomy (intermediate language certificate) ~4000 terms SNOMED 3.1, Encyclopaedia Britannica~ terms WordNet~ strings UMLS Metathesaurus> terms 3. Capacity of language is limited  Our language capacity is huge, but nevertheless finite

12 Limiting factors: 1. Capacity of human brain 2. Number of terms shared by a community

13 Example of numbers Different names for the first 13 numbers (zero- twelve) in English, then we use combinations hundred10 2 thousand10 3 million 10 6 billion10 9 …. ? We have linguistic solution to express extremely large numbers in price of precision loss 

14 Up to now, mankind has not met any situation which could exhaust the capacity of human language, not because the number of things to be expressed were less than this capacity, but we always could find some acceptable compromise. We do not know where are the limitations of our language capacity, but the feeling of this limitation was well known centuries ago (penuria nominum): In the 17 th century Harsdörfer proposed a machine with 5 wheels containing 256 syllables, prefixes and suffixes, beeing able to generate about 97 million (mostly nonsense) German words in order to find the real name of God and also to being able to use different names for all particualrs in the world instead of referring them by names of their classes (U. Eco: Between La Mancha and Babel)

15 Size of genomics databases: GO~ terms Human genome ~ ? genes GenBank~ sequences

16 Are we able to use 42 million names? Q4. Is it possible to describe molecular biology using human language? Is there any other representation tool to be used for that purpose?

17 5. Ontologies are bound to reality by language tags formal languages are used to describe structures

18 language tag ID language tag Reality Language What is the meaning ?

19 Q5. If language fails in genomics and proteomics, is there a need and possibility for alternative methods of ontology engineering, that does not requires language? If ontologies are bound to reality by language, than it is hard to create (use) ontology where the problem field exceeds the capacity of language.

20 Q4. Is it possible to describe molecular biology using human language? Is there any other representation tool to be used for that purpose? Q3. At which level of organisation identity emerges? (Do biological macromolecules have identity?) Q1. Is analytical approach sufficient to explain differences of living organisms? Q2. What are the identity criteria for genes? Summary of questions Q5. If language fails in genomics and proteomics, is there a need and possibility for alternative methods of ontology engineering, that does not requires language?