JYC: CSM17 BioinformaticsCSM17 Week 3: Biological Identification A fundamental activity Traditional methods - keys Special problems Computer-based methods.

Slides:



Advertisements
Similar presentations
Chapter 10: Designing Databases
Advertisements

The Library of Life Federated Description Services and the Library of Life or What can we do with SDD anyway? Kevin Thiele Centre for Biological Information.
Landcare Research LCR Manage 7 of 25 Nationally Significant Collections & Databases –CHR (plants, mosses etc) - 600k collections –PDD (fungi) - 80k collections.
How to publish genomic Data papers based on BOL data - Biodiversity Data Journal Lyubomir Penev Bulgarian Academy of Sciences & Pensoft Publishers ViBRANT.
Pensoft Writing Tool (PWT) Lyubomir Penev ViBRANT Tools for DNA taxonomists, 11 June 2013, Brussles ViBRANT.
The Naturalist Fredrik Ronquist Swedish Museum of Natural History.
National Herbarium of New South Wales Royal Botanic Gardens & Domain Trust, Sydney New South Wales Flora Online Karen Wilson and Gary Chapple.
Publish or perish? Linking Scratchpads and the new Biodiversity Data Journal for streamlining publication of botanical data D.N Koureas 1, L. Penev 2 &
SDD: Structured Descriptive Data Gregor Hagedorn (Germany) Bob Morris (USA) Kevin Thiele (Australia)
Scratchpads Publishing biodiversity: The interplay between Scratchpads and the Biodiversity Data Journal Dr Dimitrios Koureas Biodiversity Informatics.
Arthur ChapmanData Quality Training SABIF June 2012 Taxonomic and Nomenclature Data A. D. Chapman Data Quality.
JYC: CSM17 BioinformaticsCSM17 Week 10: Summary, Conclusions, The Future.....? Bioinformatics is –the study of living systems –with respect to representation,
Session 28 Techie Terminology and Benefits for Financial Aid Administrators Tim Bornholtz Holly Hyland.
JYC: CSM17 BioinformaticsCSM17 Week 10: Summary, Conclusions, The Future.....? Bioinformatics is –the study of living systems –with respect to representation,
JYC: CSM17 BioinformaticsCSM17 Week 8: Simulations (1) Soft Computing: Genetic Algorithms Evolutionary Computation Neural Networks.
Fundamentals of Information Systems, Second Edition 1 Organizing Data and Information Chapter 3.
JYC: CSM17 BioinformaticsCSM17 Week1:What is Bioinformatics? A Multidisciplinary Subject incorporating: Biology –the study of living systems Informatics.
EGR 106 – Week 2 – Arrays & Scripts Brief review of last week Arrays: – Concept – Construction – Addressing Scripts and the editor Audio arrays Textbook.
JYC: CSM17 BioinformaticsCSM17 Week2: Biological Classification Fundamental concepts Traditional methods Nomenclature (naming) Taxonomy & systematics Overview.
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 7: Expert Systems and Artificial Intelligence Decision Support.
Basic concepts of Data Mining, Clustering and Genetic Algorithms Tsai-Yang Jea Department of Computer Science and Engineering SUNY at Buffalo.
JYC: CSM17 BioinformaticsCSM17 Week 3: Biological Identification A fundamental activity Traditional methods - keys Special problems Computer-based methods.
JYC: CSM17 BioinformaticsCSM17 Week 4: Evolutionary systems Evolution and phylogeny Historical systems Modern methods Tools and software.
JYC: CSM17 BioinformaticsCSM17 Week 7: Simulations: Genetic Algorithms Evolutionary Computation.
JYC: CSM17 BioinformaticsCSM17 Week 4: Evolutionary systems Evolution and phylogeny Historical systems Modern methods Tools and software.
JYC: CSM17 BioinformaticsCSM17 Week 3: Biological Identification A fundamental activity Traditional methods - keys Special problems Computer-based methods.
JYC: CSM17 Bioinformatics Week 9: Simulations #3: Neural Networks biological neurons natural neural networks artificial neural networks applications.
JYC: CSM17 BioinformaticsCSM17 Week 2: Biological Classification Fundamental concepts Traditional methods Nomenclature (naming) Taxonomy & systematics.
Developing a Basic Web Page with HTML
Introduction to Artificial Neural Network and Fuzzy Systems
WEB DESIGNING Prof. Jesse A. Role Ph. D TM UEAB 2010.
What is Multimedia? Multimedia is a combination of text, art, sound, animation, and video. It is delivered to the user by electronic or digitally manipulated.
Artificial Intelligence (AI) Addition to the lecture 11.
Species Banks a GBIF mechanism to provide electronic access to quality species information Peter H. Schalk, Marc Brugman ETI, University of Amsterdam Tinde.
Fundamentals of Information Systems, Second Edition 1 Organizing Data and Information.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Copyright © 2003 by Prentice Hall Computers: Tools for an Information Age Chapter 13 Database Management Systems: Getting Data Together.
DATA COMMUNICATION DONE BY: ALVIN SAMPATH CARLVIN SAMPATH.
Scratchpads Publication Module - A paradigm shift in publishing RBG Kew, Seminar,
HTML (HyperText Markup Language)
At the frontline of publishing in systematic zoology: A presentation of ZooKeys Lyubomir Penev 1, Terry Erwin 2, Jeremy Miller 3 1 Pensoft Publishers,
@dimitriskoureas making small data… big. Publications based on countless specimens, images, maps, keys and datasets Typically generated by small communities.
Animal Species Database of China JI, Li-Qiang Institute of Zoology, CAS Beijing, China CODATA, 2006, Beijing.
Field Work, Herbaria, Databases, Floras, and Monographs for Plant Systematics Spring 2014.
Networking ITTC with TT:CLEAR Xiaohua ZHANG Tsinghua University, Beijing, China.
PCWG Analysis Tool Peter Stuart September 15, 2015.
ONLINE POLYCLAVE KEY FOR THE PLANTS FROM THE KIRBY PARK NATURAL AREA, WILKES-BARRE, PA. Breanne Dibble a, Matthew Johnston a, Theodore Orelien a, Jeffrey.
MULTIMEDIA DEFINITION OF MULTIMEDIA
Information Systems Overview (COIS 20024) Lecture: Week 3 Computer Software (Information Systems Resources)
Scratchpads The virtual research environment for biodiversity data Simon Rycroft, Dave Roberts, Vince Smith, Alice Heaton, Katherine Bouton, Laurence Livermore,
An Introduction to Scratchpads: Making your data work for you Laurence Livermore Natural History Museum, London Joinville, Brazil.
Don’t make me think Biodiversity Data Publishing Made Easy Laurence Livermore, Vince Smith, Alice Heaton, Simon Rycroft, Ed Baker, Ben Scott & Lyubomir.
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
HISCOM An Australian Virtual Herbarium Jim Croft Australian National Herbarium.
Plant systematics - introduction Home page: Instructor (Winter.
Invitation to Computer Science 6 th Edition Chapter 10 The Tower of Babel.
LBSC 690 Session 4 Programming. Languages How do we learn a language? Learn by listening Then reading Then writing How do we teach programming? Learn.
Morpho – metadata management software SEEK Training January 2004.
XP 2 HTML Tutorial 1: Developing a Basic Web Page.
Ms. Tracy MODULE 1- LESSON 7. BELL RINGER What are the primary functions of a word-processing program?
introductionwhyexamples What is a Web site? A web site is: a presentation tool; a way to communicate; a learning tool; a teaching tool; a marketing important.
Introduction to Visual Basic 2008 Programming
SPECIALIZED APPLICATION SOFTWARE
Unit# 8: Introduction to Computer Programming
Systematics: The Science Of Biological Diversity Chapter 12
Bioinformatics CSM17 Week 4: Evolutionary systems
VIETNAM ACADEMY OF SCIENCE AND TECHNOLOGY
Database & Information Systems
Interactive Keys for Plant Identification
Bioinformatics CSM17 Week2: Biological Classification
Presentation transcript:

JYC: CSM17 BioinformaticsCSM17 Week 3: Biological Identification A fundamental activity Traditional methods - keys Special problems Computer-based methods

JYC: CSM17 Fundamental concepts types are often not typical! homology

JYC: CSM17 How to identify an organism? Traditional/classical methods... Find someone who knows what it is ! Indented and bracketed Keys –since the 1600s ! Floras and monographs Mostly phenotypic characters

JYC: CSM17 Traditional Methods... A key to identify Human, Cow, Dog (only!) 1. Number of legs two Human 1. Number of legs four Stomach chambers four; eats grass Cow 2. Stomach chambers one; eats meat Dog

JYC: CSM17 Difficulties caused by... new taxa (e.g. new species) phenotypic variation genotypic variation maturity sexual dimorphism incomplete material ‘incorrect’ classification

JYC: CSM17 The value of characters Ease of observation Clarity / unambiguous Information content: Entropy (H)

JYC: CSM17 Computer-based methods Key generators eg. DELTA On-line keys –Polyclaves e.g.LucID, CABIKEY Expert Systems

JYC: CSM17 DELTA DEscriptive Language for TAxonomy a suite of programs and tools a database format KEY generator

JYC: CSM17 Main files ITEMS CHARS SPECS

JYC: CSM17 CHARS The Characters (attributes) Character types –Unordered Multistate (UM) e.g. 1. red, 2. blue, 3. green –Ordered Multistate (OM) e.g. small, medium, large –Integer Numeric (IN) e.g. 1, 2, 5, 3, 8, 9 etc. –Real Numeric (RN) e.g. 32.5, 0, 45.2, 3.1 etc. –Text (TE) e.g. Collected by J.Smith in 1992

JYC: CSM17 CHARS *SHOW Tilia species - character list. *CHARACTER LIST #1. Leaf width/ cm/ #2. Axillary tufts/ 1. absent/ 2. indistinct or sparse/ 3. clearly present/ #3. Flowers per cyme/

JYC: CSM17 ITEMS the taxa, e.g. species, subspecies, varieties Format... ITEMS *SHOW Comments are written here #NAME/,,..

JYC: CSM17 ITEMS *SHOW This is an example for Tilia #HEN/ 1,9.9 2,3 3,26

JYC: CSM17 SPECS Number of characters Maximum number of character states Maximum number of items (taxa) Character types Number of states per character

JYC: CSM17 SPECS *SHOW Tilia species *NUMBER OF CHARACTERS 22 *MAXIMUM NUMBER OF STATES 7 *MAXIMUM NUMBER OF ITEMS 88 *CHARACTER TYPES 1,RN 2,OM 3,IN *NUMBERS OF STATES 2,3

JYC: CSM17 DELTA KEY Generator Creates a text-based identification key Chooses ‘best’ characters first Uses a ‘comparison’ function Finds the character which requires fewest questions

JYC: CSM17 TOKEY *SHOW Translate into KEY format *INPUT FILE specs *TRANSLATE INTO KEY FORMAT *COMMENT. EXCLUDE CHARACTERS *USE NORMAL VALUES 1 3 *COMMENT. CHARACTER RELIABILITIES *KEY STATES 1, / , / / /26.0

JYC: CSM17 DIANA A DELTA shell Integrates functionality in Windows

JYC: CSM17 INTKEY An interactive multimedia on-line key system bundled with DELTA Example for Grasses Can include pictures User chooses order of characters

JYC: CSM17 ETI - Expert center for Taxonomic Identification University of Amsterdam, The Netherlands Series of Multimedia interactive software Includes interactive key, pictures, videos... Written by acknowledged experts

JYC: CSM17 AI: Expert Systems, Neural Nets EXPERT KEY (Atkinson & Gammerman) ISAR (Chesmore et al.) ANNKEY (Clark & Warwick)

JYC: CSM17 Leading to the Future... DNA and RNA –CATCATCATCATCAT eg. Forensic science, Paternity, Maternity XDELTA uses XML - eXtensible Markup Language (L.Dodds) Taxonomic Markup Language (R.Gilmour)

JYC: CSM17 Useful Websites DELTA and DIANA: LUCID: Digital Taxonomy:

JYC: CSM17 References & Bibliography Atkinson & Gammerman (1987). An application of expert systems technology to biological identification. Taxon 36 (4), pp Chesmore, E.D. et al. (1998). Automated analysis of insect sounds. In Bridge, P. et. al. (eds.) Information Technology, Plant Pathology and Biodiversity, CAB International, pp Clark, J.Y. (2003). Artificial neural networks for species identification by taxonomists”. BioSystems, vol. 72, pp Clark, J.Y. & Warwick, K. (1998). Artificial keys for botanical identification using a multilayer perceptron neural network (MLP). Artificial Intelligence Review vol 12, pp Dallwitz, M.J., Paine, T.A. & Zurcher, E.J. (1997). User’s guide to the DELTA system -a general system for processing taxonomic descriptions, Edition 4.07, CSIRO Division of Entomology: Canberra, Australia. ( Pankhurst, R.J. (1991). Practical Taxonomic Computing. University of Cambridge Press: UK. Pankhurst, R.J. (1978). Biological Identification. The Principles and Practice of Identification Methods in Biology. Edward Arnold, London Pankhurst, R.J. (1998). A historical review of identification by computer. In Bridge, P. et. al. (eds.) Information Technology, Plant Pathology and Biodiversity, CAB International, pp