Predicting food web connectivity Phylogenetic scope, evidence thresholds, and intelligent agents Cynthia Sims Parr Ecological Society of America Memphis,

Presentation transcript:

Predicting food web connectivity: Phylogenetic scope, evidence thresholds, and intelligent agents. Cynthia Sims Parr. Ecological Society of America, Memphis, TN, August 8, 2006.

ELVIS: Ecosystem Localization, Visualization, and Information System
Components: Species list constructor, Food web constructor
[Figure: a species list (Bacteria, Microprotozoa, Amphithoe longimana, Caprella penantis, Cymadusa compta, Lembos rectangularis, Batea catharinensis, Ostracoda, Melanitta, Tadorna tadorna) alongside Oreochromis niloticus (Nile tilapia), marked with question marks]

ELVIS’s Food Web Constructor predicts basic network structure, as a prelude to systems models.

[Figure: a food web, made of nodes (G, S) joined by links, beside an evolutionary tree, made of taxa (G, S, A) separated by steps]

Evolutionary Distance Weighting
1. Set distance thresholds.
2. Find relatives of target nodes X and Y with known link status. E.g., relative A is close to X and relative B is close to Y, where the link value between A and B is known.
3. For each link found, compute a weight based on distance.
4. Compute a certainty index for a predicted link by combining the weighted link values, with a discount for negative evidence.
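The four steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Food Web Constructor's actual code: the toy tree, the reading of "steps up" and "steps down" as distances via the nearest common ancestor, the 1 / (1 + distance) weight, and the 0.5 discount value are all hypothetical choices made here so the example runs.

```python
from itertools import product

# Hypothetical toy tree (child -> parent); not data from the talk.
PARENT = {"X": "G1", "A": "G1", "Y": "G2", "B": "G2", "G1": "Root", "G2": "Root"}


def ancestors(taxon):
    """Return [taxon, parent, grandparent, ...] up to the root."""
    path = [taxon]
    while path[-1] in PARENT:
        path.append(PARENT[path[-1]])
    return path


def steps(a, b):
    """Return (steps up from a, steps down to b) via their nearest common ancestor."""
    path_a, path_b = ancestors(a), ancestors(b)
    for up, node in enumerate(path_a):
        if node in path_b:
            return up, path_b.index(node)
    raise ValueError("taxa are not in the same tree")


def relatives(target, known_taxa, max_up, max_down):
    """Step 2: taxa with known links that lie within the distance thresholds."""
    found = {}
    for taxon in known_taxa:
        up, down = steps(target, taxon)
        if up <= max_up and down <= max_down:
            found[taxon] = up + down
    return found


def certainty(x, y, known_links, max_up=2, max_down=4, negative_discount=0.5):
    """Steps 3-4: combine distance-weighted evidence into a score in [-1, 1].

    known_links maps (predator, prey) -> True (link observed) or False (no link).
    """
    predators = {p for p, _ in known_links}
    preys = {q for _, q in known_links}
    rel_x = relatives(x, predators, max_up, max_down)
    rel_y = relatives(y, preys, max_up, max_down)
    total, weight_sum = 0.0, 0.0
    for (a, dist_a), (b, dist_b) in product(rel_x.items(), rel_y.items()):
        if (a, b) not in known_links:
            continue
        weight = 1.0 / (1 + dist_a + dist_b)  # assumed form: closer relatives count more
        value = 1.0 if known_links[(a, b)] else -negative_discount  # assumed discount
        total += weight * value
        weight_sum += weight
    return total / weight_sum if weight_sum else 0.0


# A eats B is known; X is a sibling of A and Y is a sibling of B.
print(certainty("X", "Y", {("A", "B"): True}))
```

A positive score suggests a link between X and Y, a negative score suggests none, and zero means no usable evidence was found within the thresholds.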

Food web database
4600 distinct taxa
Food web data: Cohen 1989, Dunne et al. 2006, Vazquez 2006, Jonsson et al.
Evolutionary tree: Parr et al.; plants from ITIS; plus a hierarchy of non-taxonomic nodes

Testing the algorithm
Take each web out of the database, attempt to predict its links, and compare the prediction with the actual data.
Accuracy (percentage of all predictions that are correct): 89%
Precision (percentage of predicted links that are correct): 55%
Recall (percentage of actual links that are predicted): 47%
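The three measures follow directly from counts of true and false predictions. A minimal sketch, assuming links are represented as (predator, prey) pairs and that every candidate pair counts as one prediction (link or no link); the species names are placeholders, not data from the talk:

```python
from itertools import product


def evaluate(predicted, actual, candidates):
    """Score predicted links against actual links over all candidate pairs."""
    tp = len(predicted & actual)   # predicted and really present
    fp = len(predicted - actual)   # predicted but absent
    fn = len(actual - predicted)   # present but missed
    tn = len(candidates) - tp - fp - fn
    return {
        "accuracy": (tp + tn) / len(candidates),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


species = ["a", "b", "c"]
candidates = set(product(species, repeat=2))  # 9 ordered pairs
actual = {("a", "b"), ("b", "c")}
predicted = {("a", "b"), ("a", "c")}
print(evaluate(predicted, actual, candidates))
```

Because most candidate pairs in a food web are non-links, accuracy can be high even when precision and recall are moderate, which is consistent with the 89% versus 55% and 47% figures on the slide.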

Choosing parameters
30-web subsample, representative of habitats, years, number of nodes, and percent identified to species
Iterate over parameter settings
Tradeoff between precision (percentage of predicted links that are correct) and recall (percentage of actual links that are predicted)
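Iterating over parameter settings amounts to a grid sweep that records precision and recall for each setting so the tradeoff can be inspected. The sketch below assumes a caller-supplied function, here called `evaluate_setting` (a hypothetical name), that runs the predictor on the subsample and returns a (precision, recall) pair; the parameter ranges are placeholders.

```python
from itertools import product


def sweep(evaluate_setting, steps_up=(1, 2, 3), steps_down=(1, 2, 3, 4)):
    """Evaluate every (steps_up, steps_down) pair and collect the results."""
    rows = []
    for up, down in product(steps_up, steps_down):
        precision, recall = evaluate_setting(up, down)
        rows.append(
            {"steps_up": up, "steps_down": down,
             "precision": precision, "recall": recall}
        )
    return rows
```

No selection rule is built in, since the slides describe the choice only as a tradeoff between precision and recall.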

Evolutionary distance threshold: 2 steps up and 4 steps down
[Figure: panels for precision and recall, with axes labeled steps up and steps down]

Evolutionary direction penalty: predictions are not very sensitive to it
[Figure labels: ancestor, descendant, siblings]

Negative evidence discount: predictions are sensitive to it

Results over all webs

Is evolutionary distance weighting better than strict database search?
Paired t-tests, df = 251, *** p < 0.001
[Figure: percentages for database search versus evolutionary distance weighting, with differences marked ***]
Database search is more precise, but evolutionary distance weighting has better recall.
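For reference, a paired t-test compares two methods on the same webs by testing whether the mean per-web difference is zero: t is the mean difference divided by its standard error, with n - 1 degrees of freedom (so df = 251 corresponds to 252 paired webs). A minimal sketch using only the standard library, with made-up numbers rather than the talk's data:

```python
import math
import statistics


def paired_t(sample_a, sample_b):
    """Return (t statistic, degrees of freedom) for paired samples."""
    if len(sample_a) != len(sample_b):
        raise ValueError("paired samples must have the same length")
    diffs = [a - b for a, b in zip(sample_a, sample_b)]
    n = len(diffs)
    # Standard error of the mean difference; raises ZeroDivisionError
    # below if all differences are identical.
    std_error = statistics.stdev(diffs) / math.sqrt(n)
    return statistics.mean(diffs) / std_error, n - 1


# Hypothetical per-web scores for two methods on four webs.
t, df = paired_t([5, 6, 7, 4], [4, 4, 5, 3])
print(t, df)
```

Converting t to a p-value needs the t distribution, which the standard library does not provide, so that step is left out here.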

Older webs contribute
Recall (percentage of actual links that are predicted): 47% → 48% with no EcoWEB data
Precision (percentage of predicted links that are correct): 55% → 39% with no EcoWEB data

…but large webs are harder to predict
Large webs have better taxonomic resolution
Recent webs are bigger
Large webs have fewer unknown “taxa”

Some phyla are easier to predict than others

How can we do better predicting links?
Trait space distance weighting: Euclidean distance in natural history N-space
Parameterize functions from the literature that might predict links using characteristics of taxa, for example size or stoichiometry.
$\mathrm{LinkStatus}_{AB} = f(\alpha, \mathrm{size}_A, \mathrm{size}_B),\ f(\beta, \mathrm{stoich}_A, \mathrm{stoich}_B),\ \ldots$
…need more data
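Euclidean distance in an N-dimensional trait space is straightforward to compute once each taxon has a numeric value for every trait. A small sketch, assuming hypothetical trait names (`log_mass`, `c_to_n`) and made-up values; in practice traits would likely need rescaling so that no single one dominates the distance:

```python
import math


def trait_distance(traits_a, traits_b, keys):
    """Euclidean distance between two taxa over the named traits."""
    return math.dist(
        [traits_a[k] for k in keys],
        [traits_b[k] for k in keys],
    )


# Hypothetical trait values for two taxa.
taxon_a = {"log_mass": 0.0, "c_to_n": 3.0}
taxon_b = {"log_mass": 4.0, "c_to_n": 0.0}
print(trait_distance(taxon_a, taxon_b, ["log_mass", "c_to_n"]))
```

Such a distance could stand in for the step count in evolutionary distance weighting, which is why the slide notes that this approach needs more data: every taxon must have measured traits.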

ETHAN: Evolutionary Trees and Natural History ontology
Animal Diversity Web: geographic range, habitats, physical description, reproduction, lifespan, behavior and trophic info, conservation status
Triples:
“Esox lucius” hasMaxMass “1.4 kg”
“Esox lucius” isSubclassOf “Esox”
“Esox” eats “Actinopterygii”
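A triple is a (subject, predicate, object) statement, and querying a set of triples means matching a pattern in which some positions are left open. The sketch below stores the three example triples as Python tuples and matches patterns against them; it illustrates the data model only and is not how ETHAN or the Triple Shop is implemented.

```python
# The three example triples from the slide, as (subject, predicate, object).
TRIPLES = [
    ("Esox lucius", "hasMaxMass", "1.4 kg"),
    ("Esox lucius", "isSubclassOf", "Esox"),
    ("Esox", "eats", "Actinopterygii"),
]


def match(triples, subject=None, predicate=None, obj=None):
    """Return triples matching the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (subject is None or t[0] == subject)
        and (predicate is None or t[1] == predicate)
        and (obj is None or t[2] == obj)
    ]


print(match(TRIPLES, subject="Esox lucius"))
print(match(TRIPLES, predicate="eats"))
```

A SPARQL query is a richer version of the same idea: several patterns joined on shared variables.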

UMBC Triple Shop: Query
What are body masses of fishes that eat fishes?
Enter a SPARQL query:
SELECT DISTINCT ?predator ?prey ?preymaxmass ?predatormaxmass
WHERE {
  ?link rdf:type spec:ConfirmedFoodWebLink .
  ?link spec:predator ?predator .
  ?link spec:prey ?prey .
  ?predator rdfs:subClassOf ethan:Actinopterygii .
  ?prey rdfs:subClassOf ethan:Actinopterygii .
  OPTIONAL { ?predator kw:mass_kg_high ?predatormaxmass } .
  OPTIONAL { ?prey kw:mass_kg_high ?preymaxmass }
}
... leaving out the FROM clause

UMBC Triple Shop: Create a dataset
Find semantic web docs that can answer the query:
Actinopterygii.owl
webs_publisher.php?published_study=11
Esox_lucius.owl

UMBC Triple Shop: Get results
Apply query to dataset with semantic reasoning.

Summary
Food Web Constructor uses an evolutionary approach and large databases
We chose parameters using a subsample
We explored results over the entire database
Evolutionary distance weighting recalls links better than database search
Older webs are useful
Large webs are harder to predict
Some phyla are easier than others to predict
For future algorithms, we can gather and integrate data via ontologies and intelligent agents

UMBC: Tim Finin, Joel Sachs, Andriy Parafiynyk, Li Ding, Rong Pan, Lushan Han
UMCP: David Wang
RMBL: Neo Martinez, Rich Williams, Jennifer Dunne
UC Davis: Jim Quinn, Allan Hollander
UMMZ Animal Diversity Web: Phil Myers, Roger Espinosa
UMCP: Bill Fagan, Bongshin Lee, Ben Bederson

ETHAN workflow
[Figure labels: ADW database (MySQL), XSLT template, ADW taxon acct (HTML), Keywords (HTML), ETHAN taxon acct (OWL), SPIRE taxon database (MySQL), Evolutionary Tree side of ontology (OWL), phylum-sized ET chunk (OWL), Taxon Path (OWL), Filters, acct data (tabular text), Others, ITIS, Plants etc., Animal name tree, Keywords (OWL)]

Semantic Prototypes In Ecoinformatics
UMBC, U Maryland, NASA Goddard, Rocky Mtn Bio Lab, UC Davis
[Figure labels: Semantic Web Tools, Info. Retrieval Agents, Food Web Constructor, Evidence Provider, Invasive Species Forecasting System, Remote Sensing Data, Food Webs, Ecological Interaction Ontologies, Species List Constructor]

Food Web Constructor example: Nile Tilapia in St. Marks
Question: What are potential predators and prey of Oreochromis niloticus in the St. Marks estuary in Florida?
Procedure: Submit species list for St. Marks, with Oreochromis niloticus added.

Food Web Constructor generates possible links

Evidence provider gives details

Nile tilapia – what organisms could be impacted?

Implications: parameterized functions
Requires good data for target species
Can incrementally add natural history functions to get a better estimate, try different functions from the literature, or use genetic algorithms
Parameterizing functions: multivariate statistics, machine learning, fuzzy inference
Could use evolutionary info if you localize parameter estimates to clades or taxonomic subsets
$\mathrm{LinkPredicted}_{CD} = f(\alpha, \mathrm{size}_C, \mathrm{size}_D) + f(\beta, \mathrm{stoich}_C, \mathrm{stoich}_D)$

Distance weighting options: Evolutionary
Uses phylogeny or classification or a combination of these; assumes related organisms are like each other
Distance could be branch length or number of steps
Does not need natural history data
[Figure labels: X, Y, 2 steps, 3 changes]

Ontologies
Richer way to design databases: instances of concepts that have well-defined meanings and formal relationships.
“Taxon A” hasBreedingDuration “5 months”
“Taxon A” hasAgeOfSexualMaturity “1 year”
“Higher Taxon” lives in “Australia”
“Taxon B” lives in “Australia”
“Taxon A” lives in “Australia”
[Figure labels: Breeding Season, Reproductive Characteristic, Breeding Duration, Sexual maturity, Age of Sexual Maturity, HigherTaxon, TaxonA, TaxonB; relations: is-a, has-a]