An update on the Transforming Taxonomic Interfaces Initiative Matt Yoder Michael Twidale Andrea Thomer Kenney Guo.

Slides:



Advertisements
Similar presentations
Yansong Feng and Mirella Lapata
Advertisements

The Process of Data Ingestion in ÆKOS Andrew Graham and Matt Schneider TERN Ecoinformatics Data Analysts Logos used with consent. Content of this presentation.
The Library of Life Federated Description Services and the Library of Life or What can we do with SDD anyway? Kevin Thiele Centre for Biological Information.
Virtual University - Human Computer Interaction 1 © Imran Hussain | UMT Imran Hussain University of Management and Technology (UMT) Lecture 16 HCI PROCESS.
Publish or perish? Linking Scratchpads and the new Biodiversity Data Journal for streamlining publication of botanical data D.N Koureas 1, L. Penev 2 &
Robert J. Mislevy & Min Liu University of Maryland Geneva Haertel SRI International Robert J. Mislevy & Min Liu University of Maryland Geneva Haertel SRI.
1 DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen, Germany.
1 Richard White Design decisions: architecture 1 July 2005 BiodiversityWorld Grid Workshop NeSC, Edinburgh, 30 June - 1 July 2005 Design decisions: architecture.
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
Jennifer A. Dunne Santa Fe Institute Pacific Ecoinformatics & Computational Ecology Lab Rich William, Neo Martinez, et al. Challenges.
Object Oriented Analysis Process
1 CS 430 / INFO 430 Information Retrieval Lecture 24 Usability 2.
Chapter 2: Understanding and conceptualizing interaction
Part 2: Requirements Days 7, 9, 11, 13 Chapter 2: How to Gather Requirements: Some Techniques to Use Chapter 3: Finding Out about the Users and the Domain.
What is a prototype? A prototype is a small scale model of your larger product. Can be a physical object, or a simple software program. Many physical.
Linking collections to related resources: Multi-scale, multi-dimensional, multi-disciplinary collaborative research in biodiversity. Is this a “Big.
IBE312: Ch15 Building an IA Team & Ch16 Tools & Software 2013.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Moving beyond free text. Authors Scientist does research Scientist publishes research results in journal article Old Paradigm:
Fourth Annual Summit | Feb | Tucson, AZ Scratchpads for community involvement for natural history collections Dr Dimitris Koureas Biodiversity.
Chapter 2: Understanding and conceptualizing interaction Question 1.
Christian Wolff Regensburg University, Media Computing FGIR Workshop Hildesheim University, October 2006 Information Retrieval is for Everybody.
Scratchpads Publication Module - A paradigm shift in publishing RBG Kew, Seminar,
Managing the Record of Research At the Smithsonian Using SIdora SAA Research Forum August 12, 2014.
Free Open-Source, Open- Platform System for Information Mash-Up and Exploration in Earth Science Tawan Banchuen, Will Smart, Brandon Whitehead, Mark Gahegan,
IS 466 ADVANCED TOPICS IN INFORMATION SYSTEMS LECTURER : NOUF ALMUJALLY 22 – 10 – 2011 College Of Computer Science and Information, Information Systems.
Research Data Management At the Smithsonian Using SIdora Nano Tech Working Group May 15, 2014.
Software Project Failure Software Project Failure Night Two, Part One CSCI 521 Software Project Management.
CDM Developer Workshop. TDWG Andreas Kohlbecker Taxonomic Workflow in the EDIT Platform for Cybertaxonomy Purpose What do you want from this workshop?
PLATFORM INDEPENDENT SOFTWARE DEVELOPMENT MONITORING Mária Bieliková, Karol Rástočný, Eduard Kuric, et. al.
Chapter 10  2000 by Prentice Hall Information Systems for Managerial Decision Making Uma Gupta Introduction to Information Systems.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Requirements Engineering Requirements Elicitation Process Lecture-8.
Connecting Specimens, Images and Vocabulary Specify, Morphbank, Morphster Beach, Noble, Spears – KU Mast, Riccardi – FSU Miranker, Tirmizi UT.
Chapter 13 Architectural Design
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
Chad Berkley NCEAS National Center for Ecological Analysis and Synthesis (NCEAS), University of California Santa Barbara Long Term Ecological Research.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Relevance Feedback in Image Retrieval Systems: A Survey Part II Lin Luo, Tao Huang, Chengcui Zhang School of Computer Science Florida International University.
Chapter 4 프로세스 모델 Process Models
User Interfaces 4 BTECH: IT WIKI PAGE:
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Scientific Workflow systems: Summary and Opportunities for SEEK and e-Science.
Information Dynamics & Interoperability Presented at: NIT 2001 Global Digital Library Development in the New Millennium Beijing, China, May 2001, and DELOS.
User Requirements and Engagement in Health Informatics Alistair Sutcliffe Sarah Thew, Oscar De Bruijn, Manchester Business School, Jock McNaught National.
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
Research Data Management At the Smithsonian Using Sidora CNI December 10, 2013.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
Riccardi: DIALOGUE Workshop August 1, 2005 Supported by NSF BDI 1 Representing and Using Phylogenetic Characters in Morphbank Greg Riccardi, David Gaitros,
Requirement engineering & Requirement tasks/Management. 1Prepared By:Jay A.Dave.
1 CEN 4020 Software Engineering PPT4: Requirement analysis.
NMFS Use Case 1 review/ evaluation and next steps April 19, 2012 Woods Hole, MA Peter Fox (RPI* and WHOI**) and Andrew Maffei (WHOI) *Tetherless World.
The Agricultural Ontology Server (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Food and Agriculture Organization.
Coordination and Policy Development in Preparation for a European Open Biodiversity Knowledge Management System Supported by the European Commission through.
Metadata: Formal and Informal GY7025 Geographical Database Design.
Elaboration popo.
User Characterization in Search Personalization
Development of the Amphibian Anatomical Ontology
Phenoscape Data Jamboree 2
Specifying collaborative decision-making systems
Introduction.
Terms: Data: Database: Database Management System: INTRODUCTION
Time to change how we describe biodiversity
Presentation transcript:

An update on the Transforming Taxonomic Interfaces Initiative Matt Yoder Michael Twidale Andrea Thomer Kenney Guo

(seeking feedback)

Quick History of TTI Describing phenotypes semantically Hymenoptera Anatomy Ontology unify a complex lexicon Application of the HAO to Taxonomy very “old” science monographic/comprehensive

Why richer semantics? Species are numerous, rarely studied, maximize work Giant integrated databases Nanopublications Populate the graph of life with inference Calculate species limits

“Rapid prototyping of semantic enhancements to biodiversity informatics platforms” CSCW - Conceptual paper “Semantic Express” - Can I express X? Formalized suite of taxonomists end use requirements Catalog of annotated wireframe interfaces Formalized description of how to rapidly prototype features … all this catalyze development of workbenches for taxonomists

Distilling interviews ~ 35 taxonomists Monographers, integrative, still “novice” after 3 years Both confirm and ignore frustrations with “relative” phenotype descriptors Lean on images (stacks!) Figure out their own workflow “Rebooting wet-cycles” - picking up is not instantaneous Validation over multiple sources

Distilling interviews Digital work is overwhelmingly desktop based (!cloud) Paper is ubiquitous, so is excel “Digitization” can be extramural Enter and refine The myth of the collaborative taxonomy?!

Taxonomic Interfaces Hackath on Idea 12 people, proposing ideas for application interfaces, documented in workbooks Outcome: a

Representing data (Kenney) Simple Example Natural Language Semantic Annotation (CharaParser) EQ using termsEQ using term IDsEQ/Manchester EntityQualityEntityQuality Notaulus shape: falciform notaulusfalciformHAO: PATO: has part some (notaulus and (is bearer of some falciform))

Representing data (Kenney) Complex Example Natural Language Semantic Annotation (CharaParser)EQ using termsEQ using term IDsEQ/Manchester EntityQualityEntityQuality Malar space length: longer than 0.5 of compound eye height malar linelengthHAO: PATO: has part some (malar line and (is bearer of some (length and (has_measurement some ((has_unit some (length and (inheres in some compound eye))) and (has_magnitude some float[>= 0.5f]))))))

Representing data (Kenney) Super Complex Example “the antero-dorsally projecting process of the posterior maxilla is located posterior to the laterally projecting and bent knob of the ethmoid in species X” (Dahdul et al., 2010)

Representing data (Kenney) Human Variants ●“dermal sculpture on skull-roof weak” ○curator 1: PATO: poorly developed ○curator 2: PATO: decreased magnitude ○curator 3: PATOTEMP: weakly sculptured surface

Taxonomic information work Data are both collected (specimens, measurements, molecular sequences) and created (coded traits) through skill and judgment What to record, what not; what’s the difference that makes a difference (to tell a species apart)? Species are not described in isolation: New species concepts are situated amongst existing concepts and literature Literally placed in a graph structure As much a conceptual engineering task as descriptive from

Taxonomic information work When a researcher works with data, it’s not just through data seeking, entry or retrieval Other data sources (species descriptions, matrices, specimen records) are assessed, integrated and improved Middle ground between information seeking and retrieval? Information use? Information interaction? Information engineering? Semantic engineering?

Representations of Confidence trusting your own work (e.g. using terms like “circa”) trusting another’s (in a team) work trusting another’s (in another team) work

Workflows and Flows of Work { picture redacted }

Workflows and Flows of Work Keeping track of the work Capturing Context What doesn’t get written down How much? What? Why should we worry? What taxonomists do: They observe a lot more than they write down

Potential Implications for LIS How do other domains of data and metadata represent uncertainty? How might taxonomists’ data practices inform data curation best practices? Taxonomy data = very long-lived data! Might other fields learn something from its management? Finding physical/digital analogs/parallels - “tagging” (notes on specimens, notes on data) so that users feel comfortable with the paradigm Software development considerations- how do you engineer big complex applications for the integrative taxonomist?

Acknowledgements Funding: NSF-DBI Collaborating taxonomists