Information Artifact Ontology: General Background Barry Smith 1.

Slides:



Advertisements
Similar presentations
Ontology Assessment – Proposed Framework and Methodology.
Advertisements

Database Systems: Design, Implementation, and Management Tenth Edition
Use Case & Use Case Diagram
Barry Smith University at Buffalo NY, USA Tatiana Malyuta
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
Social Event, Monday, November 18, 6pm. The Information Artifact Ontology: Roots in BFO Barry Smith October 14, 2013.
Object-Oriented Analysis and Design
Lecture 13 Revision IMS Systems Analysis and Design.
introduction to MSc projects
Introduction to Databases Transparencies
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
How to Organize the World of Ontologies Barry Smith 1.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
Lecture Nine Database Planning, Design, and Administration
Module 2b: Modeling Information Objects and Relationships IMT530: Organization of Information Resources Winter, 2007 Michael Crandall.
Course Instructor: Aisha Azeem
Configuration Management
Enterprise Architecture
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Basic Concepts The Unified Modeling Language (UML) SYSC System Analysis and Design.
Educator’s Guide Using Instructables With Your Students.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
From speech acts to document acts: an ontology of institutions
Tne Role of Ontologies in Military Collaboration Barry Smith 1.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Exploring a topic in depth... From Reading to Writing The drama Antigone was written and performed 2,500 years ago in a society that was very different.
Ontological realism as a strategy for integrating ontologies Ontology Summit February 7, 2013 Barry Smith 1.
Outcome Based Evaluation for Digital Library Projects and Services
Lecture 3 BFO: A Standard Upper Level Ontology. 2 BFO als standard upper-level Ontologie Introduces Basic Formal Ontology (BFO) Shows how BFO is extended.
1 Technologies for distributed systems Andrew Jones School of Computer Science Cardiff University.
Towards an Ontology of Military Plans and Planning 2 Barry Smith National Center for Ontological Research, Buffalo.
The Information Artifact Ontology
Brian Donohue, J. Neil Otte, and Barry Smith University at Buffalo November 2014.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
1 HL7 RIM Barry Smith
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
Introduction to Omeka. What is Omeka? - An Open Source web publishing platform - Used by libraries, archives, museums, and scholars through a set of commonly.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
1 Introduction to Software Engineering Lecture 1.
LIS654 lecture 5 DC metadata and omeka tables Thomas Krichel
Software Engineering Prof. Ing. Ivo Vondrak, CSc. Dept. of Computer Science Technical University of Ostrava
1 15 quality goals for requirements  Justified  Correct  Complete  Consistent  Unambiguous  Feasible  Abstract  Traceable  Delimited  Interfaced.
UML Class Diagram Trisha Cummings. What we will be covering What is a Class Diagram? Essential Elements of a UML Class Diagram UML Packages Logical Distribution.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
Information Artifact Ontology: General Background Barry Smith 1.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Digital Library The networked collections of digital text, documents, images, sounds, scientific data, and software that are the core of today’s Internet.
Need for common standard upper ontology
IAO June Representations = ideas, documents, oil paintings; always about something Representational units = the smallest representations (atoms.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lotzi Bölöni.
DESIGNING AN ARTICLE Effective Writing 3. Objectives Raising awareness of the format, requirements and features of scientific articles Sharing information.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
Information Artifact Ontology Barry Smith 1.
UTA/ARRI. Enterprise Engineering for The Agile Enterprise Don Liles The University of Texas at Arlington.
Basic Formal Ontology Barry Smith August 26, 2013.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Linked Data Publishing on the Semantic Web Dr Nicholas Gibbins
Understanding the Value and Importance of Proper Data Documentation 5-1 At the conclusion of this module the participant will be able to List the seven.
Global Rangelands Data Entry Guidelines March 23, 2015.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Using Use Case Diagrams
Information Analysis, Organization, and Presentation
Presentation transcript:

Information Artifact Ontology: General Background Barry Smith 1

Military Doctrine and Standardization of Terminology 3rd Century BC Standardized beacon signals used by Chinese military along Great Wall 1792 Drill manual for the units of the Continental Army to respond uniformly to commands during the Revolutionary War 1943 General James Gavin’s Training Memorandum on the Employment of Airborne Forces 2

General James Gavin, On to Berlin: Battles of an Airborne Commander for success of the D-Day invasion ‘one of our most critical needs was to standardize the operating practices of our forces. … even simple terminology had to be agreed upon. … British flew in what they called “bomber stream” formations, We preferred troop-carrier group formations of 36 planes that flew in a V... We referred to landing area as the “jump area,” the British called it “drop zone,” …’ 3

4

5

Current state DOD Dictionary of Military and Associated Terms (Joint Publication 1-02) New military dictionaries and terminology artifacts continue to be developed Dominant ethos: Library Science (all terminologies are equal), Lexicography (logical consistency of definitions is not important) 6

Two kinds of data 1.Data about entities in the world (topics, subject-matters) standard ontologies 2. Data about the information artifacts in which these entities are represented (= metadata) Information Artifact Ontology and extensions, including IAO-Intel 7

Information Content Entities (ICEs) ICEs are about something in reality (they have this something as a subject; they represent, or mention or describe this something; they inform us about this something). Aboutness may be identifiable from different perspectives. Thus one analyst may interpret a given ICE as being about the geography of a given encampment; another may view it as providing information about the morale of those encamped there.

Information artifact (roughly) an entity created through some deliberate act or acts by one or more human beings, and which endures through time, potentially in multiple (for example digital or printed) copies Examples: a diagram on a sheet of paper, a video file, a map on a computer monitor, an article in a newspaper, a message on a network, the output of some querying process in a computer memory 9

What IAO is for IAO is not designed to replace existing ontological or other standards lots of documents exist conforming to lots of different standards purpose of IAO is to allow generation of the needed metadata in a uniform, non-redundant and algorithmically processable fashion 10

Sample terms in IAO Report Summary Diagram Overlay Assessment Estimate List Order Matrix Template 11

Attributes of Information Artifacts Examples – Purpose – Life­cycle Stage (draft, finished version, revision) – Language, – Format – Provenance – Source (person, organization) These are generic attributes, common to all areas IAO will contain a Low-Level Ontology module for each dimension 12

Generic Purpose Attributes – Descriptive purpose: scientific paper, newspaper article, after-action report – Prescriptive purpose: legal code, license, statement of rules of engagement – Directive purpose (of specifying a plan or method for achieving something): instruction, manual, protocol – Designative purpose: a registry of members of an organization, a phone book, a database linking proper names of persons with their social security numbers 13

Other IAO-Intel Attribute Dimensions Role in the Intelligence Process (JP 3-0, III-11) Priority Intelligence Requirement (PIR) Commander’s Critical Information Requirement (CCIR) Essential Element of Information (EEI) Essential Element of Friendly Information (EEFI) Confidence Level (JP 2.0, Appendix A) Highly Likely Likely Even Chance Unlikely Highly Unlikely Discipline (JP 2.0, I-5) Legal Ideology Religion Propaganda Intelligence Signal Human Rumor intelligence Web intelligence Intelligence Excellence (JP 2.0, II-6) Anticipatory Timely Accurate Usable Complete Relevant Objective Available

Use of IAO-Intel – Example: Digitalizing an MCOO IA #1 - Modified Combined Obstacle Overlay (MCOO) - a joint intelligence preparation of the operational environment product used to portray the militarily significant aspects of the operational environment, such as obstacles restricting military movement, key geography, and military objectives. 15

Digitalizing an MCOO Annotations to the attributes of IA#1 – ICE: MCOO – IBE: Acetate Sheet – uses-symbology MIL-STD-2525C – authored-by person #4644 Annotations relating to the aboutness of IA#1 – Avenue of Approach – Strategic Defense Belt – Amphibious Operations – Objective 16

Anatomy Ontology (FMA*, CARO) Environment Ontology (EnvO) Infectious Disease Ontology (IDO*) Biological Process Ontology (GO*) Cell Ontology (CL) Cellular Component Ontology (FMA*, GO*) Phenotypic Quality Ontology (PaTO) Subcellular Anatomy Ontology (SAO) Sequence Ontology (SO*) Molecular Function (GO*) Protein Ontology (PRO*) Extension Strategy + Modular Organization top level mid-level domain level Information Artifact Ontology (IAO) Ontology for Biomedical Investigations (OBI) Spatial Ontology (BSPO) Basic Formal Ontology (BFO) 17

IAO-ScienceIAO-Intel IAO-Computing IAO- Biology IAO- Physics IAO- Intel- Navy IAO- Intel- Army IAO- Intel- FBI IAO- Software EMO- Ontology Each module built by downward population from its parent top level mid-level (generic hub) domain level (spokes populating downwards) Information Artifact Ontology(IAO) Basic Formal Ontology (BFO) 18

Users of BFO Examples AIRS Ontologies cROP Ontologies MilPortal Ontologies NIF Standard Ontologies OBO Foundry Ontologies OAE Ontology of Adverse Events EnvO Emotion Ontology IDO Infectious Disease Ontology (NIAID) US Army Biometrics Ontology 19

Continuant 20 BFO Occurrent

Continuant 21 BFO Occurrent Independent Continuant Specifically Dependent Continuant Generically Dependent Continuant

22 BFO Occurrent Independent Continuant Specifically Dependent Continuant is tied to just one bearer Generically Dependent Continuant

23 BFO Occurrent Independent Continuant Specifically Dependent Continuant is tied to just one bearer Generically Dependent Continuant can migrate from one bearer to another

Continuant 24 BFO Occurrent Independent Continuant Specifically Dependent Continuant Generically Dependent Continuant universals instances this man, that book this excitation pattern, that pattern of piles of ink this gene sequence, this digital image

Continuant Independent Continuant Specifically Dependent Continuant Quality 25 Generically Dependent Continuant Material Entity BFO DispositionRole

Continuant Independent Continuant Specifically Dependent Continuant Quality 26 Generically Dependent Continuant Material Entity Information Bearing Entity Information Quality Entity depends_on BFO IAO

Continuant Independent Continuant Specifically Dependent Continuant Quality Information Content Entity 27 Generically Dependent Continuant Material Entity BFO IAO

Continuant Independent Continuant Specifically Dependent Continuant Quality Information Content Entity 28 Generically Dependent Continuant Material Entity Information Bearing Entity Information Quality Entity depends_on concretized_by BFO IAO

Independent Continuant Specifically Dependent Continuant Quality Information Content Entity 29 Generically Dependent Continuant Material Entity Information Bearing Entity Information Quality Entity depends_on concretized_by universals instances this hard drive, that book this excitation pattern, that pattern of piles of ink this pdf file this digital image

located near Latrine Well ‘VT ’ Distance Measurement Result Village Name ‘Khanabad Village’ Village is_a instance_ of Geopolitical Entity Spatial Region Geographic Coordinates Set designates instance_of located in instance_of has location designates has location instance_of ’16 meters’ instance_of measurement_of 30 Universals and Instances (from Bill Mandrick)

IAO and BFO BFO: Generically Dependent Continuant BFO: Independent Continuant BFO: Specifically Dependent Continuant Information Content Entity (ICE) Information Quality Entity (Pattern) (IQE) Information Structure Entity (ISE) Information Bearing Entity (IBE) 31

Information Artifacts artifact =def. an entity created through some deliberate act or acts by one or more human beings and which endures through time information artifact: an artifact that created to serve as a bearer of information (a) information bearing entity (IBE) – a hard drive, a passport, a piece of paper with a drawing of a map (b) information content entity (ICE) – an entity which is about something and which can potentially exist in multiple (for example digital or printed) copies – a jpg file, a pdf file

IAO: information content entity =def. an entity that is generically dependent on some artifact and stands in the relation of aboutness to some entity Problems of non-referring information entities Problems of information structure entities 33

Types and tokens Copyable information artifacts can exist both as tokens Peirce and as types Peirce Token = the particular information artifact of interest, tied to some particular physical information bearer: the photographic image on this piece of paper retrieved from this enemy combatant Type = The copyable information content that is carried by the artifact in question. The same photographic image type may be printed out in multiple paper tokens Warning: this is not the same as the instance-class distinction

The Dublin Core: How not to solve the problem of creating consistent information artifact metadata

Dublin Core Metadata Initiative (DCMI) an open organization supporting innovation in metadata design and best practices across the metadata ecology Resource (as in ‘RDF’) + 15 basic ‘elements’: 0. RESOURCE8. TYPE 1. TITLE9. FORMAT 2. CREATOR 10. IDENTIFIER 3. SUBJECT 11. SOURCE 4. DESCRIPTION 12. LANGUAGE 5. PUBLISHER 13. RELATION 6. CONTRIBUTORS14. COVERAGE 7. DATE 15. RIGHTS MANAGEMENT

Dublin Core Metadata Initiative (DCMI) An open organization supporting innovation in metadata design and best practices across the metadata ecology

The Core Resource (as in ‘RDF’) + 15 basic ‘elements’: 0. RESOURCE8. TYPE 1. TITLE9. FORMAT 2. CREATOR 10. IDENTIFIER 3. SUBJECT 11. SOURCE 4. DESCRIPTION 12. LANGUAGE 5. PUBLISHER 13. RELATION 6. CONTRIBUTORS14. COVERAGE 7. DATE 15. RIGHTS MANAGEMENT

1) What’s a “resource”?resource A resource is anything that has identity. Familiar examples include an electronic document, an image, a service (e.g., "today's weather report for Los Angeles"), and a collection of other resources. Assumption: resource = information artifact 2) How do “elements” apply to “resources”?elements An Element is a characteristic that a resource may “have”, such as a Title, Publisher, or Subject.

The same resource can be instantiated in different ways FormatFormat: The file format, physical medium, or dimensions of the resource. Examples of dimensions include size and duration. Recommended best practice is to use a controlled vocabulary such as the list of Internet Media Types [MIME]. Example: image/jpeg.MIME The Core (cont.)

What describes the content / topic / subject-matter? TitleTitle: The name given to the resource. DescriptionDescription: An account of the content of the resource. Description may include but is not limited to: an abstract, table of contents, reference to a graphical representation of content or a free-text account of the content. SubjectSubject: The topic of the content of the resource. Typically, a subject will be expressed as keywords or key phrases or classification codes that describe the topic of the resource. The Core (cont.)

Benefits of Dublin Core Available in multiple formats W3C recommended Mapping to PROV

Problems with Dublin Core Scope not defined (‘anthing that has identity’) Does not provide logical definitions, but relies rather on vague natural language expressions (including use of “scare” “quotes” to warn the user that terms are not intended literally) Provides only suggestive guidance as to use of associated standards Does not interoperate well with other (topic) ontologies

Confuses words and things Source: A reference to a resource from which the present resource is derived. The present resource may be derived from the Source resource in whole or part. Source

Engages in sloppy bundling TypeType: The nature or genre of the content of the resource. Type includes terms describing general categories, functions, genres, or aggregation levels for content. What is ‘content of the resource’? Is the nature of the content distinct from the nature of the resource? No taxonomic organization, but rather a tangled hierarchy No distinction between things (continuants) and processes (occurrents) – consider performance of a work

Does not address the goals of a Metadata Ontology Ability to expand consistently to new application areas Ability to gracefully integrate with domain ontologies and with other IA-related ontologies Ability to represent metadata of different categories – Complex application-specific content specific ways in which one IA relates to another IA – Content vs. Bearers of content

Requirements to Achieve These Goals Conformance to ontology best practices – lopment_of_a_Shared_Semantic_Resource lopment_of_a_Shared_Semantic_Resource – st_Practices st_Practices – intro/pdf/5.%20Ontology%20Design.pdf intro/pdf/5.%20Ontology%20Design.pdf Conformance to an upper level ontology as starting point for coherent definitions Separation of aspects of an information artifact such as physical bearer, content, content organization

DC Does Not Conform to Best Practices Term Name: LocationPeriodOrJurisdiction URI: Label: Location, Period, or Jurisdiction Definition: A location, period of time, or jurisdiction. LOCATION PERIOD OR JURISDICTION is defined in the DC hierarchy as a subclass of LOCATION

Problems with verbal definitions – PROVENANCE – “A statement of any changes in ownership and custody of the resource since its creation that are significant for its authenticity, integrity, and interpretation.” – The same definition is applied to the class and the property: PROVENANCE STATEMENT that is the Range of PROVENANCE is defined in exactly the same way.

Does Not Conform to an ULO DC does not conform to an upper level ontology and does not show signs of downward development from more general to more specific terms. As a result – Generic element associations are absent or arbitrary or informal. – If such associations were established, they would need to be established manually instead of being inherited. For example, there are such classes as AGENT and AGENT CLASS where AGENT CLASS is defined as “A group of agents” but no formal relation with the class AGENT is asserted.

Does Not Conform to an ULO (cont.) In the absence of a high-level single hierarchy, the relations between classes are not clear. For example PROVENANCE is defined as “A statement of any changes in ownership and custody of the resource since its creation that are significant for its authenticity, integrity, and interpretation” seems to overlap with CREATOR, CONTRIBUTOR, and IS VERSION OF. But how?

Limited Usability of DC DC does not try to separately address such aspects of an information artifact as its physical bearer, content, and content organization Will not allow for rich explications and annotations of document repositories, in particular repositories of military documents, and for various classifications of documents that are based on the content or bearer

54 Shimon Edelman’s Riddle of Representation two humans, a monkey, and a robot are looking at a piece of cheese; what is common to the representational processes in their visual systems?

55 Answer: The cheese, of course

56 The real cheese

Each IA is concretized_by at least one IQE (Information Quality Entity) The same IA can be concretized in multiple different media (paper, silicon, neuron …) Concretization 57

Generically dependent continuants such as plans, laws … are concretized in specifically dependent continuants (the plan in your head, the protocol being realized by your research team, the law being implemented by this government agency) 58

Types and tokens A A A One type, three tokens A type is a pattern Patterns can be complex 59

fragment of the War and Peace pattern 60

War and Peace is an instance of the universal novel Specifically Dependent Continuant War and Peace quality 61 Independent Continuant This bound copy of War and Peace Generically Dependent Continuant The novel War and Peace instance_of depends_on concretized_by

Is War and Peace a kind or an instance? If War and Peace were a kind, and the copies of War and Peace in my library and in your library were instances, then there would be many War(s) and Peaces. Hence War and Peace is an instance. What is a work of literature? 62

There can be two copies of the US Declaration of Independence There cannot be two US Declarations of Independence There cannot be subkinds of the US Declaration of Independence Hence the US Declaration of Independent is an instance and not a kind. There are not two Declarations of Independence 63

Rule for universals Their names are pluralizable There can be three people There cannot be three Michelle Obamas. Information Content Entities are GDCs = entities which can exist in many copies 64

they have a different kind of provenance ◦ Aspirin as product of Bayer GmbH ◦ aspirin as molecular structure ◦ This Financial Report is submitted to the SEC Generically dependent continuants are distinct from universals 65

IAO and BFO BFO: Generically Dependent Continuant BFO: Independent Continuant BFO: Specifically Dependent Continuant Information Content Entity (ICE) Information Quality Entity (Pattern) (IQE) Information Structure Entity (ISE) Information Bearing Entity (IBE)

Information Bearing Entities – IBEs An IBE is a material entity that has been created to serve as a bearer of information. IBEs are either (1) self-sufficient material wholes, or (2) proper material parts of such wholes. Examples under (1): a hard drive, a paper printout (e.g., a report) Examples under (2): a specific sector on a hard drive, a single page of a paper printout.

Information Quality Entities (IQEs) An IQE is the pattern on an IBE in virtue of which it is a bearer of some information An IQE exists in a given IBE because of a certain patterned arrangement for example of ink or other chemicals, or of electromagnetic excitations. Every ICE is concretized by at least one IQE

Information Structure Entities (ISEs) Information Structure Entity (ISE) is a structural part of an ICE, for example an empty cell in a spread­sheet; or a blank Microsoft Word file. ISEs thus capture part of what is involved when we talk about the ‘format’ of an IA.

Organization of IAO-Intel – IA ‘IA’ refers either – to some combination of ICEs and ISEs (roughly: the IA as body of copyable information content); or – to some concreti­zation of ICEs and ISEs in some IBE in which some IQE inheres (the information artifact is: this content here and now, on this specific computer screen or this printed page). Different information artifact kinds will differ in different ways along these dimensions, as illustrated in Table 2.

IAIBEISEICE MS Word file (.doc,.docx) Hard drive (magnetized sector) MS Word format Varies KML file Hard drive (magnetized sector) KML Map overlay JPEG file (.jpg) Hard drive (magnetized sector) JPEG format Image file Hard drive (magnetized sector) Internet Message Format (e.g., RFC 5322 compliant) Message USMTF Message file A specific government network USMTF Format Message Passport Paper document; (may include photographs, RFID tags) ID formats, security marking formats … Name, Personal data, Passport number, Visas Title DeedOfficial paper documentVaries ReportVaries Overlay Sheet ( e.g. Map Overlay Sheet) Acetate sheet MIL-STD-2525 Symbols; FM Operational Terms and Graphics Map overlay

IAO and BFO BFO: Generically Dependent Continuant BFO: Independent Continuant BFO: Specifically Dependent Continuant Information Content Entity (ICE) Information Quality Entity (Pattern) (IQE) Information Structure Entity (ISE) Information Bearing Entity (IBE)

IAO and BFO (cont.) BFO relations between ICEs, ISEs, IQEs and IBEs can be set forth as follows: – ICE generically-depends-on IBE – ISE generically-depends-on IBE – IQE specifically-depends-on IBE – ICE concretized-by IQE – ISE concretized-by IQE IAO contains in addition relations which allow to formulate metadata concerning attributes of IAs such as author, creation date, classification status, and so forth

Anatomy Ontology (FMA*, CARO) Environment Ontology (EnvO) Infectious Disease Ontology (IDO*) Biological Process Ontology (GO*) Cell Ontology (CL) Cellular Component Ontology (FMA*, GO*) Phenotypic Quality Ontology (PaTO) Subcellular Anatomy Ontology (SAO) Sequence Ontology (SO*) Molecular Function (GO*) Protein Ontology (PRO*) Extension Strategy + Modular Organization top level mid-level domain level Information Artifact Ontology (IAO) Ontology for Biomedical Investigations (OBI) Spatial Ontology (BSPO) Basic Formal Ontology (BFO) 74

OBO Foundry approach extended into other domains (all populating downwards from BFO) 75 NIF StandardNeuroscience Information Framework IDO ConsortiumInfectious Disease Ontology cROPCommon Reference Ontologies for Plants MilPortal.orgMilitary Ontology AIRS Ontology SuiteIntelligence Ontology Suite

76

Language 77 Speech actsWriting Acts of thinking*Printing Document acts … *Mental Functioning Ontology (MFO)

Coverage domain of IAO 78 Speech actsWriting Acts of thinkingPrinting Document acts …

Generic Purpose Attributes – Descriptive purpose: scientific paper, newspaper article, after-action report – Prescriptive purpose: legal code, license, statement of rules of engagement – Directive purpose (of specifying a plan or method for achieving something): instruction, manual, protocol – Designative purpose: a registry of members of an organization, a phone book, a database linking proper names of persons with their social security numbers 79

80

81

John Searle: start with biology, add speech

The Searle Thesis Through the performance of speech acts (of promising, marrying, accusing, exchusing) we bring into being ₋claims, ₋obligations, ₋relations of authority, ₋relations of membership, … = the entities making up the ontology of the social world 83

How, on this view, can institutional entities, endure through time? in the local case: through beliefs, memories, desires – planning a weekly coffee morning with your friends … But what about the global case (where there is no face-to-face contact, where there are many cheaters, where beliefs conflict ontologically)? 84

Hernando de Soto Institute for Liberty and Democracy, Lima, Peru Bill Clinton: “The most promising anti-poverty initiative in the world” 85

The de Soto thesis: documents and document systems are the mechanisms for creating the institutional orders of Western capitalism The Mystery of Capital: Why Capitalism Triumphs in the West and Fails Everywhere Else, New York: Basic Books,

With the invention of documented claims and obligations a new dimension of socio-economic reality comes into existence: bank accounts, stocks, shares, bonds, mortgages, credit cards these form enduring social networks – document systems – of entirely new types debts become information entities analogous to digital artifacts 87

From speech act theory to document act theory 88 Generalizing the de Soto thesis: documents and document systems are the mechanisms for creating all institutional orders of modern civilization

89 Identity

An extralegal standardized sales contract for a one- acre parcel in the outskirts of Arusha, including the involvement of witnesses in the preparation of the document and the use of fingerprints to ensure the authenticity of the document. Standardization 90

Standardized documents allow standardized transactions improve the flow of communications allow assets to be described using standard categories, so as to enable comparisons allow the transition from ad hoc narratives (as in ancient title deeds) to structured representations communication is advanced because signals are abbreviated supports the creation of more effective registries 91

A. N. Whitehead It is a profoundly erroneous truism, repeated by all copy-books and by eminent people when they are making speeches, that we should cultivate the habit of thinking what we are doing. The precise opposite is the case. Civilization advances by extending the number of important operations which we can perform without thinking about them. 92

Standardized documents enable – new types of distributed ownership through stocks, shares, pensions, … – currency notes – new types of legal accountability – new types of business organization – new types of massively planned social agency – democracy – the state – law … 93

Scope of document act theory the social and institutional (deontic, quasi- legal) powers of documents the sorts of things we can do with documents the social interactions in which documents play an essential role the enduring institutional systems to which documents belong 94

The ontology not only of capital, bankruptcy, stock market … but also of the Holy Roman Empire the Swedish language the United Nations the internet a symphony concert urban planning mathematicians is to be understood in terms of the different sorts of documents which these phenomena involve 95

96 How to do things with words (speech act theory) 1.We represent how things are: record, report, description, assertion … 2.We try to get people to do things: request, order, command … 3.We commit ourselves to doing things promise, agreement, … 4.We bring about changes in the world through utterances congratulating, blessing, forgiving …

97 How to do things with documents (document act theory) 1.We represent how things are: map, chemical diagram, x-ray image, … 2.We try to get people to do things: blueprint, musical score, plan of battle … 3.We commit ourselves to doing things contract, planning agreement, flow chart … 4.We bring about changes in the world through document acts organigram, act of parliament, license, diploma …

How to do things with diagrams 98

From speech acts to document acts Documents can be copied, modified, stored … Documents can be aggregated (attachment of liens …) Documents can be meshed together (for example into plans and sub-plans – as in a musical score, plans for a military operation) Documents can be algorithmically executable (Turbotax …) 99

John Searle: Directions of fit world-to-mind: I promise I will mow your lawn tomorrow mind-to-world: I see that my lawn has been mowed automatic mind-to-world-and-world-to- mind: I say “I promise to pay you $100 dollars” and thereby make it true that I promise to pay you $100 dollars 100

Directions of fit for documents world-to-mind: a plan is formulated to change the world (to make it conform to the mind of the planner …) mind-to-world: a report is published evaluating the success of the execution of the plan automatic mind-to-world-and-world-to- mind: Act of Parliament is published declaring that such-and-such is the law and such-and-such is the law 101

(musical) directions of fit world-to-score: the score tells the world how to shape itself to create a performance that is in conformance with the score score-to-world: the score, when the performance is completed, serves as a record of the performance automatic score-to-world-and-world-to- score: Berlioz completes the score and thereby brings into being a work that is precisely in conformance to the score 102

Individual performers may use their scores in different ways 1.they may mark up their copies of the score to add specific instructions for their own use 2.they may mark up their copy of the score to record errors in their own performance 103

104 what begins as a plan, ends as a record

Blueprint what begins as a plan ends as a record of process of product 105

From speech acts to document acts 106 Searle, Tuomela, Gilbert, Bratman deal with simple local interaction of cooperative agents communicating by speech “Would you like to dance?” “Let’s lift this table” “Shall we cook dinner together?” “Waiter, bring me a beer!” …

Scott J. Shapiro, “Massively Shared Agency”, 2013 [Bratman, Searle …] ‘are unable to account for the existence of massively shared agency. they ‘have largely concentrated on analyzing shared activities among highly committed participants. The working assumption has been that those who sing duets or paint houses together are all committed to the success of the activity.’ 107

Shapiro: To adapt standard theory of collective agency to deal with massively shared actions we need to add authority Authorities are … “mesh creating” mechanisms. When disputes between participants break out with respect to the proper way to proceed, authorities can create a mesh between the subplans of the participants by demanding that both sides accept a certain solution. Basic for Shapiro’s theory of the nature of law 108

Conclusion Documents, as much as authority, are what make possible the sorts of massively shared agency we find in business corporations, universities, organized religions, governments, legal systems, standing armies 109