© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Brigitte Jörg, M.A. (Information Science) Language Technology Lab, German Research Center for Artificial Intelligence (DFKI) Berlin, Germany The Common European Research Information Format CERIF
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Introduction of Speaker Brigitte Jörg M.A. Information Science Information Systems, Business Administration Researcher, Project Manager DFKI GmbH, Language Technology Lab, Saarbrücken CERIF TG Leader, Board Member euriCRIS Contact: dfki.de
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Outline What is CERIF? Grounding Explanations Model Metadata Research Information CRIS The Conceptual (Logical) CERIF Model Entities Relationships Structure Modules The CERIF Interchange Format Concept / Structure XML CERIF XML Examples
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial What is CERIF ? C ommon E uropean R esearch I nformation F ormat
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial What is CERIF ? C ommon E uropean R esearch I nformation F ormat – A Concept about Research Entities and their Relationships Specification (Conceptual Level) – An abstract formal description of the concept about entities and their relationships Model (Logical Level) – A formal machine readable description of the concept Database Scripts (Physical Level) SQL Script CREATE Table Person CREATE Table Project CREATE Table OrgUnit Organisation of Data / Information accordingly !!
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial What is CERIF ? C ommon E uropean R esearch I nformation F ormat (1) data model ( conceptual, logical, physical ) (2) allows for a ( metadata ) representation of –research entities –their activities / interconnections (research) –their output (results) (3) allows for high flexibility with formal semantic relationships enables quality maintenance, archiving, access and interchange of research information supports knowledge transfer to decision makers, for research evaluation, research managers, strategists, researchers, editors, the general public
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial What is CERIF ? C ommon E uropean R esearch I nformation F ormat CERIF is an EU Recommendation to Member States The European Commission (EC) has authorised euroCRIS to maintain and develop CERIF and its usage
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial The CERIF Evolution EU Working Group on Research Databases Workshop CERIF 91 PROJECT Similar Ideas UN/UNESCO OECD CODATA Acronym: ERGO Participant: Keith Jeffery, Anne Asser son, many more Organisations: Rutherford Appleton, Uni- versity of Bergen, … Acronym: ERGO Participant: Keith Jeffery, Anne Asser son, many more Organisations: Rutherford Appleton, Uni- versity of Bergen, … 2000 CLASSIFICATION RESULTSEQUIPMENT PROJECT OrgUnitPERSON EXPERTISE Roles CERIF 2000 Model - Networking of DBs - Exchange of Records - Recommendation to Member States - Data Model (RDBMS, OO, IR) - Multilinguality - Controlled Vocabulary - Roles / Types - User-driven - EC Recommendation to Member States 2ndLevel CORE Language Semantics Link CERIF 2006 / 2008 Model - Data Model (RDBMS, OO, IR) - Model Normalization - Robust Structure - Extensible Structure - Consistent Structure - Semantic Layer - XML Exchange Specification - Elaboration on Publication - (Core) CERIF Semantics
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Outline What is CERIF? Grounding Explanations Model Metadata Research Information CRIS The Conceptual (Logical) CERIF Model Entities Relationships Structure The CERIF (XML) Interchange Format The CERIF 2008 Release CERIF and Related Activities
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial What is Research Information ? Data/Metadata or Information about: Scientists Project Managers Ongoing and Completed Projects Research Departments Funding Organisations and Programmes Research Results Publications Resources Equipment their timely Relationships ( Semantics )...
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial What is a CRIS? C urrent R esearch I nformation S ystem = CRIS … information about People + Organisations + Projects + Funding Programmes + Research Results + … … that means Timeliness Vitality … driven by A Concept A Model … incorporated as a Implementation (ICT) an integrated approach towards managing research information
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial What is a CRIS? C urrent R esearch I nformation S ystem = CRIS … information about People + Organisations + Projects + Funding Programmes + Research Results + … … that means Timeliness Vitality … driven by A Concept A Model … incorporated as a Implementation (ICT) an integrated approach towards managing research information heterogenous entities changing relationships CERIF Metadata Integration
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Users of CRISs ? Researchers (exchange information, find partners, track competitors, form collaborations) Research Managers (assess performance, assess research output, find reviewers for evaluation of proposals) Research Strategists (decide on priorities and resourcing, compare with other countries) Publication Editors (find potential authors, find reviewers for proposed papers) Intermediaries / Brokers (find research products, identify ideas to be carried forward) Media (communicate results) General Public (for interest)
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Users of CRISs ? Researchers (find partners, track competitors, form collaborations) Research Managers (assess performance, assess research output, find reviewers for evaluation of proposals) Research Strategists (decide on priorities and resourcing, compare with other countries) Publication Editors (find potential authors, find reviewers for proposed papers) Intermediaries / Brokers (find research products, identify ideas to be carried forward) Media (communicate results) General Public (for interest) Research is International Research Information involves various Entities
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial What kind of Questions do we want to answer from CRISs? How many articles has author X published in 2007 as a first author? How often have articles by author X been cited? Did author X publish with institutionally external authors? In how many FP7 projects does organisation Z participate? How many publications have resulted from project Y? How many people have been employed in the course of FP6 projects from the 1st call in the NMS? How many PhD students have participated in FP6 projects? How many women have been involved in FP6 projects? How often have articles in journal A been requested in 2007? How many articles have been published in the field of B? …
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Outline What is CERIF? Grounding Explanations Model Metadata Research Information CRIS The Conceptual (Logical) CERIF Model Entities Relationships Structure Modules The CERIF Interchange Format Concept / Structure XML CERIF XML Examples
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF: C ommon E uropean R esearch I nformation F ormat
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Concept of the CERIF Model CERIF : A model to manage Research Information Research Entities Project, Person, Organisation Funding Programme, Service, Equipment, Publication, Patent, Product, … Activities / Interconnections in their Context Relationships Semantics / Roles / Types -> for Exchange -> for Interoperability -> for Implementation of CRISs ( C urrent R esearch I nformation S ystems)
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Concept of the CERIF Model - Structure CERIF Entity Types Core Entities Result Entities 2nd Level Entities Link Entities CERIF Features Multiple Language Semantics
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Core CERIF Entities
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Core CERIF Entities in Detail Person ID URI Sex FirstNames OtherNames FamilyNames NameVariants ResearchInterest Keywords Project ID URI Acronym StartDate EndDate Title Abstract Keywords OrganisationUnit ID URI Acronym Name HeadCount CurrencyCode Turnover ResearchActivity Keywords
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Result Entities
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Result Entities in Detail ResultProduct ID URI InternationalID ResultPublication ID URI Title Subtitle Abstract Bibl. Note PublicationDate TotalPages StartPage EndPage Keywords ResultPatent ID URI PatentNumber Title CountryCode RegistrationDate ApprovalDate Description Keywords
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF 2nd Level Entities Citation CV Prize Qualification ExpertiseAndSkills Equipment Facility FundingProgramme Service ElectronicAddresse PostalAddress Country Currency LanguageEvent Metrics Call Grant
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Some CERIF 2nd Level Entities in Detail Facility ID URI Name Description Keywords FundingProgramme ID URI Name CurrencyCode Budget StartDate EndDate Description Keywords Event ID URI Name FeeOrFree StartDate EndDate CityTown CountryCode Description Keywords ResultPatent ID URI PatentNumber Title CountryCode RegistrationDate ApprovalDate Description Keywords Service ID URI Name Description Keywords Citation CV Prize Qualification ExpertiseAndSkills Equipment Facility FundingProgramme Service ElectronicAddresse PostalAddress Country Currency LanguageEvent Metrics Call Grant
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Some CERIF Link Entities
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Some CERIF Multiple Language Features
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Some CERIF Multiple Language Features in Detail ResultPublication Title [language=EN] Abstract [languange=EN] Keywords [language=EN] OrganisationUnit Name [language=EN] ResearchActivity [languange=EN] Keywords [language=EN] Project Title [language=EN] Abstract [languange=EN] Keywords [language=EN] Person ResearchInterest [language=EN] Keywords [language=EN] Facility Name [language=EN] Description [languange=EN] Keywords [language=EN] Service Name [language=EN] Description [languange=EN] Keywords [language=EN] ResultPatent Name [language=EN] Description [languange=EN] Keywords [language=EN] ResultProduct Name [language=EN] Description [languange=EN] Keywords [language=EN] Multiple Language Features are associated with Core, Result, 2nd Level, Classification Entities
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Some CERIF Semantic Features role=author role=coordinator role=CEO role=deliverable1.2 role=author1 institute role=funder Semantic Features are associated with Link Entities
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Associated Formal Semantic Features in more Detail Person_ResultPublication PersID PublID ClassificationID ClassificationSchemeID StartDate; EndDate Project_Person ProjID PerslID ClassificationID ClassificationSchemeID StartDate; EndDate OrganisationUnit_Result Publication OrgID PublID ClassificationID ClassificationSchemeID StartDate; EndDate Project_ResultPublicationPersI D PublID ClassificationID ClassificationSchemeID StartDate; EndDate Project_Organisation ProjID OrgID ClassificationID ClassificationSchemeID StartDate; EndDate Person_OrganisationUnit PersID OrgID ClassificationID ClassificationSchemeID StartDate; EndDate Project_FundingProgramme ProjID FundProgID ClassificationID ClassificationSchemeID StartDate; EndDate role=author role=originator role=coordinator role=affiliation role=author1 institute role=co-funder role=investigatedBy CERIF Model fraction
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Semantic Layer Classification ClassID (isA) ClassSchemeID (Taxonomy) StartDate, EndDate URI Term [language=EN] Description [language=EN] ClassificationScheme ClassSchemeID (Taxonomy) URI ( Description [language=EN] Classification_Classification ClassID1 (Ontology) ClassID2 (SemanticWeb) ClassSchemeID1 (WebTechnologies) ClassSchemeID2 (WebTechnologies) ClassId (isA) ClassSchemeID (Taxonomy) Fraction (0.3) StartDate, EndDate ClassScheme_ClassScheme ClassSchemeID1 (LT World) ClassSchemeID2 (CLARIN) ClassID (mapsWith) ClassSchemeID (LT-World Mappings) Fraction (0.3) StartDate, EndDate
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Semantic Layer Allows to capture any Schema or Structure Flat Lists Taxonomies Ontologies Open / Extensible in all directions New Schemas New Concepts / Terms New Relationships Enables to manage Roles / Types Semantics Subject Headings Archiving (Time component) Allows for simple Mappings between Schemas (Interchange) Allows for an efficient (independent) Maintenance
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Core CERIF Entities in Detail (ERM View)
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF: C ommon E uropean R esearch I nformation F ormat
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Outline What is CERIF? Grounding Explanations Model Metadata Research Information CRIS The Conceptual (Logical) CERIF Model Entities Relationships Structure Modules The CERIF Interchange Format Concept / Structure XML CERIF XML Examples
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Interchange Format According to the CERIF Model Structure Core Entities Result Entities 2nd Level Entities Link Entities Multilingual Features Semantic Features Entity Person Entity Interchange Entity 1:1
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Interchange Format 1 female 2 male --- Person ID URI Sex
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Interchange Format journal.html?ID2 --- ResultPublication ID URI PublicationDate Num Vol Edition Series Issue TotalPages StartPage EndPage ISBN ISSN ResultPublication
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Interchange Format Person_Publication personID publicationID ClassID ClassSchemeID Fraction StartDate EndDate Person_Publication
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Interchange Format According to W3C Standards Refers to XML Schemas for Validation XML files corresponding to CERIF Structure -> Entities / Separation of Relationships Available Specification Document as part of the CERIF 2008 Release
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial Outline What is CERIF? Grounding Explanations Model Metadata Research Information CRIS The Conceptual (Logical) CERIF Model Entities Relationships Structure The CERIF Interchange Format Concept / Structure Modules XML CERIF XML Examples
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Example (Integrated Person) Citation CV Prize Qualification ExpertiseAndSkills Equipment Facility Service ElectronicAddresse PostalAddress Country Currency LanguageEvent Metrics Funding ProjectFunding Programme ResultPublicatio n BrigitteJoerg DFKI Affilation REF2010-Evaluation
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF XML Example (Person) BrigitteJoerg f BrigitteJoerg Joerg Brigitte BrigitteJoerg DataScienceJournalArticle Article REF2010-Evaluation BrigitteJoerg DFKI Affilation REF2010-Evaluation
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Example (Integrated Organisation) Citation CV Prize Qualification ExpertiseAndSkills Equipment Facility Service ElectronicAddresse PostalAddress Country Currency LanguageEvent Metrics Funding ProjectFunding Programme ResultPublicatio n
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF XML Example (Organisation) DFKI EURO DFKI German Research Center for Artificial Intelligence DFKI PrivateNotForProfit OrganisationTypes DFKI Artificial-Intelligence Research-Fields
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Example (Integrated Project) Citation CV Prize Qualification ExpertiseAndSkills Equipment Facility Service ElectronicAddresse PostalAddress Country Currency LanguageEvent Metrics Funding ProjectFunding Programme ResultPublicatio n
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF XML Example (Project)
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Example (Integrated Publication) Citation CV Prize Qualification ExpertiseAndSkills Equipment Facility Service ElectronicAddresse PostalAddress Country Currency LanguageEvent Metrics Funding ProjectFunding Programme ResultPublicatio n
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF XML Example (Publication)
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF XML Example (Person_Organisation)
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Semantics [Publication Types] Book Review Book Chapter Book Chapter Abstract Inbook Book Chapter Review AnthologyMonograph Reference Book Textbook Encyclopedia Otherbook Journal Article Journal Article Abstract Journal Article Review Conference Proceedings Article Letter to Editor PhD Thesis Doctoral Thesis Poster Presentation Book Manual Conference Proceedings Letter Report Short Communication Commentary Annotation News Clipping Publication Types
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Semantics [Person-Publication Roles] is author (numbered) of is author of is reviewer of is author (percentage) of is editor (numbered) of is editor of is subject of is translator of is publisher of Person_Publication Scheme
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial CERIF Semantics [Some more examples] number of authors number of incoming citations number of requests number of external institutes number of downloads number of access is of publication type ISI Impact Factor claims IPR of Publication_Metrics Roles received Best Paper Award number of self citations area/type of research number of citations
© Brigitte Jörg Edinburgh – April, 21st, CERIF Introduction Tutorial THANK YOU for your attention !!!