Conditor Towards a national reference repository for French scientific production Valérie Bonvallot (CNRS-Inist) – Thierry Dautcourt (Inria)

Slides:



Advertisements
Similar presentations
The way to open resources Laurent Romary CNRS. Two aspects of scientific communication Research papers –All types (Conferences, journals, grey literature.
Advertisements

IST Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,
Pure Silver Reusing and Repurposing Bibliographic Data in a Current Research Information System and Institutional Repository 15 September.
Open Access to Humanities Data — a scholarly perspective Laurent Romary Inria — French national research center in computer science Humboldt University.
Interoperability scenarios between UKPMC and OpenAIRE Jo McEntyre, Wolfram Horstmann.
Research Information Systems for Higher Education Institutions in Italy.
Lecture №2 State System of Scientific and Technical Information.
CNRIS CNRIS 2.0 Challenges for a new generation of Research Information Systems.
Supporting Open Access Implementation via CRIS/repository interoperability Pablo de Castro euroCRIS Board member Open Access Project Officer at LIBER
SSTIC’s OpenAccess related activities in international context Ľubomír BILSKÝ Bratislava, 25 March 2015 Slovak Scientific and Technical Information Center.
IntroductionDC Metadata towards an e-research cyberinfrastructure The case of French ETDs.
Repositories for research information management Wolfram Horstmann CERIF-CRIS and Repositories, Brussels, 12/13-oct-2011.
1 / 1509 / 17 / 14 Digital preservation of architectural 3D data Rosetta in the context of the DURAARK project IGeLU Conference Oxford, September 17 th.
Romain Wenz- BnF-DIBN – SWIB 2010 November The data.bnf.fr project describing resources of the French National Library.
Digitization at the National Archives and Records Administration Doris Hamburg Director, Preservation Programs James Hastings Director, Access Programs.
P. 1 A review of interlending and document supply in France: 2014 Restructuring resource sharing: new organizations, technologies, methods – IFLA Satellite.
The emerging role of Institutional CRIS in facilitating Open Scholarship Anna Clements, Assistant Director (Digital Research) Jackie Proven, Repository.
Federated Networks of Open Access Repositories in Mexico and Latin America Rosalina Vázquez Tapia, Autonomous University of San Luis Potosí.
Status of ICT structure, infrastructure and applications existed to manage and disseminate information and knowledge of Agricultural Biotechnology Innovations.
Grey Literature, E-Repositories and Evaluation of Academic & Research Institutes. The case study of BPI e-repository Maria V. Kitsiou - Head Librarian,
1 Jean-François Desnos, Geneviève Gras, Béatrice Meier, Laurent Pilet Université Joseph Fourier, Université de Strasbourg, Clermont Université.
Resource Sharing Development and Challenge in Academic Libraries: the Case Study of CALIS Yao XiaoXia CALIS Administrative Center , PUL , shanghai.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Uniting heritage digitization and EAD metadata : “Calames Plus” solutions and other tracks LIBER Annual Conference Tartu, Thu. 28 June
The emerging role of Institutional CRIS in facilitating Open Scholarship Anna Clements, Assistant Director (Digital Research) Jackie Proven, Repository.
Towards a shared platform of scientific publications in New Caledonia: the example of “Univers NC” an OAI-PMH compliant common open archive Isabelle Gasser.
Making Grey Literature Available through Institutional Repositories LeRoy J. LaFleur, Social Sciences Bibliographer Nathan A. Rupp, Metadata Librarian.
SELL, Izmir, May 2009 France Country Report Couperin 18-20/05/ /07/08 SELL 2009 France Country Report Grégory Colcanap & Catherine Etienne Consortium.
ECHO DEPository Project: Highlight on tools & emerging issues The ECHO DEPository Project is a 3-year digital preservation research and development project.
Juha Mykkänen University of Kuopio, HIS R&D Unit Health Kuopio seminar Brussels, 5 November 2004 SerAPI project: Service-oriented architecture and Web.
University of St Andrews euroCRIS Strategic Seminar, September 12 th -13 th, 2011, BrusselsAnna Clements, University of St Andrews Anna Clements Enterprise.
Consorzio Interuniversitario Improving Scientific Research in Higher Education Institutions: a process management experience in Italian Universities.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
ICT/ICM Status in Jordan Hesham Athamneh & Jordan Team.
HathiTrust’s Past, Present and Future. Short- and Long-term Functional Objectives Short-term Page turner mechanism (and Mobile!) Branding (overall initiative;
The Canadian Information Network for Research in the Social Sciences and Humanities Tim Au Yeung and Mary Westell Libraries.
WDC-MARE – World Data Center for Marine Environmental Sciences Data portal based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler,
Automated (meta)data collection – problems and solutions Grete Christina Lingjærde and Andora Sjøgren USIT, University of Oslo.
Tackling the Infrastructure Requirements: Potential Role of SK-CRIS and National CRIS Systems in Supporting Open Access Implementation Pablo de Castro.
PACSCL Consortial Survey Initiative Group Training Session February 12, 2008 at The Historical Society of Pennsylvania.
An OAI-Compliant Federated Physics Digital Library for the NSDL Department of Computer Science Old Dominion University, Norfolk, VA In Collaboration.
Nicola Bertazzoni Marta Zaetta Integrating CRIS with other systems.
From small beginnings: Developing collection level description Mapping the Information Landscape Showcase day British Library Conference Centre, London,25.
Direction de l’Information Scientifique 1 Scientific and Technical Information at CNRS Laurent Romary Directeur de l’information scientifique - CNRS.
HOW TO PUBLISH IN HIGH-IMPACT PUBLICATION. At the end of this session, participants will be able to choose the method of measurement on research performance.
THE ROLE OF UNIVERSITIES FOR ECONOMIC DEVELOPMENT IN URBAN POLES RUnUP Thematic Network Clive Winters, Lead Expert Chris Wilson, Lead Partner.
DANS is an institute of KNAW and NWO Data Archiving and Networked Services Measurement of research impact in OpenAIRE 2020: via text mining or the CRISs?
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
©euroCRIS/Keith G JefferyCRIS: Stakeholders, Benefits, History, Process, Architecture CRIS “a Current Research Information System, commonly.
Overviews of the Library of Texas & ZLOT Project Dr. William E. Moen Principal Investigator.
BIBSAM-konsortiet 13/01/2016 ICLC Paris 2009 Updates: the BIBSAM consortium, Sweden Technical conditions in licenses Anna Lundén, coordinator.
Assessing current print periodical usage for collection development Gracemary Smulewitz Distributed Technical Services Rutgers University Libraries.
CNR – National Research Council, Rome (IT) Central Library ‘G. Marconi’ National Centre for Grey Literature and National ISSN Centre CNR – National Centre.
1 « Luxembourg, 18 April 2007 « Virtual Library of Official Statistics « Dissemination Working Group.
Data Citation Implementation Pilot Workshop
International Planetary Data Alliance Registry Project Update September 16, 2011.
CitEc as a source for research assessment and evaluation José Manuel Barrueco Universitat de València (SPAIN) May, й Международной научно-практической.
7th Annual Hong Kong Innovative Users Group Meeting
Jordan PIŠČANC, University of Trieste
MANAGEMENT OF STATISTICAL PRODUCTION PROCESS METADATA IN ISIS
Reusing and repurposing metadata in a Current Research Information System and Institutional Repository 3 June 2010 Robin Armstrong Viner Cataloguing.
Digital Asset Management Part 15: Summary
Martin Moyle Digital Curation Manager UCL Library Services, UK
Metadata to fit your needs... How much is too much?
Towards a national research information infrastructure in the Netherlands based on CERIF: challenges and opportunities Chris Baars, supervisor Electronic.
Objectives, activities, and results of the database Lituanistika
IdRef – Service of reference frames for Higher Education and Research
Márton Németh – László Drótos How to catalogue a web archive?
This presentation will probably involve audience discussion, which will create action items. Use PowerPoint to keep track of these action items during.
European databases for research output
Presentation transcript:

Conditor Towards a national reference repository for French scientific production Valérie Bonvallot (CNRS-Inist) – Thierry Dautcourt (Inria) - Paris 11 may

A multi-partner project in the French higher education and research area : Ministry, public institutions with a scientific and technical vocation, Universities, Agencies, etc. 2 Building a national reference repository for French scientific production based on common reference repositories shared by universities and research organizations

Building a bibliographic reference repository to:  Share metadata describing French scientific production  Pool inventories of scientific production 3 Archive No full text Decision-making tool No indicator production Portal No browser interface for end users Current Research Information System No research management Conditor : a reference repository with quality data allowing interoperability

4 International bibliographic databases WoS Scopus Pubmed etc. CRIS Archives Hal Researchers, team leaders, information specialists Researchers, laboratory directors, research unit managers … Local databases Structures, staff, NRA projects etc. « STI » reference repositories Addresses, themes, authors, journals, congresses etc. Management reference repositories Conditor: position in the French STI landscape Institutional identification databases Common reference repositories Conditor Management team S t r R u R c t. National Repertory of Research Structures (RNSR) A u t R h R o r s IdRef ISSN ORCID ISNI

5 Experimental principles: pragmatism  Working with multi-skill volunteers National Center for Scientific Research (CNRS) National institute for agricultural research (Inra) National institute dedicated to computational science (Inria) French Research Institute for Development (IRD) Bibliographic agency for higher education (Abes) Bordeaux University Paris Dauphine University Ministry of Higher Education and Research Experimental group: representatives from 8 organizations and establishments  Using resources we already have  Assessing difficulties, benefits and involvement

Conditor: constitution method of a corpus Several strict alignments of character strings Name entities, search in addresses Incorporation of identifiers for research structures and authors « Enriched » Conditor corpus Mapping XML formatting Normalisation / homogenisation Identifiers Document titles Authors Sources Collations Addresses Document types IdRef RNSR Reference system of CNRS structures Step 1 MetaData (MD) Treatment and curation Step 2 Detection of duplicates Step 3 Enrichment using reference repositories 6 Reference repositories used « Matching group » Data from 9 databases for the 2011 publication year from Open archives Bibliograph. database Bibliometr. database Mini CRIS Library Catalogue

No funding in database 1 No affiliation in database 1 Curation and enrichment Record in 3 databases 7 BIRD HAL INRIA

Curation and enrichment No funding in database 2 1 affiliation missing in database 1 Record not in INRA database Record in 2 databases 8 HAL Inist

 Improving some aspects in the corpus building ◦ Detection of duplicates ◦ Data incorporation from national structures and authors systems  What we learn ◦ Conditor is « feasible » ◦ Fully-automated treatment isn’t sufficient ◦ A social structure is needed  Potential advantages Sharing a common national warehouse of descriptive bibliographical records is essential to : ◦ Manage publications not found in databases used for evaluation ◦ Avoid several manual data entries ◦ Improve information systems interoperability ◦ Improve through use common reference data dictionaries repositories and persistent digital identifiers (national research structures, parent organizations, authors, journals, fundings, congresses, etc.) 9

10 5 years corpus building Design and development of functionalities in an iterative way and progressive implementation Project launch Year N Year N+1 Conditor service Management functionalities -Retrieval -Modification -Deletion -Validation -Dissemination 3 years corpus5 years corpus corpus Treatment functionalities -Duplicate identification -Enrichment through reference repositories

11 Kiitos Köszönöm мерси Hhvala vam Tänan Efharisto Paldies Ačiū Grazzi Dank je Dziękuję Obrigado/a Mulţumesc Děkuji Dakujem Merci Tak Grazie Gracias Thanks Danke