GBIF-Sweden : An up-date on Swedish biodiversity collection and observation informatics Anders Telenius, Swedish Museum of Natural History, Stockholm Would have been…
GBIF-Sweden today - tomorrow Organization: at Swedish Museum of Natural History; Biodiversity Informatics unit; Staff (Fredrik Ronquist, Anders Telenius, Mickaël Graf, Marie Svensson). Node Manager FR (?); NSG repr. L. M. Nilsson/Magnus Friberg (Science Research Council)
GBIF-Sweden today - tomorrow Funding situation: New appl (SRC: RFI – Advisory Board f. Res. Infrastructure) 19 April; 1.4 M EUR/5 yrs. N.b. Science Council..!! Still the SRC has decided to support the environmentally inclined Lifewatch Sweden initiative (which state they will develop independent of Lifewatch Europe (but remember: SMNH representative in Lifewatch advisory board (FR/AT) GBIF-Sweden at SMNH – Lifewatch at Species Info. Centre, Sw. Univ. Of Agr. Sci. (incl. Species Gateway): BI- investment diverted/governed by SRC?? How?? We consider a decentralized organization is positive (coll. vs.obs.data; other iniatives relations)
GBIF-Sweden today - tomorrow Prioritized tasks GBIF-Sweden Continuation of running matters (provider contacts, new data sets, project coordination, applications etc.; One 25% proj.leader, one 100% coordinator/node manager, one 100% database developer) Automatized data delivery (IPT v. 2.0) amongst (large) providers Digitization effort. Automatic procedures encouraged/Tech. development proj.) New Nordic/Baltic data portal (contacts to be established). Relation to Lifewatch clarified Database of Swedish/Nordic/Baltic metadata
GBIF-Sweden today - tomorrow Prioritized tasks GBIF-Sweden (cont´d) Repatriation of ”Swedish” records elsewhere. User encouragement (outreach activities + research/investigation) Participation in collection standardization project within SMNH/Sweden nationwide (IRIS/DINA : ”Digitalt informationssystem för naturhistoriska samlingar”) Coordination with PESI Focal Point (extra staff needed?) Education (participants at Nordic Master in Systematics/Evolution; Introductory and basic level courses in Biodiversity/Systematics/Evolution/Ecology at universities) Cooperation GBIF – the National Heritage board
GBIF-Sweden today - tomorrow Initiative reputation/condition: GBIF-Providers fairly reluctant; presumptive Lifewatch providers indifferent; users practically absent
GBIF-Sweden : An up-date on Swedish biodiversity collection and observation informatics Anders Telenius, Swedish Museum of Natural History, Stockholm Is!
GBIF-Sweden : Heading for 30 million records presented 1 (7): The number of datasets in GBIF-Sweden
Presently GBIF-Sweden comprise 123 registered datasets (114 ”active”). 92 have responded to an inquiry concerning size and digitization status. An up-to-date report thus exists on 80% of all Swedish GBIF data- sets. In 2005 the corresponding figure was 31% (of the same 123 GBIF-partners-to-be. The increase was steady over the years. 51 out of all 123 registered datasets are published at in 2010 – as compared with two in 2005). This represents an 1.5% - 41% increase of activity during October 2005 – July Unfortunately we still lack recent information regarding 60% of the datasets – only 40 % of the data owners responded to the inquiry! No. of days since the first measurement No. of data sets No. of data sets in GBIF-Sweden
2 (7): The number of data, degree of digitization and publication rate in GBIF-Sweden
Altogether providers report that approximately 60 million original data might become available in Sweden – disregarding metadata! No. of days since the first measurement No. of objects No. of data (OBJECTS) in GBIF-Sweden
The increase in the number of potentially available data is actually considerably larger than shown, since the numbers of individuals and objects were not separated in the early inquiries (2005, 2006). Anyway, since the start of the initiative, approximately twice the initially expected number of objects may become mobilized. No. of data (OBJECTS) in GBIF-Sweden No. of objects No. of days since the first measurement
A considerable share of the increase represents observation data (rapidly adding to the database). Still, almost half of the total amount refer to collection data. No. of data (OBJECTS) in GBIF-Sweden No. of objects No. of days since the first measurement
The number of digitized data has increased sixfold in five years – mostly because of the addition of recent observation data. No. of days since the first measurement No. of data (OBJECTS) in GBIF-Sweden No. of objects
To day GBIF-Sweden publish in excess of 150 times as many data as in No. of days since the first measurement No. of data (OBJECTS) in GBIF-Sweden No. of objects
3 (7): The proportions of digitized and published data in GBIF-Sweden
The availability of data in terms of digitized information approaches 50 % (from less than 12% in 2005). Considering all available data the degree of publication has increased dramatically (0,44% in % in 2010) No. of days since the first measurement Degree of digitization (%) in GBIF-Sweden % objects
The part of the data out of the data digitized that are published, increase rapidly: from slightly below 4% in 2005 to 94% in A large part of this increase depend on the import of observation data (starting in ), but even so collection data are getting markedly more common. No. of days since the first measurement Degree of digitization (%) in GBIF-Sweden % objects
4 (7): The numbers of data, degree of digitization and publication rate in GBIF-Sweden : COLLECTIONS COLLECTIONS
GBIF-Sweden might eventually offer 35 million collection data. The actual increase in five years is hidden behind overly ambitious initial estimates by dataproviders. The amount of digitized data increased by 100%, and starting at nil GBIF now publish 6 million collection data. No. of days since the first measurement No. of data (OBJECTS) in GBIF-Sweden : COLLECTIONS No. of objects
COLLECTIONS 5 (7): The proportions of digitized and published data in GBIF-Sweden : COLLECTIONS
The degree of digitization in collections increase from 11% in 2005 to 22% in 2010, and the rate of presentation follows the trend as counted per potental data (ca. 35 million). In contrast, the pre- sentation of digitized data increses from nil to almost 80%! No. of days since the first measurement % objects Degree of digitization and presentation of data (% objects) in GBIF-Sweden : COLLECTIONS
6 (7): The numbers of data, degree of digitization and publication rate in GBIF-Sweden : OBSERVATIONS OBSERVATIONS
By the addition of observational datasets in 2009, the total amount of data published is now just below 30 million. All obser- vation data are digitized, hence the three curves showing the num- ber of such data, the degree of digitization and the publication are practically identical. No. of days since the first measurement No. of data (OBJECTS) in GBIF-Sweden : OBSERVATIONS No. of objects
7 (7): The proportions of digitized and published data in GBIF-Sweden : OBSERVATIONS OBSERVATIONS
As above: Both the number and the percentage of digitized data out of all potentially published or actually presented reach almost 100%. Only the odd dataset is occasionally at fault. No. of days since the first measurement % objects NB Different scale Degree of digitization and presentation of data (% objects) in GBIF-Sweden : OBSERVATIONS
Then what to do? We will increase the support to digitization efforts (funding is however NOT presently the responsibility of GBIF-Sweden) We will implement the use of IPT/TAPIR-link at major publishers We will validate already published data We will add some new datasets (e.g. gene banks, migration data) We will attempt to connect GBIF to a similar initiative within the humanities in Sweden (“K-SAMSÖK”) etc.
Enfin… The development of the presentation of data by GBIF-Sweden is pretty much as expected… …but in order to reach 100% coverage by the end of 2011 (as set out at the beginning of the plan for the five-year period ) we need to increase the rate of publication further. The most difficult obstacle to this ambition is that such a great proportion of our collections are not yet digitized. Also, some data sets are not fit for import (incompatible database systems, data cleaning needed etc.)
The end!