Presentation is loading. Please wait.

Presentation is loading. Please wait.

Value of a coordinate: geographic analysis of agricultural biodiversity Andy Jarvis, Julian Ramirez, Nora Castañeda, Samy Gaiji, Luigi Guarino, Hector.

Similar presentations


Presentation on theme: "Value of a coordinate: geographic analysis of agricultural biodiversity Andy Jarvis, Julian Ramirez, Nora Castañeda, Samy Gaiji, Luigi Guarino, Hector."— Presentation transcript:

1 Value of a coordinate: geographic analysis of agricultural biodiversity Andy Jarvis, Julian Ramirez, Nora Castañeda, Samy Gaiji, Luigi Guarino, Hector Tobón, and Daniel Amariles

2 Contents Why crop wild relatives? How a coordinate can help us complete the collections Cleaning coordinate data Needs from standards

3 Wild relatives of crops Include both progenitor species and closely related species of cultivated crops Faba beans – 0 wild relatives Potato – 172 wild relative species Increasingly useful in breeding, especially for biotic resistance

4 Florunner, with no root- knot nematode resistance COAN, with population density of root-knot nematodes >90% less than in Florunner Wild relative species A. batizocoi - 12 germplasm accessions A. cardenasii - 17 germplasm accessions A. diogoi - 5 germplasm accessions

5 Gap Analysis: Strategies to fill the holes in our seed collections

6 The Gap Analysis road map Taxonomy reviewData gatheringGeoreferentiaton Environmental data gathering Gap Analysis process Final recommendations

7 Geographic dimension Environmental dimension Taxonomic dimension The Gap Analysis process Proxy for: Range of traits Proxy for: Diversity Possibly biotic traits Proxy for: Abiotic traits

8 http://gisweb.ciat.cgiar.org/gapanalysis/

9 Total number of herbarium specimens and germplasm accessions available for each major crop wild relative genepool through the GBIF portal

10 Environmental coverage

11 HERBARIUMGERMPLASM

12 NO GERMPLASM DEFICIENT GERMPLASM

13 POTENTIAL RICHNESS RARE ENVIRONMENTS

14 Which species, and where

15 Wild Vigna collecting priorities Spatial analysis on current conserved materials *Gaps* in current collections Definition and prioritisation of collecting areas 8 100x100km cells to complete collections of 23 wild Vigna priority species

16 Richness in collecting zones at genepool level

17 Predicted change in species richness to 2050.

18

19 Exploration and ex-situ conservation of Capsicum flexuosum Uncommon species of wild chili, found in Paraguay and Argentina, historically used by local indigenous communities 18 known registers of the plant prior to this work 2 germplasm accessions conserved in the USDA GIS used to target field collections OBJECTIVE: Locate and collect germplasm of this species in Paraguay 6 new collections of C. flexuosum 160 seeds conserved ex situ

20 Behind all this Data Quality

21 The GBIF database: status of the data The database holds 177,887,193 occurrences Plantae occurrences are 44,706,505 (25,13%) 33,340,000 (74.5%) have coordinates How many of them are correct, and reliable? How many new georreferences could we get? CURRENT STATUS OF THE Plantae RECORDS

22 The GBIF database: status of the data How to make the terrestrial data reliable enough? – Verify coordinates at different levels Are the records where they say they are? Are the records inside land areas (for terrestrial plant species only) Are all the records within the environmental niche of the taxon? – Correct wrong references – Add coordinates to those that do not have – Cross-check with curators and feedback to the database

23 Using a random sample of 950.000 occurrences with coordinates

24 Are the records where they say they are?: country-level verification Records mostly located in country boundaries Inaccuracies in coordinates Records with null country: 58.051  6,11% of total Records with incorrect country:6.918  0,72% of total Total excluded by country64.969  6,83% of total

25 Are the terrestrial plant species in land?: Coastal verification Errors, and more errors Records in the ocean: 9.866  1,03% of total Records near land (range 5km):34.347  3,61% of total Records outside of mask:369  0,04% of total Total excluded by mask 44.582  4.69% of total

26 Not so bad at all… stats 44’706.505 plant records 33’340.008 (74,57%) with coordinates From those – 88.5% are geographically correct at two levels – 6.8% have null or incorrect country (incl. sea plant species) – 4.7% are near the coasts but not in-land Summary of errors or misrepresented data

27 TOTAL EVALUATED RECORDS: 950.000 Good records: 840.449  88.47% of total RESULTING DATABASE

28 Next steps It now takes 27 minutes to verify 950,000 records, 177million would be 83 hours (3 ½ days) Identify terrestrial plant species and separate them from sea species Use a georreferencing algorithm to: – Correct wrong references – Incorporate new location data to those with NULL lat,lon Interpret 2 nd & 3 rd -level administrative boundaries and use them too Implement environmental cross-checking (outliers)

29 Geo-referencing: BioGeomancer http://bg.berkeley.edu/

30 Conclusions A coordinate can tell us a lot, and answer a number of interesting research questions, solve a lot of problems Agricultural world sadly behind the mainstream biodiversity world – Data not online, not available – Databases not connected Quality of coordinate data is critical: – We need the concept of precision included – We need fields such as location descriptions, and administrative 2 nd and 3 rd level descriptions for georeferencing – We need effective two way communications for verifying, correcting and assigning coordinates from nodes to indexes and vice-versa Economy of scale

31 a.jarvis@cgiar.org


Download ppt "Value of a coordinate: geographic analysis of agricultural biodiversity Andy Jarvis, Julian Ramirez, Nora Castañeda, Samy Gaiji, Luigi Guarino, Hector."

Similar presentations


Ads by Google