Elasticsearch-powered Faceted Search in the Deep Carbon Observatory Data Portal Stephan Zednik, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic Institute Summer Meeting July 19-22, 2016 Durham, NC zednis2@rpi.edu @szednik https://tw.rpi.edu/web/doc/esip2016-dco-faceted-search
Deep Carbon Observatory (DCO) 10 year initiative Global community scientists doing collaborative multi-disciplinary work Focusing on carbon’s role in geophysical processes in Earth The Deep Carbon Observatory is a global research program to transform our understanding of carbon in Earth. At its heart, DCO is a community of scientists, from biologists to physicists, geoscientists to chemists, and many others whose work crosses these disciplinary lines, forging a new, integrative field of deep carbon science. https://deepcarbon.net/about/about-dco
DCO Data Portal We take heterogeneous data and information from multiple different science domains across different organizations and organize them into a knowledge graph with over 550,000 triples.
DCO Data Portal
Challenge and Objective Allow users to easily find relevant or interesting publications potential collaborators datasets projects from within the DCO community DCO Data Portal User profile page
Approach Implement faceted search interfaces for several people publications projects/field studies datasets A canonical example of a faceted navigation at LinkedIn
A new solution was needed Faceted Search v1 (S2S) difficult to maintain difficult to update non-standard behavior slow A new solution was needed
Elasticsearch Search-engine based on Lucene Real-time search and analytics capabilities Sophisticated RESTful API Distributed, scalable, and highly available schema-free supports faceting
DCO Elasticsearch Ingest denomalize… Knowledge graph Stack of JSON documents http://www.caresearch.com.au/caresearch/tabid/2115/Default.aspx
Search Document JSON
FacetView2 + Handlebars.js Elasticsearch REST API JS library for faceted search Templating engine easy to maintain easy to configure fast HTTP GET JSON Webserver Client Browser https://github.com/CottageLabs/facetview2
FacetView2 config
Handlebars.js Templates
Publication Browser (v2)
Field Studies Map (v2)
Summary and Future Work Highlight Elasticsearch + Facetview2 provides a simple and performant solution for faceted search of DCO Data Portal content Very easy to deploy and evolve Future work Incorporate new facet types Add analytics to search interface? Thanks for listening zednis2@rpi.edu @szednik 16
DCO Faceted Browser Links Faceted Browsers https://info.deepcarbon.net/vivo/people https://info.deepcarbon.net/vivo/publications https://info.deepcarbon.net/vivo/datasets https://info.deepcarbon.net/vivo/projects Faceted Field Studies Map https://info.deepcarbon.net/vivo/field-studies