Technology Exploration -Discussion and Next Steps- Satoko H. MIURA Tech. Expo Group Lead JAXA
Summary of today’s presentations How to Handle Big Data?
Today’s Presentations 1.System a.Optimized HPC systems (parallel processing, storage and scheduling) 2.Data Management a.Standardized Discrete Global Grid Systems 3.Data Index a.NOSQL influenced solution 4.Data Analysis a.Make Analytics Simple (Analytics API, Using Natural Languages) 5.Data Access a.Storage structure: “Chunk” size 6.Example a.National Environment Research Data Collections (NERDC) / NCI
Introductions to Discussion: Bringing Processing Close to the Data
Next Steps We will continue more discussion using the Tech. Expo. webinars and future meetings. Proposal on the Webinars are shown in the following slides.
Proposal : Tech. Expo webinars 1.A series of monthly webinars, each minutes long, for various Tech Expo Topics. Each webinar will be conducted by an expert in the specific topic. 2.We will create a wiki page for each monthly topic, advertising the expert speaker, and a description of the talk and the logistics of the webinar. 3.We will record each webinar session and make those recordings available on the WGISS website 4.We will create of list of tech expo topics. Some topics may require multiple webinars, each with a different speaker. 5.The speakers will come from a variety of international organizations – no CEOS affiliation needed, just noted expertise in the subject.
Proposal : Tech. Expo webinars (cont.) 1.Big Data, HPC and Cloud Computing a.What CEOS needs? CEOS Challenges b.Distributed data centers c.Data processing (incl, Data cube) d.Data distribution e.Data Analysis f.API and use of standards (quality judgment scheme, may be with WGCV) g.Network (bandwidth, application) h.Advantages and disadvantages 2.Searching for free satellite data from CEOS agencies a.In suitable forms (related to FAD) 3.GCMD/IDN Keywords – what are they, how to add to these lists
Proposal : Tech. Expo webinars (cont.) 4.Search Relevancy for Collection searches 5.Data Quality (may be with WGCV) a.ISO? 6.Semantics a.Augmentation of meta data 7.Visualization of Data a.Web-based visualization b.Volume rendering c.Tiling d.Augmented Reality 8.Using authentication/SSO 9.Metrics of usage, metrics of datasets 10.Crowd Sourcing