Big Data Analytics and Machine Intelligence Capability Team

Slides:



Advertisements
Similar presentations
The North American Carbon Program Google Earth Collection Peter C. Griffith, NACP Coordinator; Lisa E. Wilcox; Amy L. Morrell, NACP Web Group Organization:
Advertisements

Project: BrailleScript Advisor: Dr. Mayer Goldberg Team: Semyon Medvedik Ivan Golman Ruslan Sergienko.
Input Validation For Free Text Fields ADD Project Members: Hagar Offer & Ran Mor Academic Advisor: Dr Gera Weiss Technical Advisors: Raffi Lipkin & Nadav.
Academic Advisor: Dr. Yuval Elovici Technical Advisor: Dr. Lidror Troyansky ADD Presentation.
Tutorial 8 Sharing, Integrating and Analyzing Data
Chapter 7 Managing Data Sources. ASP.NET 2.0, Third Edition2.
Mgt 240 Lecture Website Construction: Software and Language Alternatives March 29, 2005.
Presented by Mina Haratiannezhadi 1.  publishing, editing and modifying content  maintenance  central interface  manage workflows 2.
Databases & Data Warehouses Chapter 3 Database Processing.
Developing Health Geographic Information Systems (HGIS) for Khorasan Province in Iran (Technical Report) S.H. Sanaei-Nejad, (MSc, PhD) Ferdowsi University.
Technology Capabilities. Market Research + Tech Capabilities Datamatics has in-house capabilities to deliver Technical expertise. Our clients rely on.
My Resource for Excellence. Canadian Heritage Information Network Collections Management Software Criteria Checklist Heather Dunn, CHIN.
Earth Observing System Data and Information System (EOSDIS) provides access to more than 3,000 types of Earth science data products and specialized services.
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
Patient Empowerment for Chronic Diseases System Sifat Islam Graduate Student, Center for Systems Integration, FAU, Copyright © 2011 Center.
CHAPTER FOUR COMPUTER SOFTWARE.
1 Research Groups : KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems SCI 2 SMetrology and Models Intelligent.
Introduction to Interactive Media Interactive Media Tools: Software.
Upgrading to IBM Cognos 10
PCWG Analysis Tool Peter Stuart September 15, 2015.
UWG 2013 Meeting PO.DAAC Web Services Demo. What are PO.DAAC Web Services?
Query Processing In Multimedia Databases Dheeraj Kumar Mekala Devarasetty Bhanu Kiran.
The Basics of Javadoc Presented By: Wes Toland. Outline  Overview  Background  Environment  Features Javadoc Comment Format Javadoc Program HTML API.
XML and Digital Libraries M. Zubair Department of Computer Science Old Dominion University.
Ch 1. A Python Q&A Session Spring Why do people use Python? Software quality Developer productivity Program portability Support libraries Component.
Data Management BIRN supports data intensive activities including: – Imaging, Microscopy, Genomics, Time Series, Analytics and more… BIRN utilities scale:
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
1 Aspire Document Processing 1. 2 Document Processing – “Aspire” Very High Performance Structured Document Processing Architecture Dynamic configuration.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Global Change Master Directory (GCMD) Mission “To assist the scientific community in the discovery of Earth science data, related services, and ancillary.
Digital Data Preservation: a schema-driven model Student: Stacy Kowalczyk Co-Authors: Clare McInerney and Phil Mitchell Digital Data Preservation – the.
Presented By:. What is JavaHelp: Most software developers do not look forward to spending time documenting and explaining their product. JavaSoft has.
CAA Database Overview Sinéad McCaffrey. Metadata ObservatoryExperiment Instrument Mission Dataset File.
Slide 1 © 2016, Lera Technologies. All Rights Reserved. SAP BO vs SPLUNK vs OBIEE By Lera Technologies.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
BI TOOLS FOR THE IBM I …AND BEYOND A quick look at IBM’s DB2 Web Query for the i and WebFocus.
University of Colorado at Denver and Health Sciences Center Department of Preventive Medicine and Biometrics Contact:
Microsoft FrontPage 2003 Illustrated Complete Creating a Web Site.
Business Intelligence MSCS 6931 Compare Tableau and Power BI Haochen(Bamboo) Sun Sep 30, 2015.
Responder Field Edition & Pro
Charity Fund for Sudanese Students in Nanjing
Architecture Review 10/11/2004
VisIt Project Overview
Big Data Panel Discussion
Visual Information Retrieval
 Corpus Formation [CFT]  Web Pages Annotation [Web Annotator]  Web sites detection [NEACrawler]  Web pages collection [NEAC]  IE Remote.
DOCUMENTATION DEVELOPMENT LIFE CYCLE (DDLC)
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Responder Field Edition & Pro
CFS Community Day Core Flight System Command and Data Dictionary Utility December 4, 2017 NASA JSC/Kevin McCluney December 4, 2017.
Digital Measures Replacement
Databases.
Database Vs. Data Warehouse
Jefferson Ridgeway, 2017 NIFS Summer Intern Mentor: Brian Coltin, IRG
Speech Capture, Transcription and Analysis App
Embedding Graphics in Web Pages
Validation of Ebola LOD
PREMIS Tools and Services
Tim Coughlan Chetz Colwell Jane Seale
ICEweb 2 a new way of compiling high-quality web-based components for ICE corpora Martin Weisser Center for Linguistics & Applied Linguistics, Guangdong.
Charles Tappert Seidenberg School of CSIS, Pace University
Getting Started With Solr
DSS Concepts, Methodologies and Technologies
SSIS Data Integration Data Warehouse Acceleration
AI Discovery Template IBM Cloud Architecture Center
Microsoft Azure Data Catalog
APE EAD3 introduction - DARIAH - Brussels
SDMX IT Tools SDMX Registry
Presentation transcript:

Big Data Analytics and Machine Intelligence Capability Team Text Analytics of Selected Reports from the NASA Technical Reports (NTRS) NIFS Internship: Summer 2016 Jefferson Ridgeway, 2016 NIFS Summer Intern Manjula Ambur, Jeremy Yagle, Ted Sidehamer, NASA Langley Research Center Langley Research Center Background: Currently at NASA, NTRS (NASA Technical Reports) houses scientific research documents from the NASA Scientific and Technical Information (STI) Program.  NTRS allows subject matter experts to search documents via a graphical user interface (GUI) online and see immediate search query results and subsequent metadata in XML format. However, with the advancement of technologies and software tools, this method of searching documents has now become inefficient in comparison of other tools that are currently within the industry. One of the recent tools being used in the past few years is IBM's Watson Content Analytics (WCA) software. Goals: Through using the WCA software tool, the goals of this project include: •Investigating methods of locating, extracting, and ingesting a subset of documents from NTRS into WCA •Writing python scripts to prepare XML files from NTRS, Endnote, Mendeley, for ingestion into WCA proper format •Document processes related to adding another collection to the Watson system at NASA LaRC Methodology: The subsequent methodology conducted for this research project is as follows: Acquired a subset of NTRS subject matter search queries from NTRS Personnel in XML or CSV format Created a .java class that can be used in the runnable .jar file that will process NTRS and ingest into WCA Visually Reviewed the XML files to make sure the files are in appropriate WCA ingestion format If XML files were not in appropriate format or need to be edited, a python script was written that will insert or delete tags and write a new XML files If received a CSV file format, a python script was written to convert CSV to XML format for WCA ingestion Created a modularized process for taking directories of XML files and applying previous python script Built WCA Components (crawler, facet structure, index fields) and then pointed crawler to config, .jar, and source files Run ingest of XML Files into WCA software tool and visually inspected and validated WCA Collection Results and Next Steps: The results from conducting this research show the power that WCA provides to the user that include, unlike more general search query tools, key insights and patterns, ability to identify trends and connections, and visualize networks of experts in a given domain. The next steps within this research would be to ingest all of NTRS documents into a WCA collection. Figure 1: Default View of NTRS in WCA Figure 2: Facet Pairs between similar researchers and their works related to “Radiation Effects” in WCA Figure 3: Processed visualize connections made by WCA between different researchers