Ongoing thesis. Analyzing sonar data. Student: Pyojeong Kim

Slides:



Advertisements
Similar presentations
Big Data Training Course for IT Professionals Name of course : Big Data Developer Course Duration : 3 days full time including practical sessions Dates.
Advertisements

Business Intelligence Systems
Implementation of the DDI at the Roper Center A Pilot Project on Resource Integration Marc Maynard and Hui Wang The Roper Center.
Business Intelligence Systems
Активное распределенное хранилище для многомерных массивов Дмитрий Медведев ИКИ РАН.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Committed to Deliver….  We are Leaders in Hadoop Ecosystem.  We support, maintain, monitor and provide services over Hadoop whether you run apache Hadoop,
Hadoop File Formats and Data Ingestion
Archive Engine for Large Data Sets Nikolay Malitsky EPICS Collaboration Meeting San Francisco, USA October 5, 2013.
Hadoop Basics -Venkat Cherukupalli. What is Hadoop? Open Source Distributed processing Large data sets across clusters Commodity, shared-nothing servers.
Distributed Systems Fall 2014 Zubair Amjad. Outline Motivation What is Sqoop? How Sqoop works? Sqoop Architecture Import Export Sqoop Connectors Sqoop.
Creating With Code.
Hive Facebook 2009.
Enabling data management in a big data world Craig Soules Garth Goodson Tanya Shastri.
Database Principles. Basics A database is a collection of data, along with the relationships between the data The data has to be entered into a structure,
Database Management Systems CS 420. Topics Outline 1. Introduction 2. HTML Review 3. VBScript 4. Access DBMS 5. Relational Database 6. Design Process.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
Astrophysical Data. Data in Astrohysics 3 2D Images Spectra Tabular data 3D Cube data Simulation data Time series.
Impala. Impala: Goals General-purpose SQL query engine for Hadoop High performance – C++ implementation – runtime code generation (using LLVM) – direct.
Table of Contents Introduction Why Data Analytics Data Analytics Terminology Predictive Analytics Data Analytics challenges Data Analytics Platform Data.
1 Seattle University Master’s of Science in Business Analytics Key skills, learning outcomes, and a sample of jobs to apply for, or aim to qualify for,
Unveiling Zeus Automated Classification of Malware Samples Abedelaziz Mohaisen Omar Alrawi Verisign Inc, VA, USA Verisign Labs, VA, USA
PREDICTING SONG HOTNESS
CPSC8985 FA 2015 Team C3 DATA MIGRATION FROM RDBMS TO HADOOP By Naga Sruthi Tiyyagura Monika RallabandiRadhakrishna Nalluri.
KNOWLEDGE DISCOVERY & DATA MINING Abhishek M. Mehta ROLL NO:24.
Hadoop Introduction. Audience Introduction of students – Name – Years of experience – Background – Do you know Java? – Do you know linux? – Any exposure.
Text and Images Key Revision Points.
Big Data & Test Automation
Introduction to Big Data James Miller
OMOP CDM on Hadoop Reference Architecture
Big Data, Data Mining, Tools
ANALYZING REAL ESTATE TRANSACTIONS
Presented by Matt Brunner
Database Principles.
SNS COLLEGE OF TECHNOLOGY
Sushant Ahuja, Cassio Cristovao, Sameep Mohta
Hadoop.
ANOMALY DETECTION FRAMEWORK FOR BIG DATA
Integrating QlikView with MPP data sources
An Open Source Project Commonly Used for Processing Big Data Sets
Multi-Layer Network Representation of the NTC Environment Lili Sun, Proof School Arijit Das, Computer Science Introduction The United States Army’s National.
How did it start? • At Google • • • • Lots of semi structured data
Chapter 14 Big Data Analytics and NoSQL
Hadoop Developer.
MSBIC Hadoop Series Processing Data with Pig
Sqoop Mr. Sriram
TABLE OF CONTENTS. TABLE OF CONTENTS Not Possible in single computer and DB Serialised solution not possible Large data backup difficult so data.
EVLA Archive The EVLA Archive is the E2E Archive
Hadoop.
Big Data Intro.
Hive Mr. Sriram
SQOOP.
Overview of Hadoop MapReduce MapReduce is a soft work framework for easily writing applications which process vast amounts of.
07 | Analyzing Big Data with Excel
Ministry of Higher Education
Learning Google
Hadoop for SQL Server Pros
Introduction to Apache
نرم افزار گزارش ساز کارنیک
Azure Data Lake for First Time Swimmers
Methodically Extrapolating Semantic Analyzer
Setup Sqoop.
Introduction of Week 9 Return assignment 5-2
Big DATA.
HDInsight & Power BI By Łukasz Gołębiewski.
UNIT 6 RECENT TRENDS.
Map Reduce, Types, Formats and Features
Big Data.
Presentation transcript:

Ongoing thesis. Analyzing sonar data. Student: Pyojeong Kim Advisors: Arijit Das, Sean Kragelund, Glenn Cook.

UUV & Mine Warfare (mission): Image Credit: http://www.bluebird-electric.net @arijit_da

Sonar Data File Each mission generates about GIG to a Terabyte of data. Depends on the duration. Over decades, Terabytes of files have been generated. Files need to be catalogued (using automated techniques). @arijit_da

Sonar Data File Each binary file has 2 parts. Metadata file name, timestamp, latitude, longitude, depth and more. Image Extraction and processing (future thesis topic) @arijit_da

Sonar Image Processing. Image Credit: https://tinyurl.com/y62xpu2g @arijit_da

Cataloging: Classic Hadoop problem. 100s of thousands of files. Same processing on each file. Map Phase is enough to collect metadata. @arijit_da

MapReduce. @arijit_da

Non Hadoop processing. File format provided by the sonar vendor. Use Java typecasting to map the raw bytes to the fields. For large number of files: Time consuming on single laptop, desktop, server. @arijit_da

Oracle DB Table. @arijit_da

Oracle Table Data. @arijit_da

HIVE Table. @arijit_da

HIVE Table Data. @arijit_da

Sqoop: HDFS HIVE back to Oracle DB Image Credit: https://www.dezyre.com/ @arijit_da

Oracle DB &APEX. @arijit_da

Hadoop in a box. ImageCredit: https://www.casesbypelican.com/ @arijit_da

AI to predict the Depth. Multilayer perceptron (MLP) regression. Other AI Techniques. @arijit_da