Earthquake Prediction

Slides:



Advertisements
Similar presentations
Random Forest Predrag Radenković 3237/10
Advertisements

Ch. Eick: More on Machine Learning & Neural Networks Different Forms of Learning: –Learning agent receives feedback with respect to its actions (e.g. using.
Decision Tree Approach in Data Mining
Introduction The aim the project is to analyse non real time EEG (Electroencephalogram) signal using different mathematical models in Matlab to predict.
A Quick Overview By Munir Winkel. What do you know about: 1) decision trees 2) random forests? How could they be used?
CS771 Machine Learning : Tools, Techniques & Application Gaurav Krishna Y Harshit Maheshwari Pulkit Jain Sayantan Marik
Neural Computation Final Project -Earthquake Prediction , Spring Alon Talmor Ido Yariv.
Lesson 8: Machine Learning (and the Legionella as a case study) Biological Sequences Analysis, MTA.
Overview of The Operations Research Modeling Approach.
1 MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING By Kaan Tariman M.S. in Computer Science CSCI 8810 Course Project.
CS 5604 Spring 2015 Classification Xuewen Cui Rongrong Tao Ruide Zhang May 5th, 2015.
 The Weka The Weka is an well known bird of New Zealand..  W(aikato) E(nvironment) for K(nowlegde) A(nalysis)  Developed by the University of Waikato.
Attention Deficit Hyperactivity Disorder (ADHD) Student Classification Using Genetic Algorithm and Artificial Neural Network S. Yenaeng 1, S. Saelee 2.
Slide Image Retrieval: A Preliminary Study Guo Min Liew and Min-Yen Kan National University of Singapore Web IR / NLP Group (WING)
Predicting Income from Census Data using Multiple Classifiers Presented By: Arghya Kusum Das Arnab Ganguly Manohar Karki Saikat Basu Subhajit Sidhanta.
Clustering methods Course code: Pasi Fränti Speech & Image Processing Unit School of Computing University of Eastern Finland Joensuu,
INTRODUCTION TO COMPUTING CHAPTER NO. 06. Compilers and Language Translation Introduction The Compilation Process Phase 1 – Lexical Analysis Phase 2 –
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
An Attempt at Group Belief Characterization and Detection Danny Dunlavy Computer Science and Informatics Department (1415) Sandia National Laboratories.
Chapter 10: Compilers and Language Translation Invitation to Computer Science, Java Version, Third Edition.
Understanding MYP Criteria
Highline Class, BI 348 Basic Business Analytics using Excel, Chapter 01 Intro to Business Analytics BI 348, Chapter 01.
LOGO Ensemble Learning Lecturer: Dr. Bo Yuan
Weka: a useful tool in data mining and machine learning Team 5 Noha Elsherbiny, Huijun Xiong, and Bhanu Peddi.
Combining multiple learners Usman Roshan. Bagging Randomly sample training data Determine classifier C i on sampled data Goto step 1 and repeat m times.
Today Ensemble Methods. Recap of the course. Classifier Fusion
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
Project 2: Classification Using Genetic Programming Kim, MinHyeok Biointelligence laboratory Artificial.
Apache Mahout Qiaodi Zhuang Xijing Zhang.
Predicting Voice Elicited Emotions
Objectives: Terminology Components The Design Cycle Resources: DHS Slides – Chapter 1 Glossary Java Applet URL:.../publications/courses/ece_8443/lectures/current/lecture_02.ppt.../publications/courses/ece_8443/lectures/current/lecture_02.ppt.
Random Forests Ujjwol Subedi. Introduction What is Random Tree? ◦ Is a tree constructed randomly from a set of possible trees having K random features.
Improving Support Vector Machine through Parameter Optimized Rujiang Bai, Junhua Liao Shandong University of Technology Library Zibo , China { brj,
Data Mining and Decision Support
Competition II: Springleaf Sha Li (Team leader) Xiaoyan Chong, Minglu Ma, Yue Wang CAMCOS Fall 2015 San Jose State University.
… Algo 1 Algo 2 Algo 3 Algo N Meta-Learning Algo.
Using decision trees to build an a framework for multivariate time- series classification 1 Present By Xiayi Kuang.
Combining multiple learners Usman Roshan. Decision tree From Alpaydin, 2010.
Gist 2.3 John H. Phan MIBLab Summer Workshop June 28th, 2006.
Unveiling Zeus Automated Classification of Malware Samples Abedelaziz Mohaisen Omar Alrawi Verisign Inc, VA, USA Verisign Labs, VA, USA
Stochasticity and Probability. A new approach to insight Pose question and think of the answer needed to answer it. Ask: How do the data arise? What is.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
A Smart Tool to Predict Salary Trends of H1-B Holders
Restaurant Revenue Prediction using Machine Learning Algorithms
Machine Learning with Spark MLlib
Recognition of bumblebee species by their buzzing sound
Chapter 7. Classification and Prediction
Admission Prediction System
Lecture 12: Real-Time Scheduling
Debesh Jha and Kwon Goo-Rak
Presented by Khawar Shakeel
UNIT – V BUSINESS ANALYTICS
Classification with Perceptrons Reading:
Basic machine learning background with Python scikit-learn
Predict House Sales Price
Machine Learning Week 1.
Combining Base Learners
Introduction to Computer Programming
Introduction to Data Mining, 2nd Edition
What is Science?.
iSRD Spam Review Detection with Imbalanced Data Distributions
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Machine Learning for Visual Scene Classification with EEG Data
Analysis for Predicting the Selling Price of Apartments Pratik Nikte
Tools of Environmental Science
Modeling IDS using hybrid intelligent systems
Credit Card Fraudulent Transaction Detection
Advisor: Dr.vahidipour Zahra salimian Shaghayegh jalali Dec 2017
An introduction to Machine Learning (ML)
Presentation transcript:

Earthquake Prediction https://athena.ecs.csus.edu/~kokkiras/index.html Team - 18 Mentor : Prof. Meiliu Lu Done By : Akhil Madineni Suraj Krishna Kokkirala

Over View Problem Statement Goal Data Overview Data Preprocessing Model Implementation Demo Results Learnings References

Why predicting an Earthquake ? Forecasting earthquakes is one of the most important problems in science because of their catastrophic consequences. Current scientific studies related to earthquake forecasting focus on three key points:  When the event will occur ?  Where  will it occur ? How large will it be ?

Goal The goal is to predict the timing of laboratory earthquakes using seismic signals . The data is been taken from an experimental set-up used to study earthquake physics. The acoustic_data input signal is used to predict the time remaining before the next laboratory earthquake (time_to_failure).

Data Overview File descriptions train.csv - A single, continuous training segment of experimental data. test - A collection of many small segments of acoustic data signals. sample_sumbission.csv - According to the sample submission file, we need to predict time to failure for each segment of test data.

Data Pre-Processing - I Raw Data Fields acoustic_data - the seismic signal [int16] time_to_failure - the time (in seconds) until the next laboratory earthquake [float64]

Data Pre-Processing - II Analyze Acoustic Data and “time to failure” Data.

Data Pre-Processing - III Steps for converting Data as X and Y for our prediction model Read all rows between earthquakes and create folds. For chunks of size 150'000,75000,50000,30000, we extract a couple of features and store them in a row of a matrix X. The response is stored in a vector y(which is last time step of each chunk. We move by "stride" positions and repeat steps 2 & 3 until the earthquake happens.

Model Implementation Random Forest - It is an ensemble learning method for classification, regression and other tasks that operates by constructing a multitude of decision trees at training time and outputting the class that is the mode of the classes (classification) or mean prediction (regression) of the individual trees.

#MODEL Trees Strides Mean Absolute Error(MAE) Model-1 500 150000 2.267461 Model-2 2000 2.265719 Model-3 5000 2.264752 Model-4 75000 2.282808 Model-5 2.282194 Model-6 2.282426 Model-7 50000 2.284351 Model-8 2.282518 Model-9 2.282003 Model-10 30000 2.291800 Model-11 2.289706 Model-12 2.289344

Learnings…. Data Pre-Processing is vital to the accuracy of the models. Opting for appropriate machine learning techniques and algorithms to model the system. Plotting data provides useful insights and can lead to better models. Learnings related from ’R’. To solve the error “cannot allocate a vector of size xxx MB ” use memory.limit function to assign more RAM to R kernel.

References https://en.wikipedia.org/wiki/Earthquake https://paperswithcode.com/