Rami Qahwaji & TufanColak EIMC, University of Bradford BD71DP,

Slides:



Advertisements
Similar presentations
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
Advertisements

11/26/081 AUTOMATIC SOLAR ACTIVITY DETECTION BASED ON IMAGES FROM HSOS NAOC, HSOS YANG Xiao, LIN GangHua
MUSICAL SCALE IDENTIFICATION USING NEURAL NETWORKS -Lyndon Quadros.
Face Recognition & Biometric Systems Support Vector Machines (part 2)
Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.
Machine Learning Neural Networks
Neural Networks II CMPUT 466/551 Nilanjan Ray. Outline Radial basis function network Bayesian neural network.
An introduction to: Deep Learning aka or related to Deep Neural Networks Deep Structural Learning Deep Belief Networks etc,
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
A Study on Feature Selection for Toxicity Prediction*
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Chapter 5 NEURAL NETWORKS
Properties of Machine Learning Applications for Use in Metamorphic Testing Chris Murphy, Gail Kaiser, Lifeng Hu, Leon Wu Columbia University.
Slide 1 EE3J2 Data Mining EE3J2 Data Mining Lecture 15: Introduction to Artificial Neural Networks Martin Russell.
Speaker Adaptation for Vowel Classification
Deep Belief Networks for Spam Filtering
Radial Basis Function Networks 표현아 Computer Science, KAIST.
Chapter 6: Multilayer Neural Networks
1 Automated Feature Abstraction of the fMRI Signal using Neural Network Clustering Techniques Stefan Niculescu and Tom Mitchell Siemens Medical Solutions,
Hazırlayan NEURAL NETWORKS Radial Basis Function Networks I PROF. DR. YUSUF OYSAL.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Text Classification With Labeled and Unlabeled Data Presenter: Aleksandar Milisic Supervisor: Dr. David Albrecht.
Radial Basis Function G.Anuradha.
Radial-Basis Function Networks
Hazırlayan NEURAL NETWORKS Radial Basis Function Networks II PROF. DR. YUSUF OYSAL.
Radial Basis Function Networks
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Attention Deficit Hyperactivity Disorder (ADHD) Student Classification Using Genetic Algorithm and Artificial Neural Network S. Yenaeng 1, S. Saelee 2.
 C. C. Hung, H. Ijaz, E. Jung, and B.-C. Kuo # School of Computing and Software Engineering Southern Polytechnic State University, Marietta, Georgia USA.
Radial Basis Function Networks
Soft Computing Colloquium 2 Selection of neural network, Hybrid neural networks.
Artificial Neural Networks (ANN). Output Y is 1 if at least two of the three inputs are equal to 1.
“Study on Parallel SVM Based on MapReduce” Kuei-Ti Lu 03/12/2015.
Artificial Neural Nets and AI Connectionism Sub symbolic reasoning.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
Radial Basis Function Networks:
An Introduction to Support Vector Machine (SVM) Presenter : Ahey Date : 2007/07/20 The slides are based on lecture notes of Prof. 林智仁 and Daniel Yeung.
Comparison of Bayesian Neural Networks with TMVA classifiers Richa Sharma, Vipin Bhatnagar Panjab University, Chandigarh India-CMS March, 2009 Meeting,
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
Well Log Data Inversion Using Radial Basis Function Network Kou-Yuan Huang, Li-Sheng Weng Department of Computer Science National Chiao Tung University.
Classifiers Given a feature representation for images, how do we learn a model for distinguishing features from different classes? Zebra Non-zebra Decision.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Meng-Han Yang September 9, 2009 A sequence-based hybrid predictor for identifying conformationally ambivalent regions in proteins.
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
1 Lecture 6 Neural Network Training. 2 Neural Network Training Network training is basic to establishing the functional relationship between the inputs.
HMMs and SVMs for Secondary Structure Prediction
Supervised Machine Learning: Classification Techniques Chaleece Sandberg Chris Bradley Kyle Walsh.
Multi-Criteria-based Active Learning for Named Entity Recognition ACL 2004.
What the Long-Term Sunspot Record Tells Us About Space Climate David H. Hathaway NASA/MSFC National Space Science and Technology Center Huntsville, AL,
LOAD FORECASTING. - ELECTRICAL LOAD FORECASTING IS THE ESTIMATION FOR FUTURE LOAD BY AN INDUSTRY OR UTILITY COMPANY - IT HAS MANY APPLICATIONS INCLUDING.
ESWW 6 Rami Qahwaji and Tufan Colak School of Computing, Informatics and Media, Bradford University Richmond Road, Bradford BD7 1DP, England, UK. Tel.
Feasibility of Using Machine Learning Algorithms to Determine Future Price Points of Stocks By: Alexander Dumont.
Deep Learning Overview Sources: workshop-tutorial-final.pdf
Supervised Learning – Network is presented with the input and the desired output. – Uses a set of inputs for which the desired outputs results / classes.
Machine Learning Artificial Neural Networks MPλ ∀ Stergiou Theodoros 1.
Data Mining: Concepts and Techniques1 Prediction Prediction vs. classification Classification predicts categorical class label Prediction predicts continuous-valued.
A Presentation on Adaptive Neuro-Fuzzy Inference System using Particle Swarm Optimization and it’s Application By Sumanta Kundu (En.R.No.
Neural network based hybrid computing model for wind speed prediction K. Gnana Sheela, S.N. Deepa Neurocomputing Volume 122, 25 December 2013, Pages 425–429.
Solar Image Recognition Workshop, Brussels, 23 & 24 Oct. The Detection of Filaments in Solar Images Dr. Rami Qahwaji Department of Electronic Imaging and.
Machine Learning 12. Local Models.
Big data classification using neural network
D. Shaun Bloomfield 1,2, K. Domijan 3, P. A. Higgins 2, P. T
an introduction to: Deep Learning
A Support Vector Machine Approach to Sonar Classification
D. Shaun Bloomfield 1,2, K. Domijan 3, P. A. Higgins 2, P. T
Radial Basis Function G.Anuradha.
network of simple neuron-like computing elements
Introduction to Radial Basis Function Networks
Modeling IDS using hybrid intelligent systems
Presentation transcript:

Rami Qahwaji & TufanColak EIMC, University of Bradford BD71DP, U.K. Automated Short-term Prediction of Solar Flares using Machine Learning

SIPWORKIII 08/09/06 Organisation of this talk  Objectives & related work  Solar data (features and activities)  Data Association  Machine learning algorithms  Practical results  Conclusions and future work

SIPWORKIII 08/09/06 Objective:  We aim to design an automated system that could provide short-term prediction of solar flares by establishing a correlation between sunspots and solar flares using machine learning.

SIPWORKIII 08/09/06 Related Work  Despite the recent advances in solar imaging, machine learning has not been widely applied to solar data, except for verification purposes.  Solar activity (i.e., Wolf Number) was predicted first by (Calvo et al. 1995).  (Borda et al. 2002) described a method for the automatic detection of solar flares using BP MLP.  MLP, SVM and RBF were used for flares detection in (Qu et al. 2003).

SIPWORKIII 08/09/06 Organisation of this talk  Objectives & related work  Solar data (features and activities)  Data Association  Machine learning algorithms  Practical results  Conclusions and future work

SIPWORKIII 08/09/06 Data?  Data from the publicly available National Geophysical Data Centre (NGDC) sunspot groups and flares catalogues are used in our study.  NGDC keeps record of data from several observatories around the world and holds one of the most comprehensive publicly available databases for solar features and activities.

SIPWORKIII 08/09/06 The NGDC sunspots catalogue  The NGDC sunspot catalogue holds records of sunspot groups supplying their date, time, location, physical properties, sunspot area and classification data.  Two classification systems exist for sunspots: McIntosh, which depends on the size, shape and spot density of sunspots, and Mt. Wilson., which is based on the distribution of magnetic polarities within spot groups.

SIPWORKIII 08/09/06 The NGDC Flares catalogue  This catalogue provides information about dates, starting and ending times for flare eruptions, location, NOAA number of the corresponding active region and x-ray classification for the detected flares.  Not all the flares have associated NOAA numbers. Flares without NOAA numbers are not included in our study.

SIPWORKIII 08/09/06 Data

SIPWORKIII 08/09/06 Organisation of this talk  Objectives & related work  Solar data (features and activities)  Data Association and prediction model  Machine learning algorithms  Practical results  Conclusions and future work

SIPWORKIII 08/09/06  We’ve investigated all the sunspot groups that were associated with flares from 01 Jan1992 till 31 Dec  The degree of association was determined based on the NOAA region number and the timing information.  A C++ platform that extracts online flares and sunspots info from NGDC catalogues was created.  Our software has analysed the data related to flares and sunspots and has managed to associate 1425 M and X flares with their corresponding sunspot groups. Associating Flares and Sunspots

SIPWORKIII 08/09/06 Associating Flares and Sunspots

SIPWORKIII 08/09/06 Flare Prediction

SIPWORKIII 08/09/06 The Theoretical Model

SIPWORKIII 08/09/06 Organisation of this talk  Objectives & related work  Solar data (features and activities)  Data Association  Machine learning algorithms  Practical results  Conclusions and future work

SIPWORKIII 08/09/06  Various neural network topologies, support vector machines (SVM) and Radial Basis Function Networks (RBFN) are optimized and compared.  In our previous work (Qahwaji & Colak, CITSA 2006 and Colak & Qahwaji, WSC11) the performance of several NN topologies (i.e., Elman BP, FFBP, cascade FFBP, etc.) was compared and it was concluded that CCNN provides better association between solar flares and sunspot classes.  CCNN and RBFN are used because of their efficient performance in classification and time-series prediction (Frank et al. 1997).

SIPWORKIII 08/09/06  Thank You for Listening SVM vs NN?

SIPWORKIII 08/09/06  It is one of the recent trends in machine learning to compare the performance of SVMs and NNs.  The work reported in (Acir & Guzelis 2004), (Pal & Mather 2004), (Huang et al. 2004), and (Distante et al. 2003) supports this.  Similar performance for SVMs was reported for flares detection in (Qu et al. 2003),

SIPWORKIII 08/09/06 Cascade FFBP  In cascade FFBP, the first layer has connecting weights with the input layer. Each subsequent layer has weights connecting it to the input layer and all previous layers..

SIPWORKIII 08/09/06 SVM (Support Vector Machines) maximises the distance between the closest vectors in both classes to the hyperplane

SIPWORKIII 08/09/06 Radial Basis Function Networks (RBFN)

SIPWORKIII 08/09/06 Optimising the Learning Algorithms  A learning algorithm provides best generalisation if it is optimised.  A NN is optimised if the optimum topology, learning algorithm and learning times are found.  After finding that CCNN provides best performance, we compared 100 different CCNN topologies.  We found that a CCNN with 6 hidden nodes in the first layer and 4 hidden nodes in the second layer gives the best results for CFP and CFTP.  Similar approaches were followed for SVM and RBNN.

SIPWORKIII 08/09/06

SIPWORKIII 08/09/06 Organisation of this talk  Objectives & related work  Solar data (features and activities)  Data Association  Machine learning algorithms  Practical results  Conclusions and future work

SIPWORKIII 08/09/06  Both NGDC catalogues were used and our software has analysed the data related to flares and sunspots and has managed to associate 1425 M and X flares with their corresponding sunspot groups.  The total number of samples used for our training set is 2882, where 1425 samples represent sunspots that produced flares.  The remaining samples represent sunspots that existed in non-flaring days and are not related to any sunspot groups within the previous flaring sunspot samples.

SIPWORKIII 08/09/06 The Training and Testing Sets  The NN training and testing was carried out based on the stati stical Jack-knife technique (Fukunaga 1990).  For all the experiments, 80% of the samples are randomly selected and used for training while the remaining 20% are used for testing. These experiments are repeated for number of times and the average is taken.

SIPWORKIII 08/09/06 Initial Experiments  For each sample, the training vector consists of 5 elements ( 3 for inputs; 2 for outputs).

SIPWORKIII 08/09/06 Initial Experiments  Several experiments based on the Jack-knife technique were carried out and we found that the prediction rate for flares in the best case scenario was 72.9%.  This indicated that a correlation existed between the input and output sets. But this value is not high enough to provide reliable prediction of solar activities.  To improve the learning performance we tried to associate the classified sunspots with the sunspot cycle.

SIPWORKIII 08/09/06  This seemed logical because the rise and fall of solar activity coincides with the sunspot cycle (Pap et al. 1990).  When the solar cycle is at a maximum, plenty of large active regions exist and many solar flares are detected. These decreases in number as the Sun approaches the minimum part of its cycle (Pap et al. 1990).

SIPWORKIII 08/09/06 Solar Cycle and Flares NASA,"Solar Minimum Explodes",

SIPWORKIII 08/09/06 Solar Cycle Modelling-Hathaway’s Model a represents the amplitude and is related to the rise of the cycle minimum, b is related to the time in months from minimum to maximum; c gives the asymmetry of the cycle; and t o denotes the starting time

SIPWORKIII 08/09/06  For each sample, the training vector consists of 6 elements ( 4 for inputs; 2 for outputs).

SIPWORKIII 08/09/06  Hence, for Fkc sunspot at solar maximum that produced an M flare, the training vector looks like this:

SIPWORKIII 08/09/06

SIPWORKIII 08/09/06

SIPWORKIII 08/09/06 Organisation of this talk  Objectives & related work  Solar data (features and activities)  Data Association  Machine learning algorithms  Practical results  Conclusions and future work

SIPWORKIII 08/09/06 Conclusions  A fully automated computer platform that could verify this correlation between sunspot classes and solar flares relation using machine learning, is designed.  The association and learning softwares will become public shortly at  Our findings show that there is a direct relation between the eruptions of flares and certain McIntosh classes of sunspots such as Ekc, Fki and Fkc. Our findings are in accordance with (McIntosh 1990), (Warwick 1966), and (Sakurai 1970). /

SIPWORKIII 08/09/06

SIPWORKIII 08/09/06 A hybrid system, which combines both SVM and CCNN, will give better results for flare prediction.

SIPWORKIII 08/09/06 Future Work  Apply image segmentation and classification algorithms to detect sunspots and classify them automatically, so that the platform is completed.  To track the individual sunspot groups over their lifetime. The development of the sunspot group can contribute to the knowledge of the machine learning systems.  Will better prediction be achieved if the magnetic configuration of sunspots (Mt. Wilson classification) is combined with the sunspot area to replace the McIntosh classification (Sammis, Tang & Zirin, 2000, ApJ)?

SIPWORKIII 08/09/06  To compare our findings with other authors who tested the correlations of the various McIntosh classes on flare rates and the applications to solar flare prediction (e.g. McIntosh 1990; Bornmann & Shaw 1994, Sol. Phys. 150, p. 127; Gallagher et al. 2002, Sol. Phys. 209, p. 171; Wheatland 2004, ApJ 609, p. 1134).

SIPWORKIII 08/09/06  Acknowledgment. This work is supported by an EPSRC Grant (GR/T17588/01), which is entitled “Image Processing and Machine Learning Techniques for Short-Term Prediction of Solar Activity”.