Outline Tourism previsioning methods Previsioning in Croatia

Slides:



Advertisements
Similar presentations
Managerial Economics Estimation of Demand
Advertisements

Neural Network I Week 7 1. Team Homework Assignment #9 Read pp. 327 – 334 and the Week 7 slide. Design a neural network for XOR (Exclusive OR) Explore.
SEKE 2014, Hyatt Regency, Vancouver, Canada
Copyright © 2012 Pearson Addison-Wesley. All rights reserved. Chapter 1 What Is Macroeconomics?
Pontifical Catholic University of the Rio Grande do Sul Brazil Applying Artificial Neural Networks to Energy Quality Measurement Fernando Soares dos Reis.
Data Mining Techniques Outline
Prénom Nom Document Analysis: Artificial Neural Networks Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Chapter 12 - Forecasting Forecasting is important in the business decision-making process in which a current choice or decision has future implications:
Forecasting.
Copyright ©2004 Pearson Education, Inc. All rights reserved. Chapter 14 Stock Analysis and Valuation.
Tree-based methods, neutral networks
Prénom Nom Document Analysis: Artificial Neural Networks Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Chapter 3 Forecasting McGraw-Hill/Irwin
Forecasting McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Self-organizing Learning Array based Value System — SOLAR-V Yinyin Liu EE690 Ohio University Spring 2005.
Applied Business Forecasting and Planning
Trade and business statistics: use of administrative data Lunch Seminar Enrico Giovannini Italian National Statistical Institute (ISTAT) New York, February,
NEURAL NETWORKS FOR TECHNICAL ANALYSIS: A STUDY ON KLCI 授課教師:楊婉秀 報告人:李宗霖.
1 Prediction of Software Reliability Using Neural Network and Fuzzy Logic Professor David Rine Seminar Notes.
Slide Image Retrieval: A Preliminary Study Guo Min Liew and Min-Yen Kan National University of Singapore Web IR / NLP Group (WING)
Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009.
Outline What Neural Networks are and why they are desirable Historical background Applications Strengths neural networks and advantages Status N.N and.
NIMIA October 2001, Crema, Italy - Vincenzo Piuri, University of Milan, Italy NEURAL NETWORKS FOR SENSORS AND MEASUREMENT SYSTEMS Part II Vincenzo.
Use of web scraping and text mining techniques in the Istat survey on “Information and Communication Technology in enterprises” Giulio Barcaroli(*), Alessandra.
Outline 1-D regression Least-squares Regression Non-iterative Least-squares Regression Basis Functions Overfitting Validation 2.
Gap filling of eddy fluxes with artificial neural networks
Statistical Tools for Solar Resource Forecasting Vivek Vijay IIT Jodhpur Date: 16/12/2013.
Stefan Mutter, Mark Hall, Eibe Frank University of Freiburg, Germany University of Waikato, New Zealand The 17th Australian Joint Conference on Artificial.
Why Do Countries Use Capital Controls? Prepared by R. Barry Johnston and Natalia T. Tamirisa - December 1998 Presented by: Alyaa Ezzat.
Chapter 16 Data Analysis: Testing for Associations.
CROSS-VALIDATION AND MODEL SELECTION Many Slides are from: Dr. Thomas Jensen -Expedia.com and Prof. Olga Veksler - CS Learning and Computer Vision.
Fault Tolerance Benchmarking. 2 Owerview What is Benchmarking? What is Dependability? What is Dependability Benchmarking? What is the relation between.
Lecture 5 Neural Control
Dr.Abeer Mahmoud ARTIFICIAL INTELLIGENCE (CS 461D) Dr. Abeer Mahmoud Computer science Department Princess Nora University Faculty of Computer & Information.
Downscaling Global Climate Model Forecasts by Using Neural Networks Mark Bailey, Becca Latto, Dr. Nabin Malakar, Dr. Barry Gross, Pedro Placido The City.
Neural Networks for EMC Modeling of Airplanes Vlastimil Koudelka Department of Radio Electronics FEKT BUT Metz,
Seth Kulman Faculty Sponsor: Professor Gordon H. Dash.
LOAD FORECASTING. - ELECTRICAL LOAD FORECASTING IS THE ESTIMATION FOR FUTURE LOAD BY AN INDUSTRY OR UTILITY COMPANY - IT HAS MANY APPLICATIONS INCLUDING.
Introduction and harmonization of the Harmonized Index of Consumer Prices (HICP) in EU A quick overview Twinning project: Activity 4.1- HICP Short term.
HARMONISED CONSUMER PRICE INDEX Nadiežda Alejeva Head, Price Statistics Division Statistics Lithuania WORKSHOP, 13 JULY 2012, YEREVAN 1.
Multi-Area Load Forecasting for System with Large Geographical Area S. Fan, K. Methaprayoon, W. J. Lee Industrial and Commercial Power Systems Technical.
Forecasting. Model with indicator variables The choice of a forecasting technique depends on the components identified in the time series. The techniques.
A Document-Level Sentiment Analysis Approach Using Artificial Neural Network and Sentiment Lexicons Yan Zhu.
Pantelis Pantelidis, University of Piraeus Dimitrios Kyrkilis, University of Macedonia Efthymios Nikolopoulos, University of Macedonia February 2011 The.
Lecture 9 Forecasting. Introduction to Forecasting * * * * * * * * o o o o o o o o Model 1Model 2 Which model performs better? There are many forecasting.
ACCOMMODATION STATISTICS Akhmad Tantowi BPS-Statistics Indonesia Paramaribo, 9 – 11 August 2016 Statistical, Economic and Social Research and Training.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 6: Artificial Neural Networks for Data Mining.
Prepared by Fayes Salma.  Introduction: Financial Tasks  Data Mining process  Methods in Financial Data mining o Neural Network o Decision Tree  Trading.
Sharing of previous experiences on scraping Istat’s experience
How do we measure economic performance?
Quantified perceived and Expected Inflation in the Euro Area
Lecture Presentation Software to accompany Investment Analysis and Portfolio Management Eighth Edition by Frank K. Reilly & Keith C. Brown Chapter 12.
THE MODEL OF KEY COMPETITIVE FACTORS OF GLOBAL CHAIN HOTELS
Regression Analysis Module 3.
Neural Network Models Neural Network Models in Business and Economic Modelling Tourist Segmentation based on Political Instability Results Conclusions.
Advanced information retreival
Artificial Intelligence (CS 370D)
第 3 章 神经网络.
Kaniz Rashid Lubana Mamun MS Student: CSU Hayward Dr. Eric A. Suess
Mid Semester UEB Energy Analysis of a Wastewater Treatment Plant by using Artificial Neural Network Presented by: Dr. Nor Azuana Ramli Electrical Engineering.
The “status” of the Crisis in Europe A General Outlook
Generalization ..
Data Mining Practical Machine Learning Tools and Techniques
Collaborative Filtering Nearest Neighbor Approach
Neural Network Models Neural Network Models in Business and Economic Modelling Tourist Segmentation based on Political Instability Results Conclusions.
Possible complements to improve EU/EA estimation techniques
The Determinants of FDI Inflows to Greece
Kreshna GOPAL C. Prakash KHEDUN Anoop SOHUN
Primer on Neural networks
Presentation transcript:

Outline Tourism previsioning methods Previsioning in Croatia A creative application of algorithms to tourism sector The ANNs models The esperiment: Dataset Materials and methods Results and discussion Conclusion

Motivation and aims

Tourism previsioning variables Tourism influenced by several variables indicating the general lifestyle of a country and by visitors’ perception. Generally, previsioning methods : based on incomplete stats (usually number of night stays/arrivals) Heterogeneous data needed to obtain a better prevision.

Previsioning in Croatia Several actions to make improvements in tourism sector A big part of GDP devoted to environmental protection, to exploit the natural attractions and to prevent pollution. Many data which could help to obtaining efficient previsions. Recent techniques such as the so called sentiment analysis could help in collecting this kind of data (rarely used in tourist demand forecasting).

A creative application of algorithms to tourism sector In this work a creative applications of algorithms to tourism sector, such as the Artificial Neural Networks (ANN). The aim of this study is to use ANN to model and forecast tourism demand in terms of total overnight stays in Croatia, on the basis of data collected for the period 2004-2006. Preliminary results from a joint project (University of Rijeka and University of Milan)

Biological neuron vs ANN Artificial Neural Networks (ANN): computational tools extensively used in several research fields. ANNs keep inspiration by the biological neural networks and they are attractive for processing information with high parallelism, nonlinearity and especially noise tolerance. activation function output treshold weights inputs axon synapse dendrite

ANN for tourism demand forecasting in Croatia Forecasting models usually consist of statistical regressions mainly focused on previous years tourists’ inflow and weather reports. The advantages of adopting ANN: the computational speed and the ability to learn general solutions of presented training data, the absence of the need to develop an explicit model of a process and the ability to model parts of a process that cannot be modelled or are unidentified, the ability to learn from noisy and incomplete data, the adaptability to system changes from initial training model.

The experiment: - materials & methods - the dataset Raffaella Folgieri, 2017 The experiment: - materials & methods - the dataset

The dataset Several sources accessible on the Internet. Statistical data from: the European Union Official statistical website, the Croatian Bureau of Statistics, ISTAT (Italian National Institute for Statistics), Trading Economics (http://www.Tradingeconomics.com), the United Nations Sustainable Development Solution Network. Data collected in different periods, so we considered monthly data from 1st January 2007 and ending up with 31st December 2012.

The dataset: considered variables TempC: average monthly temperature in Croatia, Celsius ; HICP: the inflation in the Euro area, measured by the MUICP (Monetary Union Index of Consumer Prices) index. EORD: percentage of contracts received by orders online, all the commercial activities with more than 10 employees. GDPEP: Percentage of GDP (Gross Domestic Product) reserved for the environmental protection. ECUEUR: ECU/EUR exchange rates versus national currency. FTA - DTA: num. of Foreign Tourists Arrivals-Domestic Tourists Arrivals. GTI: Croatia Terrorism Index 2002-2015. The Global Terrorism Index (GTI) ranks the nations according to terrorist activity. ROH: Ranking Of Happiness. Overnight stays: total overnight stays in Croatian tourist accommodation sector, monthly recorded.

Preliminary analysis of collected data Matrix of Pearson correlations: the more correlated variables with the variable to predict (number of overnight stays in accommodations structures) and less with each other were selected and tested for the input layer. We selected the variables having a significant correlation with the output variable, and not highly correlated with the others.

The ANNs the same for the linear regression. With the aim of obtaining a monthly prediction: one ANN per month the same for the linear regression. Accuracy of the neural network measured by the Mean Squared Error (MSE), average of the squares of the errors or deviations (the difference between the estimator and what is estimated) - a measure of the quality of an estimator. We also performed a linear regression implemented in the software environment R to compare the results obtained from the two methods. With the aim of exploiting the generalization characteristics of the Artificial Neural Networks we included the incomplete data to demonstrate the efficiency of the method.

The ANNs

DATA PREPROCESSING Data normalized to prevent useless results and to avoid the algorithm non-convergence We normalized data through the min-max method and scaled the data in the interval [0,1] (usually tends to give better results). After normalization we splitted the data in training-set and test-set: the first used to train the ANN (training set composed by the 70% of the records of the initial dataset) the second to test the model in the prediction phase. (test-set composed by the remaining 30% of the records of the initial dataset) records randomly selected from the complete dataset.

5-FOLD CROSS VALIDATION Adopted to have a further validation of either the ANN or the linear regression. Goal: to define a dataset to validate the model in the training set in order to limit problems like overfitting, or for assessing how the results of the analysis generalize to the independent dataset.

The experiment: - results & discussion Raffaella Folgieri, 2017 The experiment: - results & discussion

ERROR of the ANNs The error of the ANNs (one per month) performed on all the selected variables of the dataset (for the period going from 1st January 2007 to 31st December 2012) resulted in average 0.0000347. The ANN resulting for the prediction for January and the respective registered prediction errors:

R2 for each Linear Regression Result of the linear regression on the same data: R2 (comparable to the MSE of the ANNs) in average, 1. Comparison between the MSE (Mean Squared Error) values registered for each ANN and the R2 for each Linear Regression performed on the same data (from 1st January 2007 to 31st December 2012): The neural network outperforms linear regression technique in modelling tourists arrivals.

ANNs with 4 variables (1) 4 variables (values from 2002-2014) tempC (average monthly temperature, in degrees Celsius), HICP (Croatian Harmonised Indices of Consumer Prices), GTI (Croatia Terrorism Index), nights (number of monthly nights spent in Croatian touristic accommodation by incoming tourists). In this case the average error (across the monthly datasets) has been about 0.003376 that is higher than 0.0000347 obtained for the ANNs previously trained. This means that the more information per year is collected (despite less information and more years), the more accurate the prediction will be.

Linear regression on 4 variables linear regression on the same data the MSE of the ANNs resulted lower than the R2 of the linear regressions, in average. Comparison between the MSE (Mean Squared Error) values registered for each ANN and the R2 for each Linear Regression performed on the 4-variables dataset (from 1st January 2002 to 31st December 2014): Also with the 4-variables datasets, the neural network outperforms linear regression technique (tourists arrivals pred.)

ANNs on 2 variables Similar results when we considered only the variables temperature and nights (values from 2002 to 2014). The average error has been about 0,02 (much higher than the values obtained for the ANNs previously trained) for the linear regression we obtained an average R2 equal to 0.9 The obtained values were also confirmed also by the 5-fold cross validation procedure

CONCLUSION The results showed that using the Artificial Neural Network model in predicting tourists arrivals is a robust method giving an interesting low error rate and outperforming the linear regression technique. ANN can make long-term prediction on tourist inflow and also a short-term analysis, useful in case of unexpected events influencing tourism. ANN could operate as a Decision Making support (proactive and reactive solutions to improve tourist services). Data coming from a reputation analysis on the Internet, as data from sentiment analysis performed on the social networks, either for the touristic sites, in general, or also on specific accommodation structures can be included in dataset values.

Thank you for your attention! Prof. Raffaella Folgieri Università degli Studi di Milano, Milan, Italy E-mail: raffaella.folgieri@unimi.it